Skip to content
View devngmo's full-sized avatar
  • VNPT DNI
  • Viet Nam

Block or report devngmo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

27 stars written in Python
Clear filter

Learn how to design large-scale systems. Prep for the system design interview. Includes Anki flashcards.

Python 287,323 47,883 Updated Dec 2, 2024

Robust Speech Recognition via Large-Scale Weak Supervision

Python 75,227 8,992 Updated Jan 4, 2025

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,891 5,118 Updated Oct 10, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch and FLAX.

Python 27,310 5,597 Updated Jan 29, 2025

The no-magic web API and microservices framework for Python developers, with an emphasis on reliability and performance at scale.

Python 9,581 950 Updated Jan 21, 2025

A PyTorch-based Speech Toolkit

Python 9,271 1,424 Updated Jan 22, 2025

Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

Python 8,162 773 Updated Oct 7, 2024

Stable Diffusion web UI

Python 7,890 882 Updated Aug 14, 2024

🔥 2D and 3D Face alignment library build using pytorch

Python 7,189 1,355 Updated Aug 30, 2024

Joint 3D Face Reconstruction and Dense Alignment with Position Map Regression Network (ECCV 2018)

Python 4,981 941 Updated Jul 25, 2022

KAG is a logical form-guided reasoning and retrieval framework based on OpenSPG engine and LLMs. It is used to build logical reasoning and factual Q&A solutions for professional domain knowledge ba…

Python 4,790 300 Updated Jan 22, 2025

Speech-to-Text-WaveNet : End-to-end sentence level English speech recognition based on DeepMind's WaveNet and tensorflow

Python 3,967 792 Updated Oct 8, 2021

🪩 Create Disco Diffusion artworks in one line

Python 3,845 248 Updated May 16, 2023

[CVPR2019] Fast Online Object Tracking and Segmentation: A Unifying Approach

Python 3,481 812 Updated Dec 19, 2023

End-to-end Automatic Speech Recognition for Madarian and English in Tensorflow

Python 2,841 534 Updated Mar 24, 2023

A python package to analyze and compare voices with deep learning

Python 2,835 437 Updated Oct 12, 2023

This is the code for Deformable Neural Radiance Fields, a.k.a. Nerfies.

Python 1,747 232 Updated Apr 22, 2024

Python Fast Dataflow programming framework for Data pipeline work( Web Crawler,Machine Learning,Quantitative Trading.etc)

Python 1,199 102 Updated Dec 31, 2020
Python 1,161 146 Updated Sep 29, 2022

Deep Speaker: an End-to-End Neural Speaker Embedding System.

Python 912 242 Updated Apr 13, 2024

Project Page of 'GANFIT: Generative Adversarial Network Fitting for High Fidelity 3D Face Reconstruction' [CVPR2019]

Python 641 65 Updated Nov 9, 2021

speech to text with self-supervised learning based on wav2vec 2.0 framework

Python 383 115 Updated Nov 22, 2021

fast-stable-diffusion, +25-50% speed increase + memory efficient + DreamBooth

Python 217 11 Updated Jan 18, 2023

Speedml is a Python package to speed start machine learning projects.

Python 211 32 Updated Dec 4, 2019

Reconstructing real-time 3D faces from 2D images using deep learning.

Python 131 35 Updated Mar 22, 2020

Speech Recognition model based off of FAIR research paper built using Pytorch.

Python 83 22 Updated Dec 11, 2018
Python 77 7 Updated Aug 11, 2021