Implementation of Voicebox, a new SOTA text-to-speech network from Meta AI, in PyTorch

Python 633 53 Updated Oct 1, 2024

Foundational Models for State-of-the-Art Speech and Text Translation

Jupyter Notebook 11,258 1,105 Updated Nov 14, 2024

[ICLR24] Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors

Jupyter Notebook 1,563 95 Updated Apr 22, 2024

PyTorch implementation of DiffusionNet for fast and robust learning on 3D surfaces such as meshes or point clouds.

Python 428 56 Updated Aug 8, 2022

[ICCV 2023] StyleGANEX: StyleGAN-Based Manipulation Beyond Cropped Aligned Faces

Jupyter Notebook 516 39 Updated Sep 19, 2023

Tensor library for machine learning

C++ 11,687 1,101 Updated Jan 29, 2025

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

Python 6,729 478 Updated Jan 26, 2025

GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

Python 7,677 606 Updated Jul 25, 2023

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

Python 13,607 1,479 Updated Jan 27, 2025

Adaptive Experimentation Platform

Python 2,422 319 Updated Feb 1, 2025

Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus

Jupyter Notebook 171 38 Updated Dec 6, 2024

Python logging made (stupidly) simple

Python 20,653 715 Updated Jan 23, 2025

Let us control diffusion models!

Python 31,322 2,806 Updated Feb 25, 2024

The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,036 4,833 Updated Jan 31, 2025

CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image

Jupyter Notebook 27,152 3,419 Updated Jul 23, 2024

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,376 445 Updated Mar 14, 2022

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Jupyter Notebook 5,749 1,044 Updated Aug 7, 2024

Bringing Characters to Life with Computer Brains in Unity

C++ 7,977 1,063 Updated Jul 23, 2024

Scaled-YOLOv4: Scaling Cross Stage Partial Network

Python 2,022 572 Updated Nov 3, 2024

Monocular, One-stage, Regression of Multiple 3D People and their 3D positions & trajectories in camera & global coordinates. ROMP[ICCV21], BEV[CVPR22], TRACE[CVPR2023]

Python 1,377 230 Updated Nov 14, 2024

Official implementation of Diffusion Autoencoders

Jupyter Notebook 887 133 Updated Sep 12, 2024

Human Pose Estimation Related Publication

1,348 208 Updated Aug 7, 2020

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,874 4,200 Updated Jan 23, 2025

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,502 437 Updated Jan 3, 2025

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,230 2,840 Updated Jan 31, 2025

Command line utility for forced alignment using Kaldi

Python 1,392 251 Updated Dec 2, 2024

Official PyTorch implementation of "Neural Head Avatars from Monocular RGB Videos"

Python 549 75 Updated Jul 24, 2022

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,679 276 Updated Feb 15, 2023

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 325 44 Updated Feb 21, 2022