The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…

Python 33,036 4,833 Updated Jan 31, 2025

openai / CLIP

CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

Jupyter Notebook 27,152 3,419 Updated Jul 23, 2024

mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding a…

Python 2,376 445 Updated Mar 14, 2022

meituan / YOLOv6

YOLOv6: a single-stage object detection framework dedicated to industrial applications.

Jupyter Notebook 5,749 1,044 Updated Aug 7, 2024

sebastianstarke / AI4Animation

Bringing Characters to Life with Computer Brains in Unity

C++ 7,977 1,063 Updated Jul 23, 2024

WongKinYiu / ScaledYOLOv4

Scaled-YOLOv4: Scaling Cross Stage Partial Network

Python 2,022 572 Updated Nov 3, 2024

Arthur151 / ROMP

Monocular, One-stage, Regression of Multiple 3D People and their 3D positions & trajectories in camera & global coordinates. ROMP[ICCV21], BEV[CVPR22], TRACE[CVPR2023]

Python 1,377 230 Updated Nov 14, 2024

phizaz / diffae

Official implementation of Diffusion Autoencoders

Jupyter Notebook 887 133 Updated Sep 12, 2024

tensorboy / pytorch_Realtime_Multi-Person_Pose_Estimation

Python 1,370 411 Updated Feb 7, 2023

wangzheallen / awesome-human-pose-estimation

Human Pose Estimation Related Publication

1,348 208 Updated Aug 7, 2020

Tencent / ncnn

ncnn is a high-performance neural network inference framework optimized for the mobile platform

C++ 20,874 4,200 Updated Jan 23, 2025

LCAV / pyroomacoustics

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

Python 1,502 437 Updated Jan 3, 2025

lutzroeder / netron

Visualizer for neural network, deep learning and machine learning models

JavaScript 29,230 2,840 Updated Jan 31, 2025

MontrealCorpusTools / Montreal-Forced-Aligner

Command line utility for forced alignment using Kaldi

Python 1,392 251 Updated Dec 2, 2024

philgras / neural-head-avatars

Official PyTorch implementation of "Neural Head Avatars from Monocular RGB Videos"

Python 549 75 Updated Jul 24, 2022

rosinality / vq-vae-2-pytorch

Implementation of Generating Diverse High-Fidelity Images with VQ-VAE-2 in PyTorch

Python 1,679 276 Updated Feb 15, 2023

keonlee9420 / DiffGAN-TTS

PyTorch Implementation of DiffGAN-TTS: High-Fidelity and Efficient Text-to-Speech with Denoising Diffusion GANs

Python 325 44 Updated Feb 21, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Charles cshang-rbx

Block or report cshang-rbx

Stars

lucidrains / voicebox-pytorch

facebookresearch / seamless_communication

guochengqian / Magic123

nmwsharp / diffusion-net

williamyang1991 / StyleGANEX

ggerganov / ggml

InternLM / InternLM

THUDM / GLM-130B

m-bain / whisperX

facebook / Ax

harvard-edge / multilingual_kws

Delgan / loguru

lllyasviel / ControlNet

huggingface / pytorch-image-models