Peking University
Stars
Google Research
High-Resolution Image Synthesis with Latent Diffusion Models
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…
LAVIS - A One-stop Library for Language-Vision Intelligence
FaceChain is a deep-learning toolchain for generating your Digital-Twin.
Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion
Using Low-rank adaptation to quickly fine-tune diffusion models.
Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models l…
Pytorch🍊🍉 is delicious, just eat it! 😋😋
The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images from an image prompt.
Easily compute clip embeddings and build a clip retrieval system with them
Code Repository for The Kaggle Book, Published by Packt Publishing
Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop
A hands-on, step-by-step practical course on Huggingface Transformers; the course videos are updated in sync on Bilibili and YouTube
Grounded SAM 2: Ground and Track Anything in Videos with Grounding DINO, Florence-2 and SAM 2
Official implementation of Diffusion Autoencoders
Implementation of "Attention Is All You Need"
This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.
Stable Diffusion implemented from scratch in PyTorch
[CVPR 2023] CodeTalker: Speech-Driven 3D Facial Animation with Discrete Motion Prior
Code for "GVHMR: World-Grounded Human Motion Recovery via Gravity-View Coordinates", Siggraph Asia 2024
Layout Generation and Baseline implementations
Jupyter notebook tutorials for MMClassification