-
SCUT
- Guangzhou
Lists (32)
Sort Name descending (Z-A)
wxminip
video
TTS
sth else
Sentence Embedding
sd & promptgen for sd
SCUT
SAM
qa
prompt
NLP-Classification
multimodal
mllm
meta-learning
make_money
LLM-EVAL
LLM
interview
game
few-shot
dialogue & qa
dataaug
data eng
CV
Controllable Text Generation
Contrastive Learning For CV
context_len
Competition
ChatGPTs
audio
AIGC
AI idol
Stars
[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Stable diffusion for inpainting
[NeurIPS'23] Emergent Correspondence from Image Diffusion
Repository for code used in the xVal paper
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
一个高自由度的端到端的可定制AI-VTuber。支持对接哔哩哔哩直播间,以智谱API作为语言基座模型,拥有意图识别、长短期记忆(直接记忆和联想记忆),支持搭建认知库、歌曲作品库,接入了当前热门的一些语音转换、语音合成、图像生成、数字人驱动项目,并提供了一个便于操作的客户端。
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…