-
SCUT
- Guangzhou
Lists (32)
Sort Name ascending (A-Z)
AI idol
AIGC
audio
ChatGPTs
Competition
context_len
Contrastive Learning For CV
Controllable Text Generation
CV
data eng
dataaug
dialogue & qa
few-shot
game
interview
LLM
LLM-EVAL
make_money
meta-learning
mllm
multimodal
NLP-Classification
prompt
qa
SAM
SCUT
sd & promptgen for sd
Sentence Embedding
sth else
TTS
video
wxminip
Stars
Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
Stable diffusion for inpainting
[NeurIPS'23] Emergent Correspondence from Image Diffusion
Repository for code used in the xVal paper
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
一个高自由度的端到端的可定制AI-VTuber。支持对接哔哩哔哩直播间,以智谱API作为语言基座模型,拥有意图识别、长短期记忆(直接记忆和联想记忆),支持搭建认知库、歌曲作品库,接入了当前热门的一些语音转换、语音合成、图像生成、数字人驱动项目,并提供了一个便于操作的客户端。
[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥
[NeurIPS 2024 Oral][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-sim…
Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥