Skip to content
View NicholasCao's full-sized avatar
💖
💖

Organizations

@goa-go

Block or report NicholasCao

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

[ECCV 2024] InstructIR: High-Quality Image Restoration Following Human Instructions https://huggingface.co/spaces/marcosv/InstructIR

Jupyter Notebook 580 39 Updated Sep 26, 2024

Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 7,123 430 Updated Jan 9, 2025

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 887 31 Updated Jan 12, 2025
Python 343 27 Updated Jan 16, 2025

Official repository for LTX-Video

Python 2,603 217 Updated Jan 3, 2025

Stable diffusion for inpainting

Python 182 17 Updated Jul 25, 2023

[NeurIPS'23] Emergent Correspondence from Image Diffusion

Python 643 36 Updated May 14, 2024
Python 357 27 Updated Nov 4, 2024

Diffusion Feedback Helps CLIP See Better

Python 242 12 Updated Aug 24, 2024

Repository for code used in the xVal paper

Jupyter Notebook 128 9 Updated Apr 4, 2024

短视频去水印微信小程序

JavaScript 192 58 Updated Sep 19, 2023

Containers for machine learning

Python 8,292 578 Updated Jan 20, 2025

Simply animate your 2D waifu.

Python 98 8 Updated Jul 16, 2024

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

Python 39,024 4,409 Updated Jan 18, 2025

一个高自由度的端到端的可定制AI-VTuber。支持对接哔哩哔哩直播间,以智谱API作为语言基座模型,拥有意图识别、长短期记忆(直接记忆和联想记忆),支持搭建认知库、歌曲作品库,接入了当前热门的一些语音转换、语音合成、图像生成、数字人驱动项目,并提供了一个便于操作的客户端。

Python 352 42 Updated Sep 22, 2024

A reading list of video generation

480 34 Updated Jan 20, 2025

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,000 211 Updated Nov 27, 2024
Python 1,833 128 Updated Nov 8, 2024
Python 125 Updated Oct 9, 2024

🎯 AI 游戏,编织代码、文字,如梦如幻,如诗如歌。

277 120 Updated Nov 15, 2023

文字游戏: 我的文字修仙全靠刷

JavaScript 1,019 135 Updated Oct 5, 2024

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 38,488 4,732 Updated Jan 20, 2025

DDPO for finetuning diffusion models, implemented in PyTorch with LoRA support

Python 465 42 Updated Mar 22, 2024

[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"

Python 186 19 Updated Apr 6, 2024

Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States

Python 1,106 68 Updated Jul 14, 2024

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭…

Python 2,798 429 Updated Nov 11, 2024
Python 223 16 Updated Apr 10, 2024

This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.

Python 11,853 1,043 Updated Jan 20, 2025

InstantStyle: Free Lunch towards Style-Preserving in Text-to-Image Generation 🔥

Jupyter Notebook 1,736 108 Updated Sep 18, 2024

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,427 420 Updated Jan 12, 2025
Next