Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…

Python 6,895 418 Updated Jan 9, 2025

ant-research / AniDoc

Official Implementations for Paper - AniDoc: Animation Creation Made Easier

Python 442 29 Updated Dec 31, 2024

Holasyb918 / PersonaTalk_Hack

PersonaTalk Hack

Python 13 Updated Jan 10, 2025

mayuelala / FollowYourEmoji

[Siggraph Asia 2024] Follow-Your-Emoji: This repo is the official implementation of "Follow-Your-Emoji: Fine-Controllable and Expressive Freestyle Portrait Animation"

Python 352 28 Updated Sep 11, 2024

KwaiVGI / SynCamMaster

[ARXIV'24] SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse Viewpoints

Python 438 12 Updated Dec 11, 2024

Chenglin-Yang / 1.58bit.flux

226 1 Updated Dec 31, 2024

linyqh / NarratoAI

利用AI大模型，一键解说并剪辑视频； Using AI models to automatically provide commentary and edit videos with a single click.

Python 3,336 368 Updated Jan 11, 2025

SpatialVision / Orient-Anything

Python 201 7 Updated Dec 30, 2024

wangjiangshan0725 / RF-Solver-Edit

Taming FLUX for Image Inversion & Editing; OpenSora for Video Inversion & Editing! (Official implementation for Taming Rectified Flow for Inversion and Editing.)

Python 361 8 Updated Dec 16, 2024

aim-uofa / Framer

Official PyTorch implementation of "Framer: Interactive Frame Interpolation".

Python 403 18 Updated Jan 9, 2025

LizhenWangT / FaceVerse_v4

This Git offers a faster and easy-to-use 3DMM tracking pipeline with FaceVerse V4 (CVPR 2022), which is a full head model that includes separate eyeballs, teeth, and tongue.

Python 15 Updated Dec 20, 2024

SeldonIO / MLServer

An inference server for your machine learning models, including support for multiple frameworks, multi-model serving and more

Python 753 187 Updated Jan 14, 2025

facebookresearch / flow_matching

A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.

Python 1,769 71 Updated Jan 2, 2025

LituRout / RF-Inversion

Rectified Flow Inversion (RF-Inversion)

322 13 Updated Jan 7, 2025

FoundationVision / Infinity

Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis

Python 868 31 Updated Jan 12, 2025

BUAADreamer / EasyRAG

Easy-to-Use RAG Framework; CCF AIOps International Challenge 2024 Top3 Solution; CCF AIOps 国际挑战赛 2024 季军方案

Python 265 33 Updated Nov 17, 2024

mcmonkeyprojects / SwarmUI

SwarmUI (formerly StableSwarmUI), A Modular Stable Diffusion Web-User-Interface, with an emphasis on making powertools easily accessible, high performance, and extensibility.

C# 1,801 136 Updated Jan 16, 2025

chenyangzhu1 / InstantSwap

[arXiv 2024] InstantSwap: This repo is the official implementation of "InstantSwap: Fast Customized Concept Swapping across Sharp Shape Differences"

34 Updated Dec 3, 2024

Tencent / HunyuanVideo

HunyuanVideo: A Systematic Framework For Large Video Generation Model

Python 7,431 573 Updated Jan 16, 2025

aigc3d / AniGS

AniGS: Animatable Gaussian Avatar from a Single Image with Inconsistent Gaussian Reconstruction

336 27 Updated Jan 10, 2025

cangcz / AnchorCrafter

418 18 Updated Nov 27, 2024

lllyasviel / IC-Light

More relighting!

Python 7,349 433 Updated Nov 28, 2024

NexaAI / nexa-sdk

Nexa SDK is a comprehensive toolkit for supporting GGML and ONNX models. It supports text generation, image generation, vision-language models (VLM), Audio Language Model, auto-speech-recognition (…

Python 4,248 618 Updated Jan 8, 2025

Lightricks / LTX-Video

Official repository for LTX-Video

Python 2,567 207 Updated Jan 3, 2025

NVlabs / Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

Python 2,677 154 Updated Jan 16, 2025

zbdehh

Lists (7)

AboutBody

Audio

FaceTool

ForVirtualLife

NERF,3D,VISION

TalkFace

Tools

Stars