Highlights
- Pro
Lists (5)
Sort Name ascending (A-Z)
Starred repositories
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Code release for "SegLLM: Multi-round Reasoning Segmentation"
A minimal and universal controller for FLUX.1.
Memory-optimized training scripts for video models based on Diffusers
Official implementation of "DepthLab: From Partial to Complete"
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
Official implementation for paper - LeviTor: 3D Trajectory Oriented Image-to-Video Synthesis
A generative world for general-purpose robotics & embodied AI learning.
Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
Official Implementations for Paper - AniDoc: Animation Creation Made Easier
NOVA: Autoregressive Video Generation without Vector Quantization
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Official PyTorch implementation of Lumos: Learning Visual Generative Priors without Text
You See it, You Got it: Learning 3D Creation on Pose-Free Videos at Scale
ASCII generator (image to text, image to image, video to video)
Official Implementations for Paper - MagicQuill: An Intelligent Interactive Image Editing System
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
提取微信聊天记录,将其导出成HTML、Word、Excel文档永久保存,对聊天记录进行分析生成年度聊天报告,用聊天数据训练专属于个人的AI聊天助手
A comprehensive list of papers for the definition of World Models and using World Models for General Video Generation, Embodied AI, and Autonomous Driving, including papers, codes, and related webs…
🚀 Efficiently (pre)training foundation models with native PyTorch features, including FSDP for training and SDPA implementation of Flash attention v2.
Agent S: an open agentic framework that uses computers like a human
Tencent Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation
A suite of image and video neural tokenizers