Sun Yat-sen University
- Guangzhou, China
Stars
Writing AI Conference Papers: A Handbook for Beginners
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge managemen…
Fast and memory-efficient exact attention
Official Implementation of EAGLE-1 (ICML'24) and EAGLE-2 (EMNLP'24)
My personal technical blog (Python, Django, Docker, Go, Redis, ElasticSearch, Kafka, Linux)
Cosmos is a world model development platform that consists of world foundation models, tokenizers and video processing pipeline to accelerate the development of Physical AI at Robotics & AV labs. C…
World's First Large-scale High-quality Robotic Manipulation Benchmark
A holistic way of understanding how Llama and its components run in practice, with code and detailed documentation.
Official PyTorch Implementation of Synthesizing Coherent Story with Auto-Regressive Latent Diffusion Models
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RN…
"FlowAR: Scale-wise Autoregressive Image Generation Meets Flow Matching". FlowAR employs the simplest scale design and is compatible with any VAE.
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Simple and efficient PyTorch-native transformer text generation in <1000 LOC of Python.
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
A PyTorch library for implementing flow matching algorithms, featuring continuous and discrete flow matching implementations. It includes practical examples for both text and image modalities.
You can easily calculate FVD, PSNR, SSIM, LPIPS for evaluating the quality of generated or predicted videos.
[CVPR 2024] On the Content Bias in Fréchet Video Distance
🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".
Official PyTorch Implementation for "TokenFlow: Consistent Diffusion Features for Consistent Video Editing" presenting "TokenFlow" (ICLR 2024)