-
CS @ UC Davis
- Davis, CA
- https://alanyannick.github.io
Stars
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
Janus-Series: Unified Multimodal Understanding and Generation Models
A bibliography and survey of the papers surrounding o1
official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"
a state-of-the-art-level open visual language model | 多模态预训练模型
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Official inference repo for FLUX.1 models
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel a…
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Full-Body Avatars.
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
The communications platform that puts data protection first.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
ControlNet++: All-in-one ControlNet for image generations and editing!
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝
A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…