-
CS @ UC Davis
- Davis, CA
- https://alanyannick.github.io
Stars
Integrate the DeepSeek API into popular softwares
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Fully open reproduction of DeepSeek-R1
Janus-Series: Unified Multimodal Understanding and Generation Models
A bibliography and survey of the papers surrounding o1
official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"
[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation
a state-of-the-art-level open visual language model | 多模态预训练模型
SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.
[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors
Official inference repo for FLUX.1 models
dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel a…
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Full-Body Avatars.
[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…
The communications platform that puts data protection first.
🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
ControlNet++: All-in-one ControlNet for image generations and editing!