Skip to content
View alanyannick's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report alanyannick

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 6,169 709 Updated Feb 1, 2025

Fully open reproduction of DeepSeek-R1

Python 14,375 1,128 Updated Jan 31, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 13,703 1,706 Updated Feb 1, 2025

A bibliography and survey of the papers surrounding o1

TeX 1,090 45 Updated Nov 16, 2024

official repo of paper for "CamI2V: Camera-Controlled Image-to-Video Diffusion Model"

Python 91 4 Updated Jan 12, 2025

Inference script for Oasis 500M

Python 1,730 147 Updated Nov 8, 2024
Python 10,041 1,292 Updated Feb 1, 2025

The official repository for paper "Tora: Trajectory-oriented Diffusion Transformer for Video Generation"

Python 1,058 46 Updated Jan 6, 2025

a state-of-the-art-level open visual language model | 多模态预训练模型

Python 6,314 426 Updated May 29, 2024

SwissArmyTransformer is a flexible and powerful library to develop your own Transformer variants.

Python 1,047 98 Updated Dec 26, 2024

[ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors

Python 2,736 222 Updated Sep 8, 2024

Official inference repo for FLUX.1 models

Python 19,868 1,390 Updated Jan 31, 2025

dstack is a lightweight, open-source alternative to Kubernetes & Slurm, simplifying AI container orchestration with multi-cloud & on-prem support. It natively supports NVIDIA, AMD, TPU, and Intel a…

Python 1,660 164 Updated Jan 31, 2025

3D Gaussian Splat Editor

TypeScript 1,790 169 Updated Jan 30, 2025

text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)

Python 10,478 983 Updated Jan 22, 2025

Bring portraits to life!

Python 13,782 1,477 Updated Feb 2, 2025

Goliath Dataset and Official PyTorch Implementation of RelightableHands, Relightable Gaussian Codec Avatars, and Driving-Signal Aware Full-Body Avatars.

Python 242 19 Updated Dec 4, 2024

[ECCV2024] This is an official inference code of the paper "Glyph-ByT5: A Customized Text Encoder for Accurate Visual Text Rendering" and "Glyph-ByT5-v2: A Strong Aesthetic Baseline for Accurate Mu…

Jupyter Notebook 542 23 Updated Jul 13, 2024

The communications platform that puts data protection first.

TypeScript 41,770 11,139 Updated Feb 2, 2025

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,229 242 Updated Mar 5, 2024

Kolors Team

Python 4,136 308 Updated Nov 13, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 14,482 960 Updated Jan 23, 2025
Python 4 Updated Jun 28, 2024
Jupyter Notebook 11 Updated Jun 28, 2024

ControlNet++: All-in-one ControlNet for image generations and editing!

Python 1,852 46 Updated Sep 30, 2024

FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds. AI拟音大师,给你的无声视频添加生动而且同步的音效 😝

Python 515 48 Updated Jul 26, 2024

A curated list of recent diffusion models for video generation, editing, restoration, understanding, etc.

3,874 219 Updated Feb 1, 2025

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editin…

Python 3,212 232 Updated Aug 20, 2024
Next