wh0x

🎯

Focusing

7 followers · 32 following

Starred repositories

bighuang624 / AI-research-tools

🔨 Useful research tools for AI

2,463 353 Updated Jun 10, 2024

Alpha-VLLM / LLaMA2-Accessory

An Open-source Toolkit for LLM Development

Python 2,751 175 Updated Jan 13, 2025

Fannovel16 / comfyui_controlnet_aux

ComfyUI's ControlNet Auxiliary Preprocessors

Python 2,585 228 Updated Oct 28, 2024

ChunmingHe / awesome-diffusion-models-in-low-level-vision

351 8 Updated Jan 22, 2025

apple / ml-mdm

Train high-quality text-to-image diffusion models in a data & compute efficient manner

Python 473 36 Updated Jan 17, 2025

genmoai / mochi

The best OSS video generation models

Python 2,827 291 Updated Jan 8, 2025

unity-research / IP-Adapter-Instruct

IP Adapter Instruct

Python 194 4 Updated Aug 10, 2024

opendatalab / MinerU

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON.

Python 25,293 1,914 Updated Jan 27, 2025

THUDM / GLM-4

GLM-4 series: Open Multilingual Multimodal Chat LMs

Python 5,821 489 Updated Jan 17, 2025

Tencent / HunyuanDiT

Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding

Jupyter Notebook 3,862 325 Updated Jan 13, 2025

mli / paper-reading

Paragraph-by-paragraph close reading of classic and new deep learning papers

28,019 2,493 Updated Nov 17, 2024

Tramac / paper-reading-note

Reading papers with Mu Li

156 23 Updated Jan 8, 2025

BAAI-DCAI / Bunny

A family of lightweight multimodal models.

Python 982 74 Updated Nov 18, 2024

lucidrains / vit-pytorch

Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in PyTorch

Python 21,639 3,160 Updated Jan 19, 2025

mini-sora / minisora

MiniSora: A community that aims to explore the implementation path and future development direction of Sora.

Python 1,251 152 Updated Dec 19, 2024

NUS-HPC-AI-Lab / VideoSys

VideoSys: An easy and efficient system for video generation

Python 1,903 129 Updated Jan 1, 2025

facebookresearch / DiT

Official PyTorch Implementation of "Scalable Diffusion Models with Transformers"

Python 6,773 602 Updated May 31, 2024

CryhanFang / CLIP2Video

Python 239 30 Updated Dec 10, 2022

m-bain / webvid

Large-scale text-video dataset. 10 million captioned short videos.

Python 620 39 Updated Aug 14, 2024

MooreThreads / Moore-AnimateAnyone

Character Animation (AnimateAnyone, Face Reenactment)

Python 3,297 258 Updated May 31, 2024

tyxsspa / AnyText

Official implementation code of the paper <AnyText: Multilingual Visual Text Generation And Editing>

Python 4,503 290 Updated Jun 21, 2024

applenob / Cpp_Primer_Practice

Master C++ 👊. Study repository for C++ Primer, 5th edition (Chinese version), including notes and answers to the end-of-chapter exercises.

C++ 8,133 1,990 Updated Sep 12, 2024

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,808 147 Updated Dec 30, 2024

dvlab-research / LLaMA-VID

LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models (ECCV 2024)

Python 761 45 Updated Jul 29, 2024

bmaltais / kohya_ss

Python 10,061 1,298 Updated Feb 1, 2025

ChenHsing / Awesome-Video-Diffusion-Models

[CSUR] A Survey on Video Diffusion Models

1,928 96 Updated Dec 9, 2024

Stability-AI / generative-models

Generative Models by Stability AI

Python 25,215 2,791 Updated Sep 4, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

13,751 886 Updated Jan 28, 2025

baaivision / CapsFusion

[CVPR 2024] CapsFusion: Rethinking Image-Text Data at Scale

Python 202 5 Updated Feb 27, 2024

opendatalab / VIGC

AAAI 2024: Visual Instruction Generation and Correction

Python 91 3 Updated Feb 4, 2024
