Skip to content
View Fazziekey's full-sized avatar

Block or report Fazziekey

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Everything you need to build state-of-the-art foundation models, end-to-end.

Python 7,427 530 Updated Feb 28, 2025

Align Anything: Training All-modality Model with Feedback

Python 2,440 337 Updated Feb 28, 2025

GUI Odyssey is a comprehensive dataset for training and evaluating cross-app navigation agents. GUI Odyssey consists of 7,735 episodes from 6 mobile devices, spanning 6 types of cross-app tasks, 20…

Python 90 4 Updated Nov 12, 2024

MambaOut: Do We Really Need Mamba for Vision?

Python 2,135 38 Updated Oct 22, 2024

✨✨VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction

Python 2,121 162 Updated Feb 13, 2025

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 1,934 280 Updated Feb 28, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 3,952 361 Updated Feb 28, 2025
47 Updated Dec 13, 2024
C++ 404 56 Updated Feb 28, 2025

Official inference repo for FLUX.1 models

Python 20,488 1,439 Updated Feb 6, 2025

FastVideo is a lightweight framework for accelerating large video diffusion models.

Python 1,182 69 Updated Feb 28, 2025

Tile primitives for speedy kernels

Cuda 2,088 119 Updated Feb 28, 2025

PyTorch bindings for CUTLASS grouped GEMM.

Cuda 98 31 Updated Jan 2, 2025

[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ult…

Jupyter Notebook 6,742 441 Updated Jan 12, 2025

The paper collections for the autoregressive models in vision.

418 14 Updated Feb 28, 2025

PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.

Python 9,923 1,771 Updated Feb 20, 2025

[ICML 2024 Spotlight] FiT: Flexible Vision Transformer for Diffusion Model

Python 404 11 Updated Nov 10, 2024

nnScaler: Compiling DNN models for Parallel Training

Python 97 13 Updated Feb 14, 2025

Mamba SSM architecture

Python 14,093 1,227 Updated Jan 18, 2025

open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.

Python 3,174 275 Updated Nov 5, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,048 897 Updated Feb 25, 2025

Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 2, and other large language models.

Go 130,203 10,654 Updated Feb 28, 2025

A library for accelerating Transformer models on NVIDIA GPUs, including using 8-bit floating point (FP8) precision on Hopper and Ada GPUs, to provide better performance with lower memory utilizatio…

Python 2,226 367 Updated Feb 28, 2025

A fast MoE impl for PyTorch

Python 1,641 191 Updated Feb 10, 2025

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.

Jupyter Notebook 5,676 362 Updated Jun 28, 2024

Efficient Triton Kernels for LLM Training

Python 4,523 274 Updated Feb 28, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,062 1,106 Updated Feb 28, 2025

🚀 Efficient implementations of state-of-the-art linear attention models in Torch and Triton

Python 2,020 123 Updated Feb 28, 2025
Next