OCR Annotations from Amazon Textract for Industry Documents Library

Python 103 6 Updated Aug 20, 2022

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 710 63 Updated Mar 14, 2025

A curated list of awesome TikZ packages and resources

14 3 Updated Jan 1, 2019

Oracle Bone Script data collected by VLRLab of HUST

Python 45 1 Updated Sep 2, 2024

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 15,573 1,837 Updated Apr 15, 2025

[NeurIPS'24] Efficient and accurate memory-saving method for W4A4 large multi-modal models.

Python 71 5 Updated Jan 3, 2025

Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.

Python 709 42 Updated Apr 15, 2025

MoBA: Mixture of Block Attention for Long-Context LLMs

Python 1,736 103 Updated Apr 3, 2025

USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference

Python 472 37 Updated Apr 14, 2025

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal is…

Python 3,596 266 Updated Apr 15, 2025

Automating the Search for Artificial Life with Foundation Models!

Jupyter Notebook 406 44 Updated Jan 12, 2025

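Provides downloadable descriptions and image data files for calligraphy and painting works from successive dynasties, offering training data for AI research on calligraphy and painting; it can also be used for other research related to traditional art.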

37 1 Updated Nov 9, 2022

An Open Large Reasoning Model for Real-World Solutions

Python 1,483 77 Updated Mar 4, 2025
Python 1,351 52 Updated Nov 21, 2024

Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec.

Python 8,078 669 Updated Apr 15, 2025

AllenAI's post-training codebase

Python 2,896 373 Updated Apr 15, 2025

Low Precision Arithmetic Simulation in PyTorch

Python 274 75 Updated May 20, 2024

Pruning the VLLMs

Python 91 4 Updated Dec 9, 2024

LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning

Python 1,950 75 Updated Apr 13, 2025

Making large AI models cheaper, faster and more accessible

Python 40,772 4,492 Updated Apr 15, 2025

Evaluating text-to-image/video/3D models with VQAScore

Python 282 19 Updated Mar 16, 2025

Fast Multimodal LLM on Mobile Devices

C++ 820 93 Updated Mar 21, 2025

High-speed Large Language Model Serving for Local Deployment

C++ 8,170 428 Updated Feb 19, 2025

[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.

Python 156 11 Updated Oct 3, 2024

Next-Token Prediction is All You Need

Python 2,076 79 Updated Mar 17, 2025

Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷

Python 4,192 227 Updated Apr 15, 2025

Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x faster on consumer devices.

Python 261 35 Updated Oct 12, 2024

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models

Python 262 20 Updated Oct 8, 2024