Skip to content
View Aierhaimian's full-sized avatar

Block or report Aierhaimian

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

AutoMQ is a cloud-native alternative to Kafka by decoupling durability to cloud storage services like S3. 10x Cost-Effective. No Cross-AZ Traffic Cost. Autoscale in seconds. Single-digit ms latency…

Java 5,408 366 Updated Apr 17, 2025

Port of Facebook's LLaMA model in C/C++

C++ 45 9 Updated Apr 13, 2025

LLM inference in C/C++

C++ 78,300 11,432 Updated Apr 17, 2025

📚FFPA(Split-D): Yet another Faster Flash Attention with O(1) GPU SRAM complexity large headdim, 1.8x~3x↑🎉 faster than SDPA EA.

Cuda 168 7 Updated Apr 6, 2025

📚Modern CUDA Learn Notes: 200+ Tensor/CUDA Cores Kernels🎉, HGEMM, FA2 via MMA and CuTe, 98~100% TFLOPS of cuBLAS/FA2.

Cuda 3,469 375 Updated Apr 15, 2025

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 165 5 Updated Apr 3, 2025

The cross-platform open-source app built for handwriting

Dart 2,979 187 Updated Apr 16, 2025

🚀 「大模型」1小时从0训练26M参数的视觉多模态VLM!🌏 Train a 26M-parameter VLM from scratch in just 1 hours!

Python 2,664 277 Updated Apr 10, 2025

A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on addi…

C++ 3,553 244 Updated Apr 16, 2025

10 Lessons to Get Started Building AI Agents

Jupyter Notebook 15,467 3,730 Updated Apr 14, 2025

A Python native, OS native GUI toolkit.

Python 4,805 708 Updated Apr 14, 2025

RF-DETR is a real-time object detection model architecture developed by Roboflow, SOTA on COCO & designed for fine-tuning.

Python 1,888 183 Updated Apr 17, 2025

This project aims to achieve good results in handwritten mathematical formulas, printed formulas, complex formula samples, or comprehensive optical character recognition tasks.

Python 21 5 Updated Mar 10, 2025

Modular hardware build system

Python 974 99 Updated Apr 17, 2025

Masked Generative Distillation (ECCV 2022)

Python 221 23 Updated Nov 9, 2022

NVIDIA Isaac GR00T N1 is the world's first open foundation model for generalized humanoid robot reasoning and skills.

Jupyter Notebook 3,417 419 Updated Apr 16, 2025

verl: Volcano Engine Reinforcement Learning for LLMs

Python 6,771 733 Updated Apr 17, 2025

The macOS & iOS file archiver

PHP 5,348 253 Updated Apr 8, 2025

Official PyTorch implementation of the paper "Dataset Distillation with Neural Characteristic Function: A Minmax Perspective" (NCFM) in CVPR 2025 (Highlight).

Python 329 19 Updated Apr 11, 2025

一键安装程序,欢迎大家提交代码和小鱼一起一键安装停止浪费生命

Python 1,965 226 Updated Jan 21, 2025

Janus-Series: Unified Multimodal Understanding and Generation Models

Python 17,125 2,229 Updated Feb 1, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,222 563 Updated Apr 16, 2025

Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation

7,644 266 Updated Apr 14, 2025

DeepEP: an efficient expert-parallel communication library

Cuda 7,443 713 Updated Apr 16, 2025

FlashMLA: Efficient MLA decoding kernels

C++ 11,441 822 Updated Mar 1, 2025

A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations

Python 13,575 952 Updated Apr 17, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen2.5, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3…

Python 6,998 597 Updated Apr 17, 2025

A Python-embedded modeling language for convex optimization problems.

C++ 5,718 1,091 Updated Apr 15, 2025

A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )

C++ 1,659 237 Updated Feb 10, 2025

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,715 1,715 Updated Feb 26, 2025
Next