Lists (23)
Sort Name ascending (A-Z)
3dparty
常用第三方工具包agent
agent workflowCodingCook
CudaCook
DataProcess
DeployKit
DialogueKit
DigitHuman
InforExtract
LatextCook
LLMs
大模型相关MLsysCook
Others
Prompt
优化提示词RAG
Recommend
RL-cook
TextEmbedding
TextMatch
TextSimilar
TextTool
TopicExtract
TTS
Stars
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Apache ECharts is a powerful, interactive charting and data visualization library for browser
Finetune Llama 3.3, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
NumPy aware dynamic Python compiler using LLVM
Wan: Open and Advanced Large-Scale Video Generative Models
No fortress, purely open ground. OpenManus is Coming.
Open Source Deep Research Alternative to Reason and Search on Private Data. Written in Python.
DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)
Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
flash attention tutorial written in python, triton, cuda, cutlass
MoBA: Mixture of Block Attention for Long-Context LLMs
one for all free music in china (origin edition)
This repository contains the Hugging Face Agents Course.
Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy
Efficient Triton Kernels for LLM Training
Distilabel is a framework for synthetic data and AI feedback for engineers who need fast, reliable and scalable pipelines based on verified research papers.
A course on aligning smol models.
Fully open reproduction of DeepSeek-R1
Clean, minimal, accessible reproduction of DeepSeek R1-Zero