
Starred repositories
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
A free and open source, self hosted Ai based live meeting note taker and minutes summary generator that can completely run in your Local device (Mac OS and windows OS Support added. Working on addi…
A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,T…
Netflix-level subtitle cutting, translation, alignment, and even dubbing - one-click fully automated AI video subtitle team | Netflix级字幕切割、翻译、对齐、甚至加上配音,一键全自动视频搬运AI字幕组
[Support 0.48.x](Reset Cursor AI MachineID & Bypass Higher Token Limit) Cursor Ai ,自动重置机器ID , 免费升级使用Pro功能: You've reached your trial request limit. / Too many free trial accounts used on this machi…
A curated collection of resources, tutorials, and best practices for learning and mastering NVIDIA CUTLASS
AIInfra(AI 基础设施)指AI系统从底层芯片等硬件,到上层软件栈支持AI大模型训练和推理。
Wan: Open and Advanced Large-Scale Video Generative Models
📚A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, FlashAttention, PagedAttention, MLA, Parallelism etc.
📚Modern CUDA Learn Notes: 200+ Tensor/CUDA Cores Kernels🎉, HGEMM, FA2 via MMA and CuTe, 98~100% TFLOPS of cuBLAS/FA2.
DeepEP: an efficient expert-parallel communication library
A simple screen parsing tool towards pure vision based GUI agent
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Development repository for the Triton language and compiler
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
Accessible large language models via k-bit quantization for PyTorch.
The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, MCP compatibility, and more.
Clean, minimal, accessible reproduction of DeepSeek R1-Zero
Janus-Series: Unified Multimodal Understanding and Generation Models
Finetune Llama 4, DeepSeek-R1, Gemma 3 & Reasoning LLMs 2x faster with 70% less memory! 🦥
Making large AI models cheaper, faster and more accessible
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Zero Bubble Pipeline Parallelism
Ongoing research training transformer models at scale
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting yo…