    Repositories list

    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      3164k111 Updated Feb 27, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      4357400 Updated Feb 27, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      MIT License
      941.6k35 Updated Feb 27, 2025
    • Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      865.3k00 Updated Feb 27, 2025
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      4746.3k130 Updated Feb 27, 2025
    • Analyze computation-communication overlap in V3/R1.
      3848331 Updated Feb 27, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient MLA Decoding Kernel for Hopper GPUs
      C++
      MIT License
      66110k332 Updated Feb 27, 2025
    • Integrate the DeepSeek API into popular software
      Creative Commons Zero v1.0 Universal
      2.5k23k5642 Updated Feb 26, 2025
    • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.6k4.2k6715 Updated Feb 26, 2025
    • Python
      MIT License
      14k90k9825 Updated Feb 24, 2025
    • MIT License
      11k83k25738 Updated Feb 24, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k16k12524 Updated Feb 1, 2025
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      4974.8k763 Updated Sep 25, 2024
    • DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      7955.4k452 Updated Sep 24, 2024
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      23755260 Updated Sep 22, 2024
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3392.8k340 Updated Aug 21, 2024
    • Python
      MIT License
      22144440 Updated Aug 16, 2024
    • DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.3k21k9915 Updated May 21, 2024
    • DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      MIT License
      5343.6k352 Updated Apr 24, 2024
    • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      MIT License
      4692.4k301 Updated Apr 15, 2024
    • A curated list of open-source projects related to DeepSeek Coder
      18860000 Updated Apr 3, 2024
    • DeepSeek LLM: Let there be answers
      Makefile
      MIT License
      9266k221 Updated Feb 4, 2024
    • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      MIT License
      2561.5k163 Updated Jan 16, 2024