Skip to content
Change the repository type filter

All

    Repositories list

    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      6847.3k350Updated Mar 28, 2025Mar 28, 2025
    • Integrate the DeepSeek API into popular softwares
      Creative Commons Zero v1.0 Universal
      3.3k31k7235Updated Mar 28, 2025Mar 28, 2025
    • DeepGEMM

      Public
      DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling
      Cuda
      MIT License
      5365.1k100Updated Mar 28, 2025Mar 28, 2025
    • 3FS

      Public
      A high-performance distributed file system designed to address the challenges of AI training and inference workloads.
      C++
      MIT License
      8148.4k6514Updated Mar 27, 2025Mar 27, 2025
    • EPLB

      Public
      Expert Parallelism Load Balancer
      Python
      MIT License
      1771.1k31Updated Mar 24, 2025Mar 24, 2025
    • Analyze computation-communication overlap in V3/R1.
      130970110Updated Mar 21, 2025Mar 21, 2025
    • Python
      MIT License
      15k95k9840Updated Mar 16, 2025Mar 16, 2025
    • DualPipe

      Public
      A bidirectional pipeline parallelism algorithm for computation-communication overlap in V3/R1 training.
      Python
      MIT License
      2812.7k30Updated Mar 10, 2025Mar 10, 2025
    • smallpond

      Public
      A lightweight data processing framework built on DuckDB and 3FS.
      Python
      MIT License
      3894.4k196Updated Mar 5, 2025Mar 5, 2025
    • Production-tested AI infrastructure tools for efficient AGI development and community-driven innovation
      Creative Commons Zero v1.0 Universal
      2306.9k00Updated Mar 4, 2025Mar 4, 2025
    • FlashMLA

      Public
      FlashMLA: Efficient MLA decoding kernels
      C++
      MIT License
      81111k401Updated Mar 1, 2025Mar 1, 2025
    • DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
      Python
      MIT License
      1.7k4.6k8116Updated Feb 26, 2025Feb 26, 2025
    • MIT License
      11k88k29139Updated Feb 24, 2025Feb 24, 2025
    • Janus

      Public
      Janus-Series: Unified Multimodal Understanding and Generation Models
      Python
      MIT License
      2.2k17k14225Updated Feb 1, 2025Feb 1, 2025
    • DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
      MIT License
      5084.9k753Updated Sep 25, 2024Sep 25, 2024
    • DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence
      MIT License
      8385.5k482Updated Sep 24, 2024Sep 24, 2024
    • ESFT

      Public
      Expert Specialized Fine-Tuning
      Python
      MIT License
      24459660Updated Sep 22, 2024Sep 22, 2024
    • [ICLR 2024] Official implementation of DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
      Python
      MIT License
      3452.9k340Updated Aug 21, 2024Aug 21, 2024
    • Python
      MIT License
      22748460Updated Aug 16, 2024Aug 16, 2024
    • DeepSeek Coder: Let the Code Write Itself
      Python
      MIT License
      2.4k21k10019Updated May 21, 2024May 21, 2024
    • DeepSeek-VL: Towards Real-World Vision-Language Understanding
      Python
      MIT License
      5553.7k362Updated Apr 24, 2024Apr 24, 2024
    • DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
      Python
      MIT License
      4952.6k321Updated Apr 15, 2024Apr 15, 2024
    • A curated list of open-source projects related to DeepSeek Coder
      19565800Updated Apr 3, 2024Apr 3, 2024
    • DeepSeek LLM: Let there be answers
      Makefile
      MIT License
      9676.2k242Updated Feb 4, 2024Feb 4, 2024
    • DeepSeekMoE: Towards Ultimate Expert Specialization in Mixture-of-Experts Language Models
      Python
      MIT License
      2751.6k154Updated Jan 16, 2024Jan 16, 2024