Popular repositories
- syscall_intercept Public Forked from pmem/syscall_intercept
The system call intercepting library
Repositories
- Mooncake Public Forked from kvcache-ai/Mooncake
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
- rbg Public Forked from sgl-project/rbg
A workload for deploying LLM inference services on Kubernetes
- checkpoint-engine Public Forked from MoonshotAI/checkpoint-engine
Checkpoint-engine is a simple middleware to update model weights in LLM inference engines
- sglang Public Forked from sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
- DSA-2LM Public
The Artifact Evaluation Version of ATC'25 Paper #395 DSA-2LM: A CPU-Free Tiered Memory Architecture with Intel DSA
- scalio-osdi25-ae Public
This is the repository for our OSDI '25 paper: Scalio: Scaling up DPU-based JBOF Key-value Store with NVMe-oF Target Offload.
- Semi-PD Public Forked from infinigence/Semi-PD
A prefill & decode disaggregated LLM serving framework with shared GPU memory and fine-grained compute isolation.
- deepseekv2-profile Public