DefTruth

Follow

🎯

#pragma unroll

DefTruth DefTruth

🎯

#pragma unroll

Follow

AI Infra Engineer @vipshop, Owner @xlite-dev, Prev @PaddlePaddle🤖

2.1k followers · 183 following

@xlite-dev, @vipshop
Guangzhou, China
09:50 (UTC +08:00)
https://deftruth.github.io

Achievements

Achievements

Organizations

DefTruth/README.md

Pinned Loading

xlite-dev/LeetCUDA xlite-dev/LeetCUDA Public

📚LeetCUDA: Modern CUDA Learn Notes with PyTorch for Beginners🐑, 200+ CUDA Kernels, Tensor Cores, HGEMM, FA-2 MMA.🎉

Cuda 10.8k 1.1k
xlite-dev/lite.ai.toolkit xlite-dev/lite.ai.toolkit Public

🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉

C++ 4.4k 777
PaddlePaddle/FastDeploy PaddlePaddle/FastDeploy Public

High-performance Inference and Deployment Toolkit for LLMs and VLMs based on PaddlePaddle

Python 3.7k 742
sgl-project/sglang sgl-project/sglang Public

SGLang is a high-performance serving framework for large language models and multimodal models.

Python 26.7k 5.6k
vipshop/cache-dit vipshop/cache-dit Public

A PyTorch-native inference engine with cache, parallelism, quantization for Diffusion Transformers.

Python 1.2k 70
xlite-dev/ffpa-attn xlite-dev/ffpa-attn Public

FFPA: Extend FlashAttention-2 with Split-D, ~O(1) SRAM complexity for large headdim, 1.8x~3x↑🎉 vs SDPA.

Cuda 276 16