Skip to content
@IST-DASLab

IST Austria Distributed Algorithms and Systems Lab

Popular repositories Loading

  1. gptq gptq Public

    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".

    Python 2.2k 182

  2. marlin marlin Public

    FP16xINT4 LLM inference kernel that can achieve near-ideal ~4x speedups up to medium batchsizes of 16-32 tokens.

    Python 905 75

  3. sparsegpt sparsegpt Public

    Code for the ICML 2023 paper "SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot".

    Python 840 112

  4. PanzaMail PanzaMail Public

    Python 294 19

  5. qmoe qmoe Public

    Code for the paper "QMoE: Practical Sub-1-Bit Compression of Trillion-Parameter Models".

    Python 277 23

  6. QUIK QUIK Public

    Repository for the QUIK project, enabling the use of 4bit kernels for generative inference - EMNLP 2024

    C++ 182 13

Repositories

Showing 10 of 65 repositories
  • llmq Public

    Quantized LLM training in pure CUDA/C++.

    IST-DASLab/llmq’s past year of commit activity
    C++ 179 7 0 0 Updated Oct 7, 2025
  • FP-Quant Public
    IST-DASLab/FP-Quant’s past year of commit activity
    Python 48 6 5 1 Updated Oct 6, 2025
  • qutlass Public

    QuTLASS: CUTLASS-Powered Quantized BLAS for Deep Learning

    IST-DASLab/qutlass’s past year of commit activity
    C++ 111 Apache-2.0 9 0 1 Updated Oct 1, 2025
  • IST-DASLab/ISTA-DASLab-Optimizers’s past year of commit activity
    Python 9 Apache-2.0 0 0 0 Updated Sep 26, 2025
  • gptq-gguf-toolkit Public

    GPTQ and efficient search for GGUF

    IST-DASLab/gptq-gguf-toolkit’s past year of commit activity
    Python 51 4 0 1 Updated Sep 17, 2025
  • Quartet Public
    IST-DASLab/Quartet’s past year of commit activity
    Jupyter Notebook 99 MIT 10 5 0 Updated Aug 24, 2025
  • EvoPress Public
    IST-DASLab/EvoPress’s past year of commit activity
    Python 33 2 0 0 Updated Jul 30, 2025
  • QuEST Public

    Work in progress.

    IST-DASLab/QuEST’s past year of commit activity
    Jupyter Notebook 74 MIT 6 2 0 Updated Jun 29, 2025
  • Yolov8-Pose-Detection-on-Browser Public Forked from akbartus/Yolov8-Pose-Detection-on-Browser

    Example of YOLOv8 pose detection (estimation) on browser. It shows implementations powered by ONNX and TFJS served through JavaScript without any frameworks. It demonstrates pose detection (estimation) on image as well as live web camera,

    IST-DASLab/Yolov8-Pose-Detection-on-Browser’s past year of commit activity
    HTML 0 MIT 3 0 0 Updated Jun 13, 2025
  • MoE-Quant Public

    Code for data-aware compression of DeepSeek models

    IST-DASLab/MoE-Quant’s past year of commit activity
    Python 55 9 2 0 Updated Jun 10, 2025

Most used topics

Loading…