Skip to content
Change the repository type filter

All

    Repositories list

    • VideoMMMU

      Public
      Python
      Other
      13001Updated Feb 23, 2025Feb 23, 2025
    • lmms-eval

      Public
      Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
      Python
      Other
      2092.1k1863Updated Feb 22, 2025Feb 22, 2025
    • .github

      Public
      0100Updated Feb 12, 2025Feb 12, 2025
    • A fork to add multimodal model training to open-r1
      Python
      Apache License 2.0
      47825161Updated Feb 8, 2025Feb 8, 2025
    • Auto Interpretation Pipeline and many other functionalities for Multimodal SAE Analysis.
      Python
      Other
      510500Updated Jan 24, 2025Jan 24, 2025
    • my-python-template

      Public template
      My template repo for setting up a new python repo
      Python
      1000Updated Dec 11, 2024Dec 11, 2024
    • LongVA

      Public
      Long Context Transfer from Language to Vision
      Python
      Apache License 2.0
      19361270Updated Nov 20, 2024Nov 20, 2024
    • demos

      Public
      Python
      0000Updated Sep 18, 2024Sep 18, 2024
    • sglang

      Public
      SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
      Python
      Apache License 2.0
      1k400Updated Sep 18, 2024Sep 18, 2024
    • Otter

      Public
      🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
      Python
      MIT License
      2143.2k612Updated Mar 5, 2024Mar 5, 2024
    • Relate Anything Model is capable of taking an image as input and utilizing SAM to identify the corresponding mask within the image.
      Python
      Apache License 2.0
      2144960Updated Jul 4, 2023Jul 4, 2023