Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
56 changes: 55 additions & 1 deletion docs/flagrelease_en/model_list.txt
Original file line number Diff line number Diff line change
@@ -1,3 +1,6 @@
FlagRelease/BAAI-Cardiac-Agent-hygon-FlagOS
FlagRelease/C2S-Scale-Gemma-2-27B-hygon-FlagOS
FlagRelease/C2S-Scale-Gemma-2-27B-nvidia-FlagOS
FlagRelease/DeepSeek-R1-Distill-Qwen-32B-FlagOS-Cambricon
FlagRelease/DeepSeek-R1-Distill-Qwen-32B-FlagOS-NVIDIA
FlagRelease/DeepSeek-R1-FlagOS-Cambricon-BF16
Expand All @@ -7,14 +10,39 @@ FlagRelease/DeepSeek-R1-FlagOS-Metax-BF16
FlagRelease/DeepSeek-R1-FlagOS-Nvidia-BF16
FlagRelease/DeepSeek-R1-INT4-FlagOS-Iluvatar
FlagRelease/DeepSeek-V3.2-Exp-FlagOS
FlagRelease/DeepSeek-V4-Flash-ascend-FlagOS
FlagRelease/DeepSeek-V4-Flash-hygon-FlagOS
FlagRelease/DeepSeek-V4-Flash-iluvatar-FlagOS
FlagRelease/DeepSeek-V4-Flash-kunlunxin-FlagOS
FlagRelease/DeepSeek-V4-Flash-metax-FlagOS
FlagRelease/DeepSeek-V4-Flash-mthreads-FlagOS
FlagRelease/DeepSeek-V4-Flash-nvidia-FlagOS
FlagRelease/DeepSeek-V4-Flash-sunrise-FlagOS
FlagRelease/DeepSeek-V4-Flash-tsingmicro-FlagOS
FlagRelease/DeepSeek-V4-Flash-zhenwu-FlagOS
FlagRelease/DeepSeek-V4-Pro-ascend-FlagOS
FlagRelease/DeepSeek-V4-Pro-hygon-FlagOS
FlagRelease/DeepSeek-V4-Pro-metax-FlagOS
FlagRelease/DeepSeek-V4-Pro-mthreads-FlagOS
FlagRelease/DeepSeek-V4-Pro-nvidia-FlagOS
FlagRelease/ERNIE-4.5-300B-A47B-PT-FlagOS
FlagRelease/Emu3.5-FlagOS
FlagRelease/GLM-4.5-FlagOS
FlagRelease/GLM-5-FP8-FlagOS
FlagRelease/GLM-5-ascend-FlagOS
FlagRelease/HY-MT2-1.8B-ascend-FlagOS
FlagRelease/HY-MT2-1.8B-nvidia-FlagOS
FlagRelease/HY-MT2-1.8B-zhenwu-FlagOS
FlagRelease/HY-MT2-30B-A3B-ascend-FlagOS
FlagRelease/HY-MT2-30B-A3B-nvidia-FlagOS
FlagRelease/HY-MT2-30B-A3B-zhenwu-FlagOS
FlagRelease/HY-MT2-7B-ascend-FlagOS
FlagRelease/HY-MT2-7B-nvidia-FlagOS
FlagRelease/HY-MT2-7B-zhenwu-FlagOS
FlagRelease/Hunyuan-A13B-Instruct-FlagOS
FlagRelease/Kimi-K2-Instruct-FlagOS
FlagRelease/Kimi-K2-Thinking-FlagOS
FlagRelease/Kimi-Linear-48B-A3B-Instruct-nvidia-FlagOS
FlagRelease/MiniCPM-V-4-FlagOS
FlagRelease/MiniCPM-V-4-metax-FlagOS
FlagRelease/MiniCPM-o-4.5-ascend-FlagOS
Expand All @@ -23,10 +51,25 @@ FlagRelease/MiniCPM-o-4.5-iluvatar-FlagOS
FlagRelease/MiniCPM-o-4.5-metax-FlagOS
FlagRelease/MiniCPM-o-4.5-nvidia-FlagOS
FlagRelease/MiniCPM-o-4.5-zhenwu-FlagOS
FlagRelease/MiniCPM5-1B-Armv9-FlagOS
FlagRelease/MiniCPM5-1B-ascend-FlagOS
FlagRelease/MiniCPM5-1B-hygon-FlagOS
FlagRelease/MiniCPM5-1B-iluvatar-FlagOS
FlagRelease/MiniCPM5-1B-kunlunxin-FlagOS
FlagRelease/MiniCPM5-1B-metax-FlagOS
FlagRelease/MiniCPM5-1B-mthreads-FlagOS
FlagRelease/MiniCPM5-1B-nvidia-FlagOS
FlagRelease/MiniCPM5-1B-zhenwu-FlagOS
FlagRelease/MiniCPM_o_2.6-FlagOS-Cambricon
FlagRelease/MiniCPM_o_2.6-FlagOS-NVIDIA
FlagRelease/MiniMax-M1-80k-FlagOS
FlagRelease/MiniMax-M2-FlagOS
FlagRelease/MiniMax-M2.7-ascend-FlagOS
FlagRelease/MiniMax-M2.7-hygon-FlagOS
FlagRelease/MiniMax-M2.7-iluvatar-FlagOS
FlagRelease/MiniMax-M2.7-metax-FlagOS
FlagRelease/MiniMax-M2.7-nvidia-FlagOS
FlagRelease/MiniMax-M2.7-zhenwu-FlagOS
FlagRelease/QwQ-32B-FlagOS-Cambricon
FlagRelease/QwQ-32B-FlagOS-Iluvatar
FlagRelease/QwQ-32B-FlagOS-Nvidia
Expand All @@ -51,7 +94,7 @@ FlagRelease/Qwen3-4B-hygon-FlagOS
FlagRelease/Qwen3-8B-FlagOS
FlagRelease/Qwen3-8B-iluvatar-FlagOS
FlagRelease/Qwen3-8B-metax-FlagOS
FlagRelease/Qwen3-8B-mthreads-FlagOS
FlagRelease/Qwen3-Next-80B-A3B-FlagOS
FlagRelease/Qwen3-Next-80B-A3B-Instruct-FlagOS
FlagRelease/Qwen3-Next-80B-A3B-Instruct-metax-FlagOS
FlagRelease/Qwen3-Omni-30B-A3B-Instruct-FlagOS
Expand All @@ -62,6 +105,13 @@ FlagRelease/Qwen3.5-35B-A3B-iluvatar-FlagOS
FlagRelease/Qwen3.5-397B-A17B-metax-FlagOS
FlagRelease/Qwen3.5-397B-A17B-nvidia-FlagOS
FlagRelease/Qwen3.5-397B-A17B-zhenwu-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-ascend-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-hygon-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-iluvatar-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-kunlunxin-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-metax-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-nvidia-FlagOS
FlagRelease/Qwen3.6-35B-A3B-nomtp-zhenwu-FlagOS
FlagRelease/RoboBrain-X0
FlagRelease/RoboBrain-X0-Preview-FlagOS
FlagRelease/RoboBrain-X0-Preview-ascend-FlagOS
Expand All @@ -76,8 +126,12 @@ FlagRelease/RoboBrain2.5-8B-FlagOS
FlagRelease/RoboBrain2.5-8B-ascend-FlagOS
FlagRelease/Seed-OSS-36B-Instruct-FlagOS
FlagRelease/TeleChat3-36B-Thinking-mthreads-FlagOS
FlagRelease/farm_molecular_representation-hygon-FlagOS
FlagRelease/farm_molecular_representation-nvidia-FlagOS
FlagRelease/gpt-oss-120b-FlagOS
FlagRelease/grok-2-FlagOS
FlagRelease/materials.smi-ted-hygon-FlagOS
FlagRelease/materials.smi-ted-nvidia-FlagOS
FlagRelease/phi-4-FlagOS
FlagRelease/phi-4-hygon-FlagOS
FlagRelease/phi-4-metax-FlagOS
Expand Down
Original file line number Diff line number Diff line change
@@ -0,0 +1,122 @@
---
base_model:
- ""
language:
- zh
- en
license: apache-2.0
---

# Introduction
BAAI-Cardiac-Agent is an intelligent cardiac MRI analysis system developed by the Beijing Academy of Artificial Intelligence (BAAI). It adopts an Agent-Expert architecture: a multimodal Agent built on a fine-tuned LLaVA model serves as the core for task understanding and scheduling, while multiple specialized deep learning models act as Expert Workers to carry out specific analyses. The system supports DICOM/NIfTI image input and automates the full pipeline including cardiac sequence identification, structural segmentation, disease screening (three categories), cardiomyopathy subtype classification (five subtypes), cardiac quantitative metric calculation, and PDF report generation.

### Integrated Deployment
- Out-of-the-box inference scripts with pre-configured hardware and software parameters
- Released **FlagOS-Hygon** container image supporting deployment within minutes
### Consistency Validation
- Rigorously evaluated through benchmark testing: Performance and results from the FlagOS software stack are compared against native stacks on multiple public.

# Evaluation Results

## Benchmark Result
Overall Functions: Passed

Sub-functions: All consistency metrics passed

Diagnosis: The two are consistent


| Metrics | BAAI-Cardiac-Agent-Nvidia-Origin | BAAI-Cardiac-Agent-Hygon-FlagOS |
|--------------|--------------------------------------|--------------------------------------|
| LVEF | 31.7 | 31.7 |
| LVEDV |155.5 | 155.5 |
| LVESV | 106.1 | 106.1 |
| SV | 49.4 | 49.4 |
| LVM | 157.2 | 157.2 |
| LVEDD | 49.2 | 49.2 |



# User Guide
Environment Setup

| Item | Version |
|------------------|----------------------|
| Docker Version | Docker version 28.2.2, build 28.2.2-0ubuntu1~22.04.1 |
| Operating System | Ubuntu 22.04.4 LTS (Jammy Jellyfish) |

## Operation Steps

### Download FlagOS Image
```bash
docker pull harbor.baai.ac.cn/flagrelease-public/cardiac-agent-hygon:2026052515
```

### Download Open-source Model Weights
```bash
pip install modelscope
modelscope download --model FlagRelease/BAAI-Cardiac-Agent-hygon-FlagOS --local_dir /data/BAAI-Cardiac-Agent
```

### Start the Container
```bash
docker run -itd --name flagos --network host --ipc host --privileged --shm-size=32g -v /data:/data harbor.baai.ac.cn/flagrelease-public/cardiac-agent-hygon:2026052515 /bin/bash
docker exec -it flagos /bin/bash
```

## Service Invocation
### Invocation Script
```bash
cd /workspace/Cardiac-Agent && bash app/start_hygon.sh full
```


### AnythingLLM Integration Guide

#### 1. Download & Install

- Visit the official site: https://anythingllm.com/
- Choose the appropriate version for your OS (Windows/macOS/Linux)
- Follow the installation wizard to complete the setup

#### 2. Configuration

- Launch AnythingLLM
- Open settings (bottom left, fourth tab)
- Configure core LLM parameters
- Click "Save Settings" to apply changes

#### 3. Model Interaction

- After model loading is complete:
- Click **"New Conversation"**
- Enter your question (e.g., “Explain the basics of quantum computing”)
- Click the send button to get a response
# Technical Overview
**FlagOS** is a fully open-source system software stack designed to unify the "model–system–chip" layers and foster an open, collaborative ecosystem. It enables a “develop once, run anywhere” workflow across diverse AI accelerators, unlocking hardware performance, eliminating fragmentation among vendor-specific software stacks, and substantially lowering the cost of porting and maintaining AI workloads. With core technologies such as the **FlagScale**, together with vllm-plugin-fl, distributed training/inference framework, **FlagGems** universal operator library, **FlagCX** communication library, and **FlagTree** unified compiler, the **FlagRelease** platform leverages the **FlagOS** stack to automatically produce and release various combinations of \<chip + open-source model\>. This enables efficient and automated model migration across diverse chips, opening a new chapter for large model deployment and application.
## FlagGems
FlagGems is a high-performance, generic operator libraryimplemented in [Triton](https://github.com/openai/triton) language. It is built on a collection of backend-neutralkernels that aims to accelerate LLM (Large-Language Models) training and inference across diverse hardware platforms.
## FlagTree
FlagTree is an open source, unified compiler for multipleAI chips project dedicated to developing a diverse ecosystem of AI chip compilers and related tooling platforms, thereby fostering and strengthening the upstream and downstream Triton ecosystem. Currently in its initial phase, the project aims to maintain compatibility with existing adaptation solutions while unifying the codebase to rapidly implement single-repository multi-backend support. Forupstream model users, it provides unified compilation capabilities across multiple backends; for downstream chip manufacturers, it offers examples of Triton ecosystem integration.
## FlagScale and vllm-plugin-fl
Flagscale is a comprehensive toolkit designed to supportthe entire lifecycle of large models. It builds on the strengths of several prominent open-source projects, including [Megatron-LM](https://github.com/NVIDIA/Megatron-LM) and [vLLM](https://github.com/vllm-project/vllm), to provide a robust, end-to-end solution for managing and scaling large models.
vllm-plugin-fl is a vLLM plugin built on the FlagOS unified multi-chip backend, to help flagscale support multi-chip on vllm framework.
## **FlagCX**
FlagCX is a scalable and adaptive cross-chip communication library. It serves as a platform where developers, researchers, and AI engineers can collaborate on various projects, contribute to the development of cutting-edge AI solutions, and share their work with the global community.

## **FlagEval Evaluation Framework**
FlagEval is a comprehensive evaluation system and open platform for large models launched in 2023. It aims to establish scientific, fair, and open benchmarks, methodologies, and tools to help researchers assess model and training algorithm performance. It features:
- **Multi-dimensional Evaluation**: Supports 800+ modelevaluations across NLP, CV, Audio, and Multimodal fields,covering 20+ downstream tasks including language understanding and image-text generation.
- **Industry-Grade Use Cases**: Has completed horizonta1 evaluations of mainstream large models, providing authoritative benchmarks for chip-model performance validation.

# Contributing

We warmly welcome global developers to join us:

1. Submit Issues to report problems
2. Create Pull Requests to contribute code
3. Improve technical documentation
4. Expand hardware adaptation support
# License
The model weights are derived from TaipingQu/BAAI-Cardiac-Agent and are open‑sourced under the Apache License 2.0: https://www.apache.org/licenses/LICENSE-2.0.txt

Original file line number Diff line number Diff line change
@@ -0,0 +1,121 @@
# Introduction
C2S-Scale-Gemma-27B was jointly developed by the van Dijk Lab at Yale University, Google Research, and Google DeepMind. Built on the Gemma-2 27B architecture, it was trained using the Cell2Sentence (C2S) framework, which converts single-cell RNA sequencing data into "cell sentences" for model training. Trained on over 57 million cells, the model supports tasks such as cell type prediction, tissue classification, and gene expression profile generation, demonstrating the tremendous potential of applying large language models to single-cell biology.

### Integrated Deployment
- Out-of-the-box inference scripts with pre-configured hardware and software parameters
- Released **FlagOS-Hygon** container image supporting deployment within minutes
### Consistency Validation
- Rigorously evaluated through benchmark testing: Performance and results from the FlagOS software stack are compared against native stacks on multiple public.


# Evaluation Results
## Benchmark Result
| Metrics | C2S-Scale-Gemma-2-27B-Nvidia-Origin | C2S-Scale-Gemma-2-27B-Hygon-FlagOS |
|--------------|--------------------------------|--------------------------------------|
| medqa_4options | 0.5169 | 0.5208 |
| pubmedqa | 0.542 | 0.542 |

# User Guide
Environment Setup

| Item | Version |
|------------------|----------------------|
| Docker Version | Docker version 20.10.24, build 297e128 |
| Operating System | Sugon OS 8.9 |

## Operation Steps

### Download FlagOS Image
```
docker pull harbor.baai.ac.cn/flagrelease-public/flagrelease-c2s-scale-gemma-2-27b-hygon-tree_0.5.0_hcu3.0-gems_5.0.0-vllm_0.15.1_das.opt1.alpha.dtk2604.20260220.g2799735a-plugin_none-cx_none-python_3.10.12-torch_2.9.0_das.opt1.dt:202605211626
```

### Download Open-source Model Weights
```bash
pip install modelscope
modelscope download --model FlagRelease/C2S-Scale-Gemma-2-27B-hygon-FlagOS --local_dir /data/C2S-Scale-Gemma-2-27B
```

### Start the Container
```bash
docker run \
--name flagos \
--network=host \
--ipc=host \
--device=/dev/kfd \
--device=/dev/mkfd \
--device=/dev/dri \
-v /opt/hyhal:/opt/hyhal \
-v /root/perfxlab:/workspace \
-v /data:/data \
--group-add video \
--cap-add=SYS_PTRACE \
--security-opt seccomp=unconfined \
-itd \
harbor.baai.ac.cn/flagrelease-public/flagrelease-c2s-scale-gemma-2-27b-hygon-tree_0.5.0_hcu3.0-gems_5.0.0-vllm_0.15.1_das.opt1.alpha.dtk2604.20260220.g2799735a-plugin_none-cx_none-python_3.10.12-torch_2.9.0_das.opt1.dt:202605211626
```
### Start the Server
```bash
vllm serve /data/C2S-Scale-Gemma-2-27B --enforce-eager -tp 2 --served-model-name flagos
```

## Service Invocation
### Invocation Script
```bash
curl http://localhost:8000/v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "flagos",
"messages": [{"role": "user", "content": "你好"}]
}'
```


### AnythingLLM Integration Guide

#### 1. Download & Install

- Visit the official site: https://anythingllm.com/
- Choose the appropriate version for your OS (Windows/macOS/Linux)
- Follow the installation wizard to complete the setup

#### 2. Configuration

- Launch AnythingLLM
- Open settings (bottom left, fourth tab)
- Configure core LLM parameters
- Click "Save Settings" to apply changes

#### 3. Model Interaction

- After model loading is complete:
- Click **"New Conversation"**
- Enter your question (e.g., “Explain the basics of quantum computing”)
- Click the send button to get a response
# Technical Overview
**FlagOS** is a fully open-source system software stack designed to unify the "model–system–chip" layers and foster an open, collaborative ecosystem. It enables a “develop once, run anywhere” workflow across diverse AI accelerators, unlocking hardware performance, eliminating fragmentation among vendor-specific software stacks, and substantially lowering the cost of porting and maintaining AI workloads. With core technologies such as the **FlagScale**, together with vllm-plugin-fl, distributed training/inference framework, **FlagGems** universal operator library, **FlagCX** communication library, and **FlagTree** unified compiler, the **FlagRelease** platform leverages the **FlagOS** stack to automatically produce and release various combinations of \<chip + open-source model\>. This enables efficient and automated model migration across diverse chips, opening a new chapter for large model deployment and application.
## FlagGems
FlagGems is a high-performance, generic operator libraryimplemented in [Triton](https://github.com/openai/triton) language. It is built on a collection of backend-neutralkernels that aims to accelerate LLM (Large-Language Models) training and inference across diverse hardware platforms.
## FlagTree
FlagTree is an open source, unified compiler for multipleAI chips project dedicated to developing a diverse ecosystem of AI chip compilers and related tooling platforms, thereby fostering and strengthening the upstream and downstream Triton ecosystem. Currently in its initial phase, the project aims to maintain compatibility with existing adaptation solutions while unifying the codebase to rapidly implement single-repository multi-backend support. Forupstream model users, it provides unified compilation capabilities across multiple backends; for downstream chip manufacturers, it offers examples of Triton ecosystem integration.
## FlagScale and vllm-plugin-fl
Flagscale is a comprehensive toolkit designed to supportthe entire lifecycle of large models. It builds on the strengths of several prominent open-source projects, including [Megatron-LM](https://github.com/NVIDIA/Megatron-LM) and [vLLM](https://github.com/vllm-project/vllm), to provide a robust, end-to-end solution for managing and scaling large models.
vllm-plugin-fl is a vLLM plugin built on the FlagOS unified multi-chip backend, to help flagscale support multi-chip on vllm framework.
## **FlagCX**
FlagCX is a scalable and adaptive cross-chip communication library. It serves as a platform where developers, researchers, and AI engineers can collaborate on various projects, contribute to the development of cutting-edge AI solutions, and share their work with the global community.

## **FlagEval Evaluation Framework**
FlagEval is a comprehensive evaluation system and open platform for large models launched in 2023. It aims to establish scientific, fair, and open benchmarks, methodologies, and tools to help researchers assess model and training algorithm performance. It features:
- **Multi-dimensional Evaluation**: Supports 800+ modelevaluations across NLP, CV, Audio, and Multimodal fields,covering 20+ downstream tasks including language understanding and image-text generation.
- **Industry-Grade Use Cases**: Has completed horizonta1 evaluations of mainstream large models, providing authoritative benchmarks for chip-model performance validation.

# Contributing

We warmly welcome global developers to join us:

1. Submit Issues to report problems
2. Create Pull Requests to contribute code
3. Improve technical documentation
4. Expand hardware adaptation support
# License
The model weights are derived from vandijklab/C2S-Scale-Gemma-2-27B and are open‑sourced under the Apache License 2.0: https://www.apache.org/licenses/LICENSE-2.0.txt
Loading