Commit c5e4eeb

Update deepseek-r1-distill-llama-8b.mdx
1 parent d1662ee commit c5e4eeb

1 file changed: +6 -8 lines changed

pages/managed-inference/reference-content/deepseek-r1-distill-llama-8b.mdx

Lines changed: 6 additions & 8 deletions
@@ -19,23 +19,21 @@ categories:
 |-----------------|------------------------------------|
 | Provider | [Deepseek](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-8B) |
 | License | [MIT](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) |
-| Compatible Instances | L4, H100, H100-2 (FP8, BF16) |
-| Context Length | up to 32k tokens |
+| Compatible Instances | L4, H100 (BF16) |
+| Context Length | up to 131k tokens |
 
 ## Model names
 
 ```bash
-meta/deepseek-r1-distill-llama-8b:fp8
-meta/deepseek-r1-distill-llama-8b:bf16
+deepseek/deepseek-r1-distill-llama-8b:bf16
 ```
 
 ## Compatible Instances
 
 | Instance type | Max context length |
 | ------------- |-------------|
-| L4 | 32k (FP8, BF16) |
-| H100 | 32k (FP8, BF16) |
-| H100-2 | 32k (FP8, BF16) |
+| L4 | 39k (BF16) |
+| H100 | 131k (BF16) |
 
 ## Model introduction
 
@@ -47,7 +45,7 @@ DeepSeek R1 Distill Llama 8B is designed to improve performance of Llama models
 It is great to see Deepseek improving open(weight) models, and we are excited to fully support their mission with integration in the Scaleway ecosystem.
 
 - DeepSeek-R1-Distill-Llama was optimized to reach accuracy close to Deepseek-R1 in tasks like mathematics and coding, while keeping inference costs limited and tokens speed efficient.
-- DeepSeek-R1-Distill-Llama supports a context window up to 32K tokens and tool calling, keeping interaction with other components possible.
+- DeepSeek-R1-Distill-Llama supports a context window up to 131K tokens and tool calling, keeping interaction with other components possible.
 
 ## How to use it
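
Note on the updated "Compatible Instances" table: after this change the maximum context length differs per instance type (39k on L4, 131k on H100, both BF16). As a minimal sketch of how a reader might use that table, the lookup below returns the instance types that can serve a given context length. The dictionary and the function name `instances_for` are hypothetical, the values are copied from the added lines of the diff, and "k" is treated as an opaque unit of thousands of tokens; none of this is part of the documentation file or of any Scaleway API.

```python
# Max context length per instance type, in thousands of tokens ("k"),
# copied from the added lines of the diff. BF16 is the only listed precision.
MAX_CONTEXT_K = {
    "L4": 39,
    "H100": 131,
}


def instances_for(context_k: int) -> list[str]:
    """Return the instance types whose max context covers `context_k`.

    Hypothetical helper for illustration only.
    """
    return [name for name, limit in MAX_CONTEXT_K.items() if context_k <= limit]


if __name__ == "__main__":
    for wanted in (32, 64, 200):
        print(f"{wanted}k context -> {instances_for(wanted) or 'no listed instance'}")
```

A request that fit the previous 32k limit still fits on both instance types, while anything above 39k requires an H100 according to the table.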
