
Commit d1662ee

Update deepseek-r1-distill-llama-70b.mdx
1 parent 0fa843c commit d1662ee


1 file changed: +5 -7 lines changed

pages/managed-inference/reference-content/deepseek-r1-distill-llama-70b.mdx

Lines changed: 5 additions & 7 deletions
````diff
@@ -19,22 +19,20 @@ categories:
 |-----------------|------------------------------------|
 | Provider | [Deepseek](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Llama-70B) |
 | License | [MIT](https://huggingface.co/datasets/choosealicense/licenses/blob/main/markdown/mit.md) |
-| Compatible Instances | H100 (FP8), H100-2 (FP8, BF16) |
-| Context Length | up to 32k tokens |
+| Compatible Instances | H100-2 (BF16) |
+| Context Length | up to 56k tokens |
 
 ## Model names
 
 ```bash
-meta/deepseek-r1-distill-llama-70b:fp8
-meta/deepseek-r1-distill-llama-70b:bf16
+deepseek/deepseek-r1-distill-llama-70b:bf16
 ```
 
 ## Compatible Instances
 
 | Instance type | Max context length |
 | ------------- |-------------|
-| H100 | 32k (FP8) |
-| H100-2 | 32k (FP8, BF16) |
+| H100-2 | 56k (BF16) |
 
 ## Model introduction
 
@@ -46,7 +44,7 @@ DeepSeek R1 Distill Llama 70B is designed to improve performance of Llama models
 It is great to see Deepseek improving open(weight) models, and we are excited to fully support their mission with integration in the Scaleway ecosystem.
 
 - DeepSeek-R1-Distill-Llama was optimized to reach accuracy close to Deepseek-R1 in tasks like mathematics and coding, while keeping inference costs limited and tokens speed efficient.
-- DeepSeek-R1-Distill-Llama supports a context window up to 32K tokens and tool calling, keeping interaction with other components possible.
+- DeepSeek-R1-Distill-Llama supports a context window up to 56K tokens and tool calling, keeping interaction with other components possible.
 
 ## How to use it
 
````
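For context, the renamed identifier above is what API calls must now reference. Below is a minimal sketch of a chat completion request to a Managed Inference deployment serving this model; it assumes an OpenAI-compatible `/v1/chat/completions` route and bearer-token authentication, and the endpoint URL and `$SCW_SECRET_KEY` variable are placeholders for your own deployment's values, not anything defined by this commit:

```bash
# Sketch only: the endpoint URL and auth scheme are assumptions, not part of this commit.
# Substitute the endpoint and secret key from your own Managed Inference deployment.
curl -s "https://<your-deployment-endpoint>/v1/chat/completions" \
  -H "Authorization: Bearer $SCW_SECRET_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek/deepseek-r1-distill-llama-70b:bf16",
    "messages": [
      {"role": "user", "content": "Summarize the proof that sqrt(2) is irrational."}
    ],
    "max_tokens": 512
  }'
```

Note that the old `meta/deepseek-r1-distill-llama-70b:fp8` and `meta/deepseek-r1-distill-llama-70b:bf16` names are removed by this commit, and the prompt plus completion must fit within the new 56k-token context window.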
