Skip to content

Commit 966308a

Browse files
authored
Add L40S to Managed Inference models
1 parent 7c7bd99 commit 966308a

File tree

1 file changed

+4
-2
lines changed

1 file changed

+4
-2
lines changed

pages/managed-inference/reference-content/llama-3-8b-instruct.mdx

Lines changed: 4 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -18,7 +18,7 @@ categories:
1818
| Attribute | Details |
1919
|-----------------|------------------------------------|
2020
| Provider | [Meta](https://llama.meta.com/llama3/) |
21-
| Compatible Instances | L4, H100 (FP8, BF16) |
21+
| Compatible Instances | L4, L40S, H100, H100-2 (FP8, BF16) |
2222
| Context size | 8192 tokens |
2323

2424
## Model names
@@ -33,7 +33,9 @@ meta/llama-3-8b-instruct:fp8
3333
| Instance type | Max context length |
3434
| ------------- |-------------|
3535
| L4 | 8192 (FP8, BF16) |
36-
| H100 | 8192 (FP8, BF16)
36+
| L40S | 8192 (FP8, BF16) |
37+
| H100 | 8192 (FP8, BF16) |
38+
| H100-2 | 8192 (FP8, BF16) |
3739

3840
## Model introduction
3941

0 commit comments

Comments
 (0)