ScaleLLM ModelCards
teddyli edited this page Nov 14, 2023 · 1 revision
Model | Tensor Parallel | Quantization | Chat API | Example Hugging Face models |
---|---|---|---|---|
Yi | Yes | Yes | No | 01-ai/Yi-6B, 01-ai/Yi-6B-200K, casperhansen/yi-6b-awq, TheBloke/Yi-34B-GPTQ |
Llama2 | Yes | Yes | Yes | meta-llama/Llama-2-7b, TheBloke/Llama-2-13B-chat-GPTQ, TheBloke/Llama-2-70B-AWQ |
Aquila | Yes | Yes | Yes | BAAI/Aquila-7B, BAAI/AquilaChat-7B |
Bloom | Yes | Yes | No | bigscience/bloom |
GPT_j | Yes | Yes | No | EleutherAI/gpt-j-6b |
GPT_NeoX | Yes | Yes | No | EleutherAI/gpt-neox-20b |
GPT2 | Yes | Yes | No | gpt2 |
InternLM | Yes | Yes | Yes | internlm/internlm-7b |
Mistral | Yes | Yes | Yes | mistralai/Mistral-7B-v0.1 |
MPT | Yes | Yes | No | mosaicml/mpt-30b |
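
The support matrix above can also be read programmatically. The following is a minimal sketch that copies the table into a plain Python dictionary and filters it by capability. The names `ModelSupport`, `SUPPORT`, and `models_with_chat_api` are hypothetical helpers introduced here for illustration; they are not part of ScaleLLM.

```python
from typing import Dict, List, NamedTuple


class ModelSupport(NamedTuple):
    """One row of the table above (hypothetical helper, not a ScaleLLM type)."""
    tensor_parallel: bool
    quantization: bool
    chat_api: bool


# Values transcribed by hand from the table above, in the same row order.
SUPPORT: Dict[str, ModelSupport] = {
    "Yi": ModelSupport(True, True, False),
    "Llama2": ModelSupport(True, True, True),
    "Aquila": ModelSupport(True, True, True),
    "Bloom": ModelSupport(True, True, False),
    "GPT_j": ModelSupport(True, True, False),
    "GPT_NeoX": ModelSupport(True, True, False),
    "GPT2": ModelSupport(True, True, False),
    "InternLM": ModelSupport(True, True, True),
    "Mistral": ModelSupport(True, True, True),
    "MPT": ModelSupport(True, True, False),
}


def models_with_chat_api(table: Dict[str, ModelSupport] = SUPPORT) -> List[str]:
    """Return the model families whose Chat API column is 'Yes'."""
    return [name for name, support in table.items() if support.chat_api]


if __name__ == "__main__":
    print(models_with_chat_api())
```

Because every family in the table is marked "Yes" for tensor parallelism and quantization, the Chat API column is the only one that distinguishes rows, which is why the sketch filters on it.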