Skip to content

ScaleLLM ModelCards

teddyli edited this page Nov 14, 2023 · 1 revision
Models Tensor Parallel Quantization Chat API HF models examples
Yi Yes Yes No 01-ai/Yi-6B, 01-ai/Yi-6B-200K, casperhansen/yi-6b-awq, TheBloke/Yi-34B-GPTQ
Llama2 Yes Yes Yes meta-llama/Llama-2-7b, TheBloke/Llama-2-13B-chat-GPTQ, TheBloke/Llama-2-70B-AWQ
Aquila Yes Yes Yes BAAI/Aquila-7B, BAAI/AquilaChat-7B
Bloom Yes Yes No bigscience/bloom
GPT_j Yes Yes No EleutherAI/gpt-j-6b
GPT_NeoX Yes Yes No EleutherAI/gpt-neox-20b
GPT2 Yes Yes No gpt2
InternLM Yes Yes Yes internlm/internlm-7b
Mistral Yes Yes Yes mistralai/Mistral-7B-v0.1
MPT Yes Yes No mosaicml/mpt-30b
Clone this wiki locally