@@ -144,15 +144,15 @@ If you want to explore some new framework like atorch, you could check:
144144We are excited to release the following two CodeLLMs trained by MFTCoder, now available on both HuggingFace and ModelScope:
145145
146146
147- | Model | | | Base Model | Num of examples trained | Batch Size | Seq Length |
148- | --------------------------------------| ------------------------| ---| ----------------------| ------| ------------| ------------|
149- | 🔥🔥 CodeFuse-DeepSeek-33B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-DeepSeek-33B ) | DeepSeek-coder-33B | 60万 | 80 | 4096 |
150- | 🔥🔥 CodeFuse-Mixtral-8x7B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-Mixtral-8x7B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-Mixtral-8x7B ) | Mixtral-8x7B | 60万 | 80 | 4096 |
151- | 🔥🔥 CodeFuse-CodeLlama-34B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeLlama-34B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeLlama-34B ) | CodeLlama-34b-Python | 60万 | 80 | 4096 |
152- | 🔥🔥 CodeFuse-CodeLlama-34B-4bits | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeLlama-34B-4bits ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeLlama-34B-4bits ) | CodeLlama-34b-Python | | | 4096 |
153- | 🔥🔥 CodeFuse-StarCoder-15B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-StarCoder-15B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-StarCoder-15B ) | StarCoder-15B | 60万 | 80 | 4096 |
154- | 🔥🔥 CodeFuse-QWen-14B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-QWen-14B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-QWen-14B ) | Qwen-14b | 110万 | 256 | 4096 |
155- | 🔥🔥 CodeFuse-CodeGeex2-6B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeGeex2-6B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeGeex2-6B ) | CodeGeex2-6B | 110万 | 256 | 4096 |
147+ | Model | HuggingFace Links | ModelScope Links | Base Model | Num of examples trained | Batch Size | Seq Length |
148+ | --------------------------------------| --------------------------------------------------------------------------- | ------------------------------------------------------------------------------ ---| ----------------------| ------| ------------| ------------|
149+ | 🔥🔥 CodeFuse-DeepSeek-33B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-DeepSeek-33B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-DeepSeek-33B ) | DeepSeek-coder-33B | 60万 | 80 | 4096 |
150+ | 🔥🔥 CodeFuse-Mixtral-8x7B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-Mixtral-8x7B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-Mixtral-8x7B ) | Mixtral-8x7B | 60万 | 80 | 4096 |
151+ | 🔥🔥 CodeFuse-CodeLlama-34B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeLlama-34B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeLlama-34B ) | CodeLlama-34b-Python | 60万 | 80 | 4096 |
152+ | 🔥🔥 CodeFuse-CodeLlama-34B-4bits | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeLlama-34B-4bits ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeLlama-34B-4bits ) | CodeLlama-34b-Python | | | 4096 |
153+ | 🔥🔥 CodeFuse-StarCoder-15B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-StarCoder-15B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-StarCoder-15B ) | StarCoder-15B | 60万 | 80 | 4096 |
154+ | 🔥🔥 CodeFuse-QWen-14B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-QWen-14B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-QWen-14B ) | Qwen-14b | 110万 | 256 | 4096 |
155+ | 🔥🔥 CodeFuse-CodeGeex2-6B | [ h-link] ( https://huggingface.co/codefuse-ai/CodeFuse-CodeGeex2-6B ) | [ m-link] ( https://modelscope.cn/models/codefuse-ai/CodeFuse-CodeGeex2-6B ) | CodeGeex2-6B | 110万 | 256 | 4096 |
156156
157157
158158## Datasets
0 commit comments