Skip to content

Releases: InternLM/lmdeploy

v0.7.0.post3

10 Feb 06:00
e98fd6a
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

🌐 Other

New Contributors

Full Changelog: v0.7.0.post2...v0.7.0.post3

LMDeploy Release V0.7.0.post2

27 Jan 15:57
637435f
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

🌐 Other

Full Changelog: v0.7.0.post1...v0.7.0.post2

LMDeploy Release V0.7.0.post1

25 Jan 11:35
552bf3a
Compare
Choose a tag to compare

What's Changed

💥 Improvements

🐞 Bug fixes

🌐 Other

Full Changelog: v0.7.0...v0.7.0.post1

LMDeploy Release v0.7.0

15 Jan 10:04
9fcb3b1
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

🌐 Other

New Contributors

Full Changelog: 0.6.5...v0.7.0

LMDeploy Release v0.6.5

30 Dec 10:15
af0fcf2
Compare
Choose a tag to compare

What's Changed

🚀 Features

  • [dlinfer] feat: add DlinferFlashAttention to support qwen vl. by @Reinerzhou in #2952

💥 Improvements

🐞 Bug fixes

🌐 Other

New Contributors

Full Changelog: v0.6.4...0.6.5

LMDeploy Release v0.6.4

09 Dec 12:08
14b64c7
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

  • disable prefix-caching for vl model by @grimoire in #2825
  • Fix gemma2 accuracy through the correct softcapping logic by @AllentDan in #2842
  • fix accessing before initialization by @lvhan028 in #2845
  • fix the logic to verify whether AutoAWQ has been successfully installed by @grimoire in #2844
  • check whether backend_config is None or not before accessing its attr by @lvhan028 in #2848
  • [ascend] convert kv cache to nd format in ascend graph mode by @tangzhiyi11 in #2853

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.3...v0.6.4

LMDeploy Release V0.6.3

16 Nov 04:31
0c80baa
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.2...v0.6.3

LMDeploy Release v0.6.2.post1

07 Nov 07:41
4fc9479
Compare
Choose a tag to compare

What's Changed

Bugs

🌐 Other

Full Changelog: v0.6.2...v0.6.2.post1

LMDeploy Release v0.6.2

29 Oct 06:42
522108c
Compare
Choose a tag to compare

Highlights

  • PyTorch engine supports graph mode on ascend platform, doubling the inference speed
  • Support llama3.2-vision models in PyTorch engine
  • Support Mixtral in TurboMind engine, achieving 20+ RPS using SharedGPT dataset with 2 A100-80G GPUs

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

📚 Documentations

🌐 Other

New Contributors

Full Changelog: v0.6.1...v0.6.2

LMDeploy Release V0.6.1

28 Sep 11:34
2e49fc3
Compare
Choose a tag to compare

What's Changed

🚀 Features

💥 Improvements

🐞 Bug fixes

🌐 Other

New Contributors

Full Changelog: v0.6.0...v0.6.1