v0.5.5
What's new in 0.5.5 (2023-10-26)
These are the changes in inference v0.5.5.
Enhancements
- ENH: display language tags by @Minamiyama in #558
- ENH: filter models by type by @Minamiyama in #559
- ENH: disable create embeddings using LLMs by @UranusSeven in #570
- ENH: benchmark latency by @UranusSeven in #576
- ENH: configurable
XINFERENCE_HOME
env by @ChengjieLi28 in #566
Bug fixes
- BUG: Fix
bge-base-zh
andbge-large-zh
from ModelScope by @ChengjieLi28 in #571 - BUG: When change model revision, xinference still uses the previous model by @ChengjieLi28 in #573
- BUG: incorrect vLLM config by @UranusSeven in #579
- BUG: fix llama-2 stop words by @UranusSeven in #580
Documentation
- DOC: Incompatibility Between NVIDIA Driver and PyTorch Version by @onesuper in #551
- DOC: Examples and resources page by @onesuper in #561
Full Changelog: v0.5.4...v0.5.5