Skip to content

Releases: xorbitsai/inference

v0.10.1

12 Apr 02:47
e3a947e
Compare
Choose a tag to compare

What's new in 0.10.1 (2024-04-12)

These are the changes in inference v0.10.1.

New features

Enhancements

Bug fixes

Tests

Documentation

New Contributors

Full Changelog: v0.10.0...v0.10.1

v0.10.0

29 Mar 04:56
2857ec4
Compare
Choose a tag to compare

What's new in 0.10.0 (2024-03-29)

These are the changes in inference v0.10.0.

New features

Enhancements

Bug fixes

  • BUG: Fix the incorrect model interface address caused a 307 redirect to HTTP, blocking the request and preventing the display of the model list. by @wertycn in #1182
  • BUG: fix doc fail introduced by #1171 & update readme by @qinxuye in #1203
  • BUG: Increase validator types for thie 'input' parameter of embeddings to match OpenAI API by @Minamiyama in #1201

Documentation

New Contributors

Full Changelog: v0.9.4...v0.10.0

v0.9.4

21 Mar 07:06
2c9465a
Compare
Choose a tag to compare

What's new in 0.9.4 (2024-03-21)

These are the changes in inference v0.9.4.

New features

Enhancements

  • ENH: vLLM latest models support by @1572161937 in #1155

Bug fixes

Documentation

Others

New Contributors

Full Changelog: v0.9.3...v0.9.4

v0.9.3

15 Mar 06:36
60f098c
Compare
Choose a tag to compare

What's new in 0.9.3 (2024-03-15)

These are the changes in inference v0.9.3.

New features

Enhancements

Bug fixes

  • BUG: fix the useless fstring. by @mikeshi80 in #1130
  • BUG: Fixing the issue of model list loading failure caused by a large number of invalid requests on the model list page. by @wertycn in #1111
  • BUG: Fix cache status for embedding, rerank and image models on the web UI by @ChengjieLi28 in #1135
  • BUG: Fix missing information for xinference registrations and xinference list command by @ChengjieLi28 in #1140
  • BUG: Fix cannot continue to chat after canceling the streaming chat via ctrl+c by @ChengjieLi28 in #1144

Tests

Documentation

New Contributors

Full Changelog: v0.9.2...v0.9.3

v0.9.2

08 Mar 06:09
29f4c10
Compare
Choose a tag to compare

What's new in 0.9.2 (2024-03-08)

These are the changes in inference v0.9.2.

New features

Enhancements

  • ENH: Supports n_gpu_layers parameter for llama-cpp-python by @ChengjieLi28 in #1070
  • ENH: Add a dropdown to the web UI to support adjusting GPU offload layers for llama.cpp loader by @notsyncing in #1073
  • ENH: [UI] Show replica on running model page by @ChengjieLi28 in #1093
  • ENH: Add "[DONE]" to the end of stream generation for better openai SDK compatibility by @ZhangTianrong in #1062
  • ENH: [UI] Support setting CPU when selecting n_gpu by @ChengjieLi28 in #1096

Documentation

Others

  • Update llm_family.json to correct the context length of glaive coder by @mikeshi80 in #1083

New Contributors

Full Changelog: v0.9.1...v0.9.2

v0.9.1

01 Mar 07:04
7b20f76
Compare
Choose a tag to compare

What's new in 0.9.1 (2024-03-01)

These are the changes in inference v0.9.1.

New features

Enhancements

Bug fixes

  • BUG: Fix some models that cannot download from modelscope by @ChengjieLi28 in #1066
  • BUG: Fix early truncation due to max_token being default to 16 instead of 1024 by @ZhangTianrong in #1061

Documentation

New Contributors

Full Changelog: v0.9.0...v0.9.1

v0.9.0

22 Feb 08:03
c653c97
Compare
Choose a tag to compare

What's new in 0.9.0 (2024-02-22)

These are the changes in inference v0.9.0.

New features

  • FEAT: Refactor device related code and add initial Intel GPU support by @notsyncing in #968
  • FEAT: Support gemma series model by @aresnow1 in #1024

Enhancements

Bug fixes

Documentation

  • DOC: Polish model operation command doc by @onesuper in #1000
  • DOC: Fix note on secret_key generation and algorithm selection for OAuth2 by @ChengjieLi28 in #1012

New Contributors

Full Changelog: v0.8.5...v0.9.0

v0.8.5

06 Feb 05:37
e903e05
Compare
Choose a tag to compare

What's new in 0.8.5 (2024-02-06)

These are the changes in inference v0.8.5.

New features

Enhancements

Bug fixes

Others

New Contributors

Full Changelog: v0.8.4...v0.8.5

v0.8.4

04 Feb 09:17
1b9b8c8
Compare
Choose a tag to compare

What's new in 0.8.4 (2024-02-04)

These are the changes in inference v0.8.4.

Enhancements

Bug fixes

Documentation

Full Changelog: v0.8.3.1...v0.8.4

v0.8.3.1

02 Feb 08:06
cfbe5ba
Compare
Choose a tag to compare

What's new in 0.8.3.1 (2024-02-02)

These are the changes in inference v0.8.3.1.

Bug fixes

Full Changelog: v0.8.3...v0.8.3.1