Replies: 1 comment
I now understand.
When sending text that is larger than the context length of the model, vLLM throws an error with:
Is there an option to enable some sort of auto truncation?
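One common workaround, pending a server-side option, is to truncate the prompt on the client before sending it. The sketch below assumes you already have the prompt as token IDs (in practice you would encode with the model's own tokenizer, e.g. `transformers.AutoTokenizer.from_pretrained(model_name)`); the helper name and the dummy IDs are illustrative, not part of vLLM's API:

```python
# Client-side truncation sketch: keep only as many prompt tokens as fit
# in the model's context window after reserving room for generation.

def truncate_to_context(token_ids, max_model_len, max_new_tokens):
    """Keep the most recent tokens so that prompt + generation fits."""
    budget = max_model_len - max_new_tokens
    if budget <= 0:
        raise ValueError("max_new_tokens leaves no room for the prompt")
    # Keep the tail of the prompt, dropping the oldest tokens.
    return token_ids[-budget:]

# Example with dummy token IDs standing in for a real tokenizer's output:
ids = list(range(5000))  # pretend the prompt encodes to 5000 tokens
kept = truncate_to_context(ids, max_model_len=4096, max_new_tokens=256)
print(len(kept))   # 3840
print(kept[0])     # 1160 (oldest surviving token)
```

Depending on your vLLM version, `SamplingParams` may also accept a `truncate_prompt_tokens` argument that performs a similar tail-keeping truncation server-side; check the docs for the version you are running.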