Improve glossary lookup performance by caching automaton #9949

nijel · 2023-09-14T07:27:02Z

Describe the problem

On massive glossaries, glossary terms lookup can take hundredths of milliseconds.

Describe the solution you'd like

Use serialized automaton, this is currently discussed at G-Research/ahocorasick_rs#41.

Describe alternatives you've considered

No response

Screenshots

No response

Additional context

It used to be there since #5545, but it turned out not to be that good as the stored object was huge and restoring it from the cache took considerable time. It was reverted in dff3984. Since d8a62a4 the build times are much lower, but still, caching automaton would be probably beneficial.

The text was updated successfully, but these errors were encountered:

nijel added enhancement Adding or requesting a new feature. help wanted Extra attention is needed. External knowledge welcome. labels Sep 14, 2023

nijel changed the title ~~Improve glossary lookup performance~~ Improve glossary lookup performance by caching automaton Jan 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve glossary lookup performance by caching automaton #9949

Improve glossary lookup performance by caching automaton #9949

nijel commented Sep 14, 2023 •

edited

Loading

Improve glossary lookup performance by caching automaton #9949

Improve glossary lookup performance by caching automaton #9949

Comments

nijel commented Sep 14, 2023 • edited Loading

Describe the problem

Describe the solution you'd like

Describe alternatives you've considered

Screenshots

Additional context

nijel commented Sep 14, 2023 •

edited

Loading