Only load searchindex when needed #2553

GuillaumeGomez · 2025-02-20T22:34:54Z

This PR makes it so that the searchindex.js file is only loaded when the user arrives on a page with ?search= in the URL or if the user opens the search. It means that the book size will be only the text content until the user actually needs to run a search.

cc @notriddle

notriddle · 2025-02-20T22:55:29Z

If you're going to mess with the search engine, can you also write some gui tests that exercise it?

GuillaumeGomez · 2025-02-20T23:00:23Z

Very good point.

GuillaumeGomez · 2025-02-20T23:42:48Z

Added the GUI test. :)

notriddle · 2025-02-21T00:29:11Z

tests/gui/sidebar-nojs.goml

@@ -2,9 +2,6 @@
 // an iframe (because of JS disabled).
 // Regression test for <https://github.com/rust-lang/mdBook/issues/2528>.

-// We disable the requests checks because `searchindex.json` will always fail
-// locally.
-fail-on-request-error: false


Great! 👍

src/theme/searcher/searcher.js

GuillaumeGomez · 2025-02-21T17:31:55Z

When the JS is being retrieved, there is now a spinner and the input is not disabled anymore:

GuillaumeGomez · 2025-03-10T12:02:33Z

Rebased and fixed merge conflict.

notriddle · 2025-03-16T04:07:59Z

It looks like you reverted the loading throbber, and now it disables the search input while it's loading again.

16cbfd6 added it, but the pull request doesn't have it any more.

GuillaumeGomez · 2025-03-16T21:07:38Z

Arg indeed. Rebase went wrong I guess. Great catch, thanks!

notriddle · 2025-03-21T01:25:59Z

Everything seems good here!

GuillaumeGomez · 2025-03-23T09:05:53Z

Rebased.

GuillaumeGomez · 2025-03-23T20:17:13Z

Re-rebased and also ran eslint.

GuillaumeGomez · 2025-03-31T19:56:51Z

Rebased. If you could take a look @ehuss. If you need help to understand what this PR is doing, don't hesitate to ask!

ehuss · 2025-03-31T20:06:18Z

Can you say more about the reasoning behind this change? For me, it comes off as a worse experience, so we are somehow not seeing the same thing. For example, after opening a book and looking at it, I may want to search for something. Today this loads instantly (since it is eagerly loaded), but with this change I may need to wait 10+ seconds for anything to happen. Is this trying to save bandwidth? Or is it some performance issue?

I'm not completely against this kind of change, but the description here doesn't explain the motivation or acknowledge the potential downsides.

GuillaumeGomez · 2025-03-31T20:13:33Z

Today this loads instantly (since it is eagerly loaded), but with this change I may need to wait 10+ seconds for anything to happen. Is this trying to save bandwidth? Or is it some performance issue?

I'm very surprised. What book do you have this 10+ seconds wait?

The whole point of this PR is to actually speed up the page load and reduce the (page) size by default until you actually need the search. This is becomes more and more useful as the book size (and its search index) grows. This is how we allow rustdoc to always load extremely quickly, whatever the size of the crate (and of its search index).

ehuss · 2025-04-02T18:17:41Z

The RFCs book has a 22MB index. I suppose 10s might be a bit of an exaggeration for many people, as I was just doing some throttling tests, though I think we should be sensitive to people without fast internet.

Can you help me understand how this speeds up page load? My understanding is that the index is loaded async in the background. My expectation is that since it is loading in the background, it shouldn't have any measurable effect on initial render time. What little page-load profiling I've done doesn't show a difference with this PR.

GuillaumeGomez · 2025-04-02T19:08:29Z

It's mostly for people with limited internet access (not in bandwidth but in data) that will get their life improved with this PR: it only loads extra content on demand.

Can you help me understand how this speeds up page load? My understanding is that the index is loaded async in the background. My expectation is that since it is loading in the background, it shouldn't have any measurable effect on initial render time. What little page-load profiling I've done doesn't show a difference with this PR.

I wasn't clear, my bad. The load time should normally be close to not impacted (except I forgot to convert the JS to JSON.parse(), meaning that for big search indexes, all pages will have an impact when the JS will be parsed, because JS parsing is performed in the main thread, should be fixed in #2633). Here, the impact is to reduce memory usage for the tab (until you actually need the search) and saving data.

GuillaumeGomez · 2025-04-02T19:13:01Z

Ah I found a good analogy: with the RFC book, instead of loading the 22MB search index on all pages whether you need or not, you will only load it when you want to perform a search.

ehuss · 2025-05-30T17:32:58Z

I'm still a little concerned about the reduced responsiveness of the search box, though not enough to block this change. For example, given the RFC repo's 22MB search index:

Speed	Time
5mbps	37s
40mbps	4.6s
100mbps	1.8s
500mbps	0.37s
1000mbps	0.18s

I don't know what would be a typical network connection speed we should be aiming for. I think a 1 or 2s delay for slower connections although unfortunate, should be fine. And the delay should only happen on the first search while it stays cached.

Can you rebase and resolve the conflict, and then we can merge?

notriddle · 2025-05-30T20:57:05Z

There's a reason I was so negative on disabling the search field: you should be able to type your query in while the search index is loading, so you don't actually have to wait.

(at least, assuming it takes a couple seconds to type the search query in, and assuming you're not on the 5Mbps link)

GuillaumeGomez · 2025-05-31T07:35:01Z

We should also work on reducing the size of the search index.

GuillaumeGomez · 2025-05-31T08:07:13Z

Fixed merge conflict. :)

ehuss

Thanks!

rustbot added the S-waiting-on-review Status: waiting on a review label Feb 20, 2025

GuillaumeGomez force-pushed the load-on-need branch from 8ab48db to 8b5125d Compare February 20, 2025 23:34

notriddle reviewed Feb 21, 2025

View reviewed changes

GuillaumeGomez force-pushed the load-on-need branch from e3244b2 to 16cbfd6 Compare February 21, 2025 17:30

notriddle approved these changes Feb 21, 2025

View reviewed changes

GuillaumeGomez requested a review from ehuss February 21, 2025 22:14

GuillaumeGomez force-pushed the load-on-need branch 2 times, most recently from 38b4ea7 to c96b7e4 Compare March 10, 2025 11:13

notriddle approved these changes Mar 21, 2025

View reviewed changes

GuillaumeGomez force-pushed the load-on-need branch from a002ecb to 27d7465 Compare March 23, 2025 09:05

GuillaumeGomez force-pushed the load-on-need branch from 27d7465 to afd2be1 Compare March 23, 2025 20:16

notriddle approved these changes Mar 25, 2025

View reviewed changes

GuillaumeGomez force-pushed the load-on-need branch from afd2be1 to 5557b3f Compare March 31, 2025 19:56

This comment has been minimized.

Sign in to view

GuillaumeGomez force-pushed the load-on-need branch from 5557b3f to 55fd7ad Compare May 31, 2025 07:34

GuillaumeGomez added 3 commits May 31, 2025 09:36

Only load searchindex when needed

1fb91d6

Add GUI test for search

d64a863

Add a spinner when search is in progress

2fa13cf

GuillaumeGomez force-pushed the load-on-need branch from 55fd7ad to 2fa13cf Compare May 31, 2025 07:36

Update search test

dc6b0a6

ehuss approved these changes Jun 2, 2025

View reviewed changes

ehuss added this pull request to the merge queue Jun 2, 2025

Merged via the queue into rust-lang:master with commit 94f9a9c Jun 2, 2025
14 checks passed

GuillaumeGomez deleted the load-on-need branch June 2, 2025 19:31

Only load searchindex when needed #2553

Only load searchindex when needed #2553

Uh oh!

Conversation

GuillaumeGomez commented Feb 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

notriddle commented Feb 20, 2025

Uh oh!

GuillaumeGomez commented Feb 20, 2025

Uh oh!

GuillaumeGomez commented Feb 20, 2025

Uh oh!

notriddle Feb 21, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

GuillaumeGomez commented Feb 21, 2025

Uh oh!

GuillaumeGomez commented Mar 10, 2025

Uh oh!

notriddle commented Mar 16, 2025

Uh oh!

GuillaumeGomez commented Mar 16, 2025

Uh oh!

notriddle commented Mar 21, 2025

Uh oh!

GuillaumeGomez commented Mar 23, 2025

Uh oh!

GuillaumeGomez commented Mar 23, 2025

Uh oh!

GuillaumeGomez commented Mar 31, 2025

Uh oh!

ehuss commented Mar 31, 2025

Uh oh!

GuillaumeGomez commented Mar 31, 2025

Uh oh!

ehuss commented Apr 2, 2025

Uh oh!

GuillaumeGomez commented Apr 2, 2025

Uh oh!

GuillaumeGomez commented Apr 2, 2025

Uh oh!

This comment has been minimized.

ehuss commented May 30, 2025

Uh oh!

notriddle commented May 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

GuillaumeGomez commented May 31, 2025

Uh oh!

GuillaumeGomez commented May 31, 2025

Uh oh!

ehuss left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

GuillaumeGomez commented Feb 20, 2025 •

edited

Loading

notriddle commented May 30, 2025 •

edited

Loading