perf: improve performance of 2018 day 2 part 2 #11
Conversation
Force-pushed 02094ef to f6c50ce
I really appreciate anyone taking the time to make performance improvements or showing how solutions could be better, so all PRs are much appreciated!
src/year2018/day02.rs (Outdated)
```rust
buffer[0..width].copy_from_slice(id);
buffer[column] = b'*';
let mut diff = false;
for (a, b) in id1.iter().zip(id2) {
```
This causes a performance regression on my input, 87 µs to 129 µs.
Two nested loops have worst case quadratic complexity O(n²). Hashing is a constant time operation, so the original solution's worst case complexity is O(1) * 26 * O(n) = O(n).
So it looks like you're getting lucky, while I'm getting unlucky with the input order.
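For reference, the original set-based approach (visible in the diff context above) can be sketched stand-alone. This is a minimal version: it uses std's `HashSet` in place of the crate's `FastSet`, and the `find_common` name is made up for the example.

```rust
use std::collections::HashSet;

// Set-based pairing: for each column, insert every ID with that column
// replaced by a wildcard byte. A failed insert means two IDs differ only
// in that column.
fn find_common(ids: &[&[u8]]) -> Option<String> {
    let width = ids[0].len();
    let mut seen = HashSet::with_capacity(ids.len());

    for column in 0..width {
        for &id in ids {
            // Wildcard one column so that two IDs differing only there
            // produce identical keys.
            let mut buffer = id.to_vec();
            buffer[column] = b'*';

            if !seen.insert(buffer) {
                // Found the pair: return the common letters, skipping the column.
                return Some(
                    id.iter()
                        .enumerate()
                        .filter(|&(i, _)| i != column)
                        .map(|(_, &b)| char::from(b))
                        .collect(),
                );
            }
        }
        seen.clear();
    }
    None
}

fn main() {
    let ids: Vec<&[u8]> = vec![b"fghij", b"fguij"];
    assert_eq!(find_common(&ids), Some(String::from("fgij")));
}
```

Each ID is visited once per column, so the work is bounded by the number of IDs times the ID width, regardless of input order.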
I am no expert in O(N) notation, but wouldn't the N here be the number of IDs, so it should be lower than O(N^2) (second loop starts at the index of the first loop)?
Anyway, fair enough about the performance being worse. Just luck based that it's so much faster for my input then. I'll close the PR if that's alright with you, but out of curiosity, would you mind sharing your input too?
> I am no expert in O(N) notation, but wouldn't the N here be the number of IDs, so it should be lower than O(N^2) (second loop starts at the index of the first loop)?
Say you have 10 ids. First inner loop iteration will check 9 items, next iteration 8 and so on...
9 + 8 + 7 + ... + 2 + 1 = n(n - 1) / 2 = (1/2)(n² - n)
So you save half the comparisons but it's still overall quadratic.
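The sum above can be checked mechanically; a small sketch counting the comparisons the skip-ahead inner loop actually performs:

```rust
// Count the comparisons made by the pairwise approach: the inner loop
// starts one past the outer index, so for n IDs the total is
// (n - 1) + (n - 2) + ... + 1 = n(n - 1) / 2, which is still O(n²).
fn pairwise_comparisons(n: usize) -> usize {
    let mut count = 0;
    for i in 0..n {
        for _ in i + 1..n {
            count += 1;
        }
    }
    count
}

fn main() {
    // 10 IDs -> 9 + 8 + ... + 1 = 45 comparisons.
    assert_eq!(pairwise_comparisons(10), 45);
    assert_eq!(pairwise_comparisons(10), 10 * 9 / 2);
}
```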
> Anyway, fair enough about the performance being worse. Just luck based that it's so much faster for my input then. I'll close the PR if that's alright with you, but out of curiosity, would you mind sharing your input too?
If you re-order the elements (for example swap the first and second halves) you can change the performance drastically. With a few random swaps I got benchmarks from 33 µs to 250 µs.
I still think it might be possible to improve the performance by taking advantage of the special structure of the input (always 26 lower case ASCII characters).
> So you save half the comparisons but it's still overall quadratic.
I seeee, thank you for helping me understand.
> If you re-order the elements (for example swap the first and second halves) you can change the performance drastically
Ah yes, can confirm moving one of the solutions can make it much worse. I really did get quite lucky.
> I still think it might be possible to improve the performance by taking advantage of the special structure of the input (always 26 lower case ASCII characters).
I'll have a look at it again later and see if I can come up with anything (unless you do first).
BTW I welcome PRs (even if they don't always work out). It's always an interesting discussion and could spark an improvement either in this or another solution.
Sure yeah. Thank you for being so understanding.
…prefix and suffix of IDs into the hashset
I tried a couple different (more complicated) approaches without much luck. Went back to the original approach and changed it from storing the ID with a character replaced in the set, to just storing the prefix and suffix around the character, and I believe there is some performance gained from that.
If you're interested @maneatingape, it was just a simple change but for me it's giving around a 30-45% speed up:

```rust
pub fn part2(input: &[&[u8]]) -> String {
    let width = input[0].len();
    let mut seen = FastSet::with_capacity(input.len());

    // Use a set to check for duplicates by comparing the prefix and suffix of IDs excluding one
    // column at a time.
    for column in 0..width {
        for &id in input {
            let prefix = &id[..column];
            let suffix = &id[column + 1..];

            if !seen.insert([prefix, suffix]) {
                // Convert to String
                return prefix.iter().chain(suffix).cloned().map(char::from).collect();
            }
        }
        seen.clear();
    }

    unreachable!()
}
```
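As a runnable reference, here is a self-contained version of the snippet above with std's `HashSet` swapped in for the crate's `FastSet` (an assumption made so the sketch compiles stand-alone), plus a small usage example:

```rust
use std::collections::HashSet;

// Same idea as the snippet above: the set key is [prefix, suffix], two
// borrowed slices of the ID with one column removed, so no per-ID buffer
// copy is needed. A failed insert means two IDs differ only in `column`.
fn part2(input: &[&[u8]]) -> String {
    let width = input[0].len();
    let mut seen = HashSet::with_capacity(input.len());

    for column in 0..width {
        for &id in input {
            let prefix = &id[..column];
            let suffix = &id[column + 1..];

            if !seen.insert([prefix, suffix]) {
                // Join the slices around the differing column into a String.
                return prefix.iter().chain(suffix).cloned().map(char::from).collect();
            }
        }
        seen.clear();
    }

    // The puzzle guarantees exactly one matching pair exists.
    unreachable!()
}

fn main() {
    let input: Vec<&[u8]> = vec![b"abcde", b"fghij", b"klmno", b"pqrst", b"fguij"];
    // fghij and fguij differ only in the third column.
    assert_eq!(part2(&input), "fgij");
}
```

Storing two borrowed slices instead of an owned, mutated copy of the ID is where the speed-up comes from: hashing still touches every byte, but the allocation and copy per ID per column disappear.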
Neat! Faster and more elegant. 79 µs => 49 µs.
Missed that lint - not getting these from clippy locally, but yeah.
Hi again. This is a quick one for improving the performance of 2018 day 2 part 2 (85μs => 15μs for me). Let me know if you want any changes, and no hard feelings if you don't want the PR at all.