Fast extension matching #87

tavianator · 2024-07-05T01:51:13Z

~~I say "fast" but I didn't actually benchmark it yet.~~ Benchmarks below. It's definitely better from a time complexity perspective.

This is basically how I do it in bfs, except I used aho-corasick rather than write my own trie implementation.

Includes #86.

By using a trie, we can match suffixes in linear time in the length of the *suffix itself*, rather than the length of the *list of suffixes*.
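To make the complexity claim concrete, here is a minimal sketch of a trie keyed on suffix bytes in reverse order. It is not the code from this PR or the lscolors API: `SuffixTrie`, `insert`, and `longest_match` are hypothetical names, the sketch is case-sensitive, and it assumes the longest matching suffix wins, which may differ from the priority rules the crate actually applies.

```rust
use std::collections::HashMap;

/// One trie node. Edges are labelled with suffix bytes read back to front.
#[derive(Default)]
struct Node {
    children: HashMap<u8, Node>,
    /// Style stored when a complete suffix ends at this node.
    style: Option<String>,
}

/// Hypothetical suffix matcher (illustration only, not the lscolors API).
#[derive(Default)]
pub struct SuffixTrie {
    root: Node,
}

impl SuffixTrie {
    pub fn new() -> Self {
        Self::default()
    }

    /// Insert a suffix such as ".tar.gz" by walking its bytes in reverse.
    pub fn insert(&mut self, suffix: &str, style: &str) {
        let mut node = &mut self.root;
        for b in suffix.bytes().rev() {
            node = node.children.entry(b).or_default();
        }
        node.style = Some(style.to_string());
    }

    /// Return the style of the longest stored suffix that `name` ends with.
    /// The loop stops at the first byte with no matching edge, so the work
    /// is bounded by the length of the matched suffix, not by the number of
    /// suffixes stored. (Assumption: longest match wins.)
    pub fn longest_match(&self, name: &str) -> Option<&str> {
        let mut node = &self.root;
        let mut best = node.style.as_deref();
        for b in name.bytes().rev() {
            match node.children.get(&b) {
                Some(next) => node = next,
                None => break,
            }
            if let Some(style) = node.style.as_deref() {
                best = Some(style);
            }
        }
        best
    }
}
```

With `.gz` and `.tar.gz` both inserted, looking up `archive.tar.gz` walks seven bytes from the end of the name and returns the style stored for `.tar.gz`, while `notes.gz` stops after three bytes. The lookup iterates over the name's bytes in place and does not allocate.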

matrixhead · 2024-07-05T12:06:14Z

woah 😃, this is much better!

tavianator · 2024-07-11T15:30:20Z

So it looks like this is about a 4% single-threaded perf win for fd --color=always:

| Command | Mean [s] | Min [s] | Max [s] | Relative |
|:---|---:|---:|---:|---:|
| `./fd-master -j1 --color=always --search-path ~/code/bfs/bench/corpus/rust` | 1.488 ± 0.009 | 1.471 | 1.500 | 1.04 ± 0.01 |
| `./fd-pr87 -j1 --color=always --search-path ~/code/bfs/bench/corpus/rust` | 1.424 ± 0.011 | 1.408 | 1.441 | 1.00 |

With multiple threads it's neutral; I guess suffix matching is off the critical path:

| Command | Mean [ms] | Min [ms] | Max [ms] | Relative |
|:---|---:|---:|---:|---:|
| `./fd-master --color=always --search-path ~/code/bfs/bench/corpus/rust` | 126.3 ± 3.2 | 117.8 | 130.5 | 1.00 |
| `./fd-pr87 --color=always --search-path ~/code/bfs/bench/corpus/rust` | 127.9 ± 4.0 | 120.6 | 135.3 | 1.01 ± 0.04 |

tavianator · 2024-07-15T21:58:49Z

The win is more significant for larger LS_COLORS. E.g. with LS_COLORS=$(vivid generate molokai):

| Command | Mean [s] | Min [s] | Max [s] | Relative |
|:---|---:|---:|---:|---:|
| `./fd-master -j1 --color=always --search-path ~/code/bfs/bench/corpus/rust` | 1.571 ± 0.016 | 1.551 | 1.594 | 1.14 ± 0.02 |
| `./fd-pr87 -j1 --color=always --search-path ~/code/bfs/bench/corpus/rust` | 1.382 ± 0.013 | 1.370 | 1.404 | 1.00 |

sharkdp · 2024-07-16T19:24:49Z

Thank you very much, this looks great.

Doing an "integration" benchmark on fd is a nice test & verification of your work, but I wonder if it would be more helpful to define two separate micro-benchmarks inside this crate, e.g. using criterion. I did something similar for another project yesterday and it usually just takes a few minutes to set this up (see tutorial here: https://bheisler.github.io/criterion.rs/book/user_guide/migrating_from_libtest.html#the-benchmark).

I imagine one benchmark would measure how long it takes to initialize a LsColors instance. And the other would focus on how fast we can match and colorize a given path. We could parametrize this on multiple inputs, e.g. short/medium/long LS_COLORS, and a few different classes of paths (short/long, with-match/without-match).

I'm not saying that you should do this, I just think it would give us a clearer picture of the situation. Performance work on this crate is extremely valuable. Initialization time has a direct impact on the startup time of many CLI programs. And the time to match a given path is crucial in programs like fd that need to print thousands of paths. Sorry if I digress, but I find it entertaining to think about the collectively saved time when doing even tiny performance improvements on those functions. And this seems to be a lot more than a tiny improvement. But at the moment, I can't really tell how this divides into initialization time and matching time.
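As a rough illustration of the split described above, the following sketch times initialization and matching separately using only `std::time::Instant`. It is a stand-in for a real criterion setup, which would add warm-up, statistics, and reporting. Every name here (`parse_suffixes`, `match_linear`, `time_it`, `run_benchmarks`) is a hypothetical placeholder, not part of the lscolors API, and the linear-scan matcher is only a naive baseline.

```rust
use std::hint::black_box;
use std::time::{Duration, Instant};

/// Hypothetical stand-in for initialization: collect the "*.ext=style"
/// entries of an LS_COLORS-like string. Other entries are skipped.
pub fn parse_suffixes(ls_colors: &str) -> Vec<(String, String)> {
    ls_colors
        .split(':')
        .filter_map(|entry| {
            let (key, style) = entry.split_once('=')?;
            let suffix = key.strip_prefix('*')?;
            Some((suffix.to_string(), style.to_string()))
        })
        .collect()
}

/// Hypothetical stand-in for matching: linear scan, first hit wins.
pub fn match_linear<'a>(table: &'a [(String, String)], path: &str) -> Option<&'a str> {
    table
        .iter()
        .find(|(suffix, _)| path.ends_with(suffix.as_str()))
        .map(|(_, style)| style.as_str())
}

/// Call `f` `iters` times and return the total elapsed wall-clock time.
pub fn time_it<T>(iters: u32, mut f: impl FnMut() -> T) -> Duration {
    let start = Instant::now();
    for _ in 0..iters {
        // black_box keeps the optimizer from discarding the result.
        black_box(f());
    }
    start.elapsed()
}

/// Measure initialization and matching separately for one input.
/// Returns (initialization time, matching time).
pub fn run_benchmarks(ls_colors: &str, paths: &[&str], iters: u32) -> (Duration, Duration) {
    let init = time_it(iters, || parse_suffixes(black_box(ls_colors)));
    let table = parse_suffixes(ls_colors);
    let matching = time_it(iters, || {
        paths
            .iter()
            .filter(|p| match_linear(&table, **p).is_some())
            .count()
    });
    (init, matching)
}
```

Calling `run_benchmarks` once per combination of short/medium/long `LS_COLORS` strings and path classes would give the parametrized picture suggested above, with the two durations reported separately.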

tavianator · 2024-07-16T20:51:55Z

Yeah, some microbenchmarks would be nice. I'm a little busy right now, but I can keep that on the back burner unless you get to it first!

matrixhead and others added 5 commits July 2, 2024 19:39

Refactor suffix mapping to use HashMap and HashSet for cas…

861034d

…e sensitivity handling

suffix mapping: reduced complexity; fix regression of suffix overriding

6c458e7

use max priority suffix when case doesn't matter

6f8a7a3

suffix mapping reduced complexity

73fb17b

Optimize suffix matching

21fdd06

matrixhead mentioned this pull request Jul 5, 2024

ls: gnu test case color-ext fix uutils/coreutils#6537

Merged

Avoid dynamic allocations when matching suffixes

d1ea403

sharkdp mentioned this pull request Jul 16, 2024

gnu compatiblity fix for case sensitivity handling of suffixes #86

Closed

sharkdp merged commit 430c591 into sharkdp:master Aug 19, 2024
48 checks passed

tavianator deleted the fast-extensions branch August 19, 2024 15:46

tavianator mentioned this pull request Sep 10, 2024

Sensitive cases #69

Closed
