Minor performance improvements #110

Colecf · 2024-01-22T05:33:06Z

Pass owned strings to id_from_canonical instead of references
Precalculate the length of evaluated strings, and reserve space in the result string
Only hash strings once in id_from_canonical
Switch some HashMaps to FxHashMaps

These changes give about a 20% performance improvement to bench_load_synthetic, and save a couple seconds off of loading the android ninja files. Most of this is from the switch to FxHashMap, the other changes are only around a 5% improvement.

If you have concerns about any of these commits let me know and I'll take those ones out of the PR.

id_from_canonical ideally takes owned strings instead of references to avoid a copy.

evmar · 2024-01-24T18:25:25Z

src/eval.rs

+                    let mut result = 0;
+                    for (i, env) in envs.iter().enumerate() {
+                        if let Some(v) = env.get_var(v.as_ref()) {
+                            result = v.calc_evaluated_length(&envs[i + 1..]);


This is better as return v.calc_eval... with a 0 return at the end of the loop

Good point, done.

evmar · 2024-01-24T18:31:13Z

src/graph.rs

+    /// need to create a new id, but would also be possible to create a version
+    /// of this function that accepts string references that is more optimized
+    /// for the case where the entry already exists. But so far, all of our
+    /// usages of this function have an owned string easily accessible anyways.


I think the case where this might come up is when loading .n2_db, where the string paths are stored canonicalized so we know they don't need any mutation as we parse them. But I think that's only worth really thinking about if/when it comes time to speed up that codepath. (This comment is just refreshing my memory on this.)

True, though currently read_str in db.rs reads owned strings. Using references would also be a tradeoff that requires keeping the db in memory, but we could do it.

The way it works is the db is read fully at startup and all the paths are mapped to ids as they're read, so it doesn't really need an owned string in there. But I think it's also probably not the slow part, yet...

evmar · 2024-01-24T18:33:14Z

src/load.rs

@@ -76,10 +65,19 @@ impl Loader {
        loader
    }

+    /// Convert a path string to a FileId.  For performance reasons
+    /// this requires an owned 'path' param.
+    fn path(&mut self, mut path: String) -> FileId {


It's too bad, I went to all the effort to make this area reuse a single String buffer, but I understand why it must be done. RIP

evmar · 2024-01-24T18:35:09Z

src/graph.rs

+    /// for the case where the entry already exists. But so far, all of our
+    /// usages of this function have an owned string easily accessible anyways.
+    pub fn id_from_canonical(&mut self, file: String) -> FileId {
+        // TODO: so many string copies :<


BTW the comment here was a reminder to myself around my struggle to not have so many string copies here. The hashmap is mapping string -> File, and file has a .name field which is a string, so in theory you don't need a second copy of that string as the hashmap key. But I couldn't get all the lifetimes to work when I tried that. Fixing that would mean we no longer need two copies of every path string and would also save a lot of allocations on the load path...

So that we can do one memory allocation for them.

Use HashMap.entry() instead of a lookup + insert.

FxHashMap has a faster hashing algorithm, at the expense of not being resistent to DOS attacks.

Pass owned paths to id_from_canonical

76eddcb

id_from_canonical ideally takes owned strings instead of references to avoid a copy.

Colecf force-pushed the minor_perf_improvements branch from 4dfa366 to a687379 Compare January 22, 2024 06:57

evmar reviewed Jan 24, 2024

View reviewed changes

Colecf added 3 commits January 24, 2024 13:51

Precalcuate length of evaluated strings

83a15dc

So that we can do one memory allocation for them.

Only hash strings once in id_from_canonical

9ea052e

Use HashMap.entry() instead of a lookup + insert.

Switch some HashMaps to FxHashMaps

a24eb9c

FxHashMap has a faster hashing algorithm, at the expense of not being resistent to DOS attacks.

Colecf force-pushed the minor_perf_improvements branch from a687379 to a24eb9c Compare January 24, 2024 21:56

evmar merged commit 668d9ab into evmar:main Jan 25, 2024
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Minor performance improvements #110

Minor performance improvements #110

Colecf commented Jan 22, 2024 •

edited

Loading

evmar Jan 24, 2024 •

edited

Loading

Colecf Jan 24, 2024

evmar Jan 24, 2024

Colecf Jan 24, 2024

evmar Jan 25, 2024

evmar Jan 24, 2024

evmar Jan 24, 2024

Minor performance improvements #110

Minor performance improvements #110

Conversation

Colecf commented Jan 22, 2024 • edited Loading

evmar Jan 24, 2024 • edited Loading

Choose a reason for hiding this comment

Colecf Jan 24, 2024

Choose a reason for hiding this comment

evmar Jan 24, 2024

Choose a reason for hiding this comment

Colecf Jan 24, 2024

Choose a reason for hiding this comment

evmar Jan 25, 2024

Choose a reason for hiding this comment

evmar Jan 24, 2024

Choose a reason for hiding this comment

evmar Jan 24, 2024

Choose a reason for hiding this comment

Colecf commented Jan 22, 2024 •

edited

Loading

evmar Jan 24, 2024 •

edited

Loading