`getSnippetHash`: Use regexp instead of parsing whole AST #759

julienduchesne · 2022-09-16T19:58:41Z

When calculating the env hash to use as cache keys, we don't need an exact list of imports
Therefore, to improve performance, we can replace the current strategy of using the AST by a regexp strategy.
This new strategy can lead to false positives (e.g. a string containing import 'foo') but this is not an issue for the hashing function.
All that matters is that the hash is consistent and accounts for all files (no false negatives)

On actual huge real-life environments, I have seen improvements of up to 95% (from 2s to 100ms)

When calculating the env hash to use as cache keys, we don't need an exact list of imports Therefore, to improve performance, we can replace the current strategy of using the AST by a regexp strategy. This new strategy can lead to false positives (e.g. a string containing import 'foo') but this is not an issue for the hashing function. All that matters is that the hash is consistent and accounts for all files (no false negatives)

github-actions · 2022-09-16T20:00:17Z

Benchstat (compared to main):

name              old time/op    new time/op    delta
GetSnippetHash-2    40.7ms ± 4%    20.4ms ± 5%  -49.88%  (p=0.000 n=10+10)

name              old alloc/op   new alloc/op   delta
GetSnippetHash-2    14.1MB ± 2%     3.9MB ± 0%  -72.74%  (p=0.000 n=10+10)

name              old allocs/op  new allocs/op  delta
GetSnippetHash-2      153k ± 1%       32k ± 0%  -78.94%  (p=0.000 n=10+10)

Duologic

LGTM

inkel

Left a simple question. Overall LGTM. The performance improvements are amazing! I'm pretty sure the speed up is because of the reduction in the number of allocations, though I could be wrong.

pkg/jsonnet/imports.go

julienduchesne · 2022-09-16T20:19:21Z

Left a simple question. Overall LGTM. The performance improvements are amazing! I'm pretty sure the speed up is because of the reduction in the number of allocations, though I could be wrong.

Yeah, the issue with the AST is that you have to parse all of it and traverse all nodes

Made a stupid mistake in the previous PR: #759 This fixes it and adds another benchmark test to ensure it doesn't happen again. I also removed the Github Actions benchmark test, as it's not really useful, anytime we change the tests, we'll get erroneous results which will be annoying. Instead, I added the benchmark tests to the Drone run, we can compare whenever we want.

* Fix `getSnippetHash` not considering all files Made a stupid mistake in the previous PR: #759 This fixes it and adds another benchmark test to ensure it doesn't happen again. I also removed the Github Actions benchmark test, as it's not really useful, anytime we change the tests, we'll get erroneous results which will be annoying. Instead, I added the benchmark tests to the Drone run, we can compare whenever we want. * linting * Add changelog, will release straight away

julienduchesne marked this pull request as ready for review September 16, 2022 20:02

julienduchesne requested review from Duologic, sh0rez, a team and inkel September 16, 2022 20:02

Duologic approved these changes Sep 16, 2022

View reviewed changes

inkel reviewed Sep 16, 2022

View reviewed changes

pkg/jsonnet/imports.go Outdated Show resolved Hide resolved

Check for Abs errors

e999249

julienduchesne force-pushed the julienduchesne/get-snippet-hash branch from d64804b to e999249 Compare September 19, 2022 12:11

julienduchesne merged commit 0aba526 into main Sep 19, 2022

julienduchesne deleted the julienduchesne/get-snippet-hash branch September 19, 2022 12:14

julienduchesne mentioned this pull request Sep 19, 2022

Improve importRecursive performance with a regexp #755

Closed

julienduchesne mentioned this pull request Sep 27, 2022

Fix getSnippetHash not considering all files #765

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`getSnippetHash`: Use regexp instead of parsing whole AST #759

`getSnippetHash`: Use regexp instead of parsing whole AST #759

julienduchesne commented Sep 16, 2022 •

edited

Loading

github-actions bot commented Sep 16, 2022 •

edited

Loading

Duologic left a comment

inkel left a comment

julienduchesne commented Sep 16, 2022

getSnippetHash: Use regexp instead of parsing whole AST #759

getSnippetHash: Use regexp instead of parsing whole AST #759

Conversation

julienduchesne commented Sep 16, 2022 • edited Loading

github-actions bot commented Sep 16, 2022 • edited Loading

Duologic left a comment

Choose a reason for hiding this comment

inkel left a comment

Choose a reason for hiding this comment

julienduchesne commented Sep 16, 2022

`getSnippetHash`: Use regexp instead of parsing whole AST #759

`getSnippetHash`: Use regexp instead of parsing whole AST #759

julienduchesne commented Sep 16, 2022 •

edited

Loading

github-actions bot commented Sep 16, 2022 •

edited

Loading