perf: Performance improvements #4959

TimBeyer · 2023-08-18T09:55:00Z

What this PR does / why we need it:

This PR adds a bunch of performance improvements, and replaces Bluebird with native promises.

Bluebird

Let's start off with Bluebird: The bluebird repo itself recommends moving to native promises and bluebird is no longer seeing any changes. Meanwhile native promises have seen big performance improvements and also usually show up better in stack traces and in performance profiles.

Repo scan tree indexing

The main performance improvement comes for GARDEN_GIT_SCAN_MODE=repo.
It now indexes the list of files into a tree structure for faster access yielding large improvements in graph resolution speeds for large projects.

Here's a comparison between the current main version and the updated one, run on an M2 mac on a repo of 11032 files with 500 modules.

Using main:

time GARDEN_GIT_SCAN_MODE=repo gdev validate
Validate ✔️

Project is configured with `apiVersion: garden.io/v0`, running with backwards compatibility.
ℹ garden               → Running in Garden environment dev.default
Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
ℹ providers            → Getting status...
✔ providers            → Cached (took 2.1 sec)
ℹ providers            → Run with --force-refresh to force a refresh of provider statuses.
ℹ graph                → Resolving actions and modules...
ℹ graph                → Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
✔ graph                → Done (took 16.5 sec)

OK ✔️
GARDEN_GIT_SCAN_MODE=repo ~/Development/garden/garden/bin/garden validate  76.32s user 7.61s system 133% cpu 1:02.88 total

Using this PR:

time GARDEN_GIT_SCAN_MODE=repo gdev validate
Validate ✔️

Project is configured with `apiVersion: garden.io/v0`, running with backwards compatibility.
ℹ garden               → Running in Garden environment dev.default
Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
ℹ providers            → Getting status...
✔ providers            → Cached (took 2.1 sec)
ℹ providers            → Run with --force-refresh to force a refresh of provider statuses.
ℹ graph                → Resolving actions and modules...
ℹ graph                → Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
✔ graph                → Done (took 8 sec)

OK ✔️
GARDEN_GIT_SCAN_MODE=repo ~/Development/garden/garden/bin/garden validate  64.68s user 6.62s system 136% cpu 52.410 total

As you can see the overall performance improvements are still moderate, but the graph resolution is almost 2x faster.
This makes the repo scan mode the most performant option especially when running in our single binary build which seems to have a lot of overhead for the default scan mode.

For comparison here are both scan modes with our current 0.13.12 binary version on the same repo:

time GARDEN_GIT_SCAN_MODE=repo garden validate
Validate ✔️

Project is configured with `apiVersion: garden.io/v0`, running with backwards compatibility.
ℹ garden               → Running in Garden environment dev.default
Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
ℹ providers            → Getting status...
✔ providers            → Cached (took 2.2 sec)
ℹ providers            → Run with --force-refresh to force a refresh of provider statuses.
ℹ graph                → Resolving actions and modules...
ℹ graph                → Scanning repository at /Users/tim/Development/garden/garden-large-repo-generator
✔ graph                → Done (took 15.9 sec)

OK ✔️
GARDEN_GIT_SCAN_MODE=repo garden validate  78.09s user 8.10s system 138% cpu 1:02.22 total

time garden validate
Validate ✔️

Project is configured with `apiVersion: garden.io/v0`, running with backwards compatibility.
ℹ garden               → Running in Garden environment dev.default
ℹ providers            → Getting status...
✔ providers            → Cached (took 2.1 sec)
ℹ providers            → Run with --force-refresh to force a refresh of provider statuses.
ℹ graph                → Resolving actions and modules...
✔ graph                → Done (took 26.2 sec)

OK ✔️
garden validate  88.09s user 28.67s system 160% cpu 1:12.63 total

You can see that the repo scan mode shows identical performance to the version running just on node for development.
Meanwhile graph resolution on the default scan mode with the binary takes 26 seconds to resolve the graph while just on node it's around 15 seconds (not in the outputs above since I didn't want to add even more noisy text here).

So the new implementation with repo mode should outperform the default scan mode in the binary by a good 3x still when it comes to graph resolution.

GC Improvements

We increase max-semi-space to 64M which should lead to less GC pauses and better performance at the cost of slightly higher memory consumption.

See https:/nodejs/node/blob/main/doc/api/cli.md#useful-v8-options
Also see https://www.alibabacloud.com/blog/better-node-application-performance-through-gc-optimization_595119 and nodejs/node#42511 for some details on impact.

Which issue(s) this PR fixes:

Fixes #

Special notes for your reviewer:

TimBeyer · 2023-08-18T11:06:12Z

core/src/vcs/file-tree.ts

+
+ const relativePath = file.path.slice(this.ownPath.length)
+ // We use absolute paths so the first part of the split is always an empty string
+ const [_, subpathSegment, nextSegment] = relativePath.split(path.sep)


Here I wasn't sure if we normalize everything to POSIX paths also for glob compatibility, or if we do need to use path.sep so that windows paths get split correctly as well.

That probably would also impact the .startsWith part above.

vvagaytsev · 2023-08-18T13:01:18Z

Wow! This is super awesome! 🚀 Huge thanks for such a great job! ✨
I need more time to take a closer look in review it in detail :)

core/src/commands/custom.ts

core/src/commands/workflow.ts

core/src/garden.ts

core/src/commands/cloud/users/users-create.ts

vvagaytsev

Amazing work! Thank you! I've left a few non-blocking comments, LGTM! 💯

…ry more quickly

…ost of some more memory See https:/nodejs/node/blob/main/doc/api/cli.md#useful-v8-options Also see https://www.alibabacloud.com/blog/better-node-application-performance-through-gc-optimization_595119 and nodejs/node#42511 for some details on impact

vvagaytsev

💯 🚀

TimBeyer requested review from edvald and thsig August 18, 2023 09:56

TimBeyer marked this pull request as ready for review August 18, 2023 09:57

TimBeyer requested a review from a team August 18, 2023 10:47

TimBeyer commented Aug 18, 2023

View reviewed changes

vvagaytsev reviewed Aug 21, 2023

View reviewed changes

core/src/commands/custom.ts Show resolved Hide resolved

vvagaytsev reviewed Aug 21, 2023

View reviewed changes

core/src/commands/workflow.ts Show resolved Hide resolved

vvagaytsev reviewed Aug 21, 2023

View reviewed changes

core/src/garden.ts Show resolved Hide resolved

vvagaytsev reviewed Aug 21, 2023

View reviewed changes

core/src/commands/cloud/users/users-create.ts Outdated Show resolved Hide resolved

vvagaytsev previously approved these changes Aug 21, 2023

View reviewed changes

TimBeyer added 12 commits August 22, 2023 14:23

perf: speed up splitLast function

7da66c5

perf: skip function wrapping if profiler is disabled

ac96beb

perf: index files into a tree structure to get files for a subdirecto…

b5b9f51

…ry more quickly

refactor: replace Bluebird with native Promises (part 1)

210bd20

refactor: replace bluebird with native promises (part 2)

5a02b5b

refactor: replace bluebird with native promises (part 3)

4890036

chore: make the linter happy after all the bluebird replacements

201acbe

fix: splitLast behavior with no index found

7f71b1b

chore: linter again

f498c63

chore: cleanup file-tree and add header

4d55320

refactor: replace pLimit with pMap

3e643e7

TimBeyer dismissed vvagaytsev’s stale review via 3e643e7 August 22, 2023 12:30

TimBeyer force-pushed the perf/random-improvements branch from d256c3b to 3e643e7 Compare August 22, 2023 12:30

vvagaytsev approved these changes Aug 22, 2023

View reviewed changes

TimBeyer added 2 commits August 22, 2023 15:18

Merge branch 'main' into perf/random-improvements

97dbcca

Merge branch 'main' into perf/random-improvements

b99b50a

vvagaytsev merged commit a2c5f6e into main Aug 22, 2023
2 checks passed

vvagaytsev deleted the perf/random-improvements branch August 22, 2023 14:03

hnicke mentioned this pull request Sep 6, 2023

0.13: [Bug]: update-remote: --parallel flag broken #5035

Closed

Dunemask mentioned this pull request Oct 21, 2023

BUG: deploy & run with kubernetes-module type do not copy / prepare secrets #3894

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: Performance improvements #4959

perf: Performance improvements #4959

TimBeyer commented Aug 18, 2023 •

edited

Loading

TimBeyer Aug 18, 2023

vvagaytsev commented Aug 18, 2023

vvagaytsev left a comment

vvagaytsev left a comment

perf: Performance improvements #4959

perf: Performance improvements #4959

Conversation

TimBeyer commented Aug 18, 2023 • edited Loading

Bluebird

Repo scan tree indexing

GC Improvements

TimBeyer Aug 18, 2023

Choose a reason for hiding this comment

vvagaytsev commented Aug 18, 2023

vvagaytsev left a comment

Choose a reason for hiding this comment

vvagaytsev left a comment

Choose a reason for hiding this comment

TimBeyer commented Aug 18, 2023 •

edited

Loading