Remove usage of [AggressiveOptimization] #58209

steveharter · 2021-08-26T19:35:07Z

Revert the 4 usages of [AggressiveOptimization].

The 2 in ObjectDefaultConverter that were reverted have loops which cause the jitter to re-jit them anyways, so the use of [AggressiveOptimization] really shouldn't do anything.

The other 2 in JsonConverter<T> that remain do seem get up to a 3% benefit but only some of the time, and only measured against a local release build that is a non-R2R (non-crossgen2'd) STJ.dll image. In theory there shouldn't be a perf improvement since a R2R image automatically did the equivalent of [AggressiveOptimization] for all methods. When obtaining the fastest time from several runs, the benchmark (which measures serializer overhead) is faster with [AO], however the variability of these benchmarks overlap and are difficult to measure consistently. Perhaps with [AO] there is sometimes closer\faster memory locality with other methods.

Removing [AO] also has the benefit of supporting a "DynamicPGO" explicit mode that offers "best perf but slow start".

See #57327 (comment)

I don't believe the changes to remove [AO] meet the bar to port to V6 at this time, since R2R performance should be the same in the vast majority because a "DynamicPGO" mode would need to be turned on explicitly.

ghost · 2021-08-26T19:35:13Z

Tagging subscribers to this area: @eiriktsarpalis, @layomia
See info in area-owners.md if you want to be subscribed.

Issue Details

Revert the 4 usages of [AggressiveOptimization].

The 2 in ObjectDefaultConverter that were reverted have loops which cause the jitter to re-jit them anyways, so the use of [AggressiveOptimization] really shouldn't do anything.

The other 2 in JsonConverter<T> that remain do seem get up to a 3% benefit but only some of the time, and only measured against a local release build that is a non-R2R (non-crossgen2'd) STJ.dll image. In theory there shouldn't be a perf improvement since a R2R image automatically did the equivalent of [AggressiveOptimization] for all methods. When obtaining the fastest time from several runes, the benchmark (which measures serializer overhead) is faster with [AO], however the variability of these benchmarks overlap and are difficult to measure consistently. Perhaps with [AO] there is sometimes closer\faster memory locality with other methods.

Removing [AO] also has the benefit of supporting a "DynamicPGO" explicit mode that offers "best perf but slow start".

See #57327 (comment)

I don't believe the changes to remove [AO] meet the bar to port to V6 at this time, since R2R performance should be the same in the vast majority because a "DynamicPGO" mode would need to be turned on explicitly.

Author:	steveharter
Assignees:	steveharter
Labels:	`area-System.Text.Json`, `tenet-build-performance`
Milestone:	7.0.0

eiriktsarpalis

Do we need to backport to 6.0?

AndyAyersMS

Some notes on your comments above:

AO prevents methods from being compiled in R2R. It also causes methods to bypass tiering and always be jitted with optimization. We generally advocate using it only if running an unoptimized or R2R version of the method is unacceptable. Because AO methods bypass tiering, we cannot gather any PGO data for these methods (neither static or dynamic).
By default, methods with loops will also bypass tiering and are always jitted with optimization. But this default behavior is overridden when gathering static PGO data. So it's possible once you remove AO from these loop methods and we get static PGO data for them, the non-AO versions will be faster than the AO versions. You should also see slightly faster startup behavior because of less jitting.
Removing AO from the non-loop methods should likewise enable static PGO data collection and potentially better perf once the static PGO collection process catches up. And also will provide a small startup benefit.
Generally speaking it is rare to be benchmarking R2R code; typically this will only happen if you use some custom benchmark harness or disable tiered compilation. If you benchmark the R2R code, it is typically 20% or so slower than the jitted counterpart.

stephentoub · 2021-08-27T11:13:48Z

/backport to release/6.0

github-actions · 2021-08-27T11:14:02Z

Started backporting to release/6.0: https:/dotnet/runtime/actions/runs/1174017757

stephentoub · 2021-08-28T10:46:14Z

@AndyAyersMS, do you know why all of these are AggressiveOptimization?

Only one (the async one) has a comment indicating its purpose. Maybe they should be revisited (or at least commented to say why they're necessary)?

EgorBo · 2021-08-28T10:59:49Z

@AndyAyersMS, do you know why all of these are AggressiveOptimization?

Only one (the async one) has a comment indicating its purpose. Maybe they should be revisited (or at least commented to say why they're necessary)?

My 5 cents: From my understanding there two major excuses for AggressiveOptimization to exist in BCL:

Sometimes tier1 optimizations are able to eliminate allocations and we don't want some specific code to allocate even in tier0.
(some code might stuck in a hot path in tier0 forever with DOTNET_TC_QuickJitForLoops=1 - we promote this variable for faster startup and DynamicPGO cases)
The code is already heavily optimized by hands and we don't want JIT to re-order it with PGO (because PGO sometimes can optimize a method for some specific use-case during startup, but all other call-sites will regress - a good example is all functions in CastHelpers.

EgorBo · 2021-08-28T11:03:28Z

A good example for 1) is this code:

codegen for tier0 is terrible. Two allocations (we box both values) 🙂

AndyAyersMS · 2021-09-02T17:55:26Z

@AndyAyersMS, do you know why all of these are AggressiveOptimization?

I don't, no. Probably worth reviewing.

IIRC we mostly wanted to use AO to eliminate box allocation from the async machinery, since we otherwise could see 100MB's of boxing from Tier0 code before things got rejitted.

Remove usage of [AggressiveOptimization]

88d03e4

steveharter added area-System.Text.Json tenet-build-performance Impacts build time: official, developer or CI labels Aug 26, 2021

steveharter added this to the 7.0.0 milestone Aug 26, 2021

steveharter requested review from EgorBo, eiriktsarpalis and AndyAyersMS August 26, 2021 19:35

steveharter requested a review from layomia as a code owner August 26, 2021 19:35

steveharter self-assigned this Aug 26, 2021

steveharter mentioned this pull request Aug 26, 2021

Improve serializer performance #57327

Merged

eiriktsarpalis approved these changes Aug 26, 2021

View reviewed changes

AndyAyersMS approved these changes Aug 26, 2021

View reviewed changes

stephentoub approved these changes Aug 27, 2021

View reviewed changes

stephentoub merged commit ba5dedd into dotnet:main Aug 27, 2021

github-actions bot mentioned this pull request Aug 27, 2021

[release/6.0] Remove usage of [AggressiveOptimization] #58253

Merged

steveharter deleted the STJPerfcls branch August 27, 2021 15:41

ghost locked as resolved and limited conversation to collaborators Oct 2, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove usage of [AggressiveOptimization] #58209

Remove usage of [AggressiveOptimization] #58209

steveharter commented Aug 26, 2021 •

edited

Loading

ghost commented Aug 26, 2021

eiriktsarpalis left a comment

AndyAyersMS left a comment

stephentoub commented Aug 27, 2021

github-actions bot commented Aug 27, 2021

stephentoub commented Aug 28, 2021

EgorBo commented Aug 28, 2021 •

edited

Loading

EgorBo commented Aug 28, 2021 •

edited

Loading

AndyAyersMS commented Sep 2, 2021

Remove usage of [AggressiveOptimization] #58209

Remove usage of [AggressiveOptimization] #58209

Conversation

steveharter commented Aug 26, 2021 • edited Loading

ghost commented Aug 26, 2021

eiriktsarpalis left a comment

Choose a reason for hiding this comment

AndyAyersMS left a comment

Choose a reason for hiding this comment

stephentoub commented Aug 27, 2021

github-actions bot commented Aug 27, 2021

stephentoub commented Aug 28, 2021

EgorBo commented Aug 28, 2021 • edited Loading

EgorBo commented Aug 28, 2021 • edited Loading

AndyAyersMS commented Sep 2, 2021

steveharter commented Aug 26, 2021 •

edited

Loading

EgorBo commented Aug 28, 2021 •

edited

Loading

EgorBo commented Aug 28, 2021 •

edited

Loading