Metric benchmark investigation #5544

dashpole · 2024-06-25T17:01:24Z

Looking into #5542. Closing, as this is not meant to be merged.

My original local benchmark:

$ go test -benchmem -benchtime=2s -bench=Bench .
goos: linux
goarch: amd64
pkg: go.opentelemetry.io/opentelemetry-go/sdk/benchmark
cpu: AMD EPYC 7B12
BenchmarkPrometheusCounter-24       	762422826	         3.154 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounter-24             	21401206	       113.2 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounterWithLabel-24    	 8772984	       271.5 ns/op	      16 B/op	       1 allocs/op

Exemplar collection accounts for ~50ns of the overhead (a1bead9), even though I believe we shouldn't be collecting exemplars by default, and we aren't doing tracing. This is probably a good area to optimize.

BenchmarkOTELCounter-24             	42491396	        56.06 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounterWithLabel-24    	10961318	       220.1 ns/op	      16 B/op	       1 allocs/op

Attribute cardinality limiting seems to account for a very small (~2ns) portion of the overhead.

Lookup based on the attribute set accounts for ~40ns of the overhead for the no-attributes case, and the vast majority of the overhead for the with-attributes case. OTel would need to introduce bound instruments to remove this chunk of overhead.

BenchmarkOTELCounter-24             	148504042	        16.23 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounterWithLabel-24    	38502283	        61.73 ns/op	      16 B/op	       1 allocs/op

Our counter increment function (with locking) accounts for ~8ns of the overhead. We use a simple lock and increment a counter value. Prometheus appears to have implemented some optimizations for this. Benchmarks without any measurement whatsoever:

BenchmarkOTELCounter-24             	299598176	         7.974 ns/op	       0 B/op	       0 allocs/op
BenchmarkOTELCounterWithLabel-24    	43538810	        55.06 ns/op	      16 B/op	       1 allocs/op

The remaining overhead is from the API, and from the Options pattern which requires calling NewAddConfig. This would presumably be eliminated if instruments were already bound to attributes.

BenchmarkOTELCounter-24 42491396 56.06 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 10961318 220.1 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 42471098 55.09 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 11157889 214.9 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 148504042 16.23 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 38502283 61.73 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 299598176 7.974 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 43538810 55.06 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 377620942 6.333 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 47813799 52.70 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 469475085 5.119 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 46523139 51.32 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 464676590 5.082 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 48377798 51.73 ns/op 16 B/op 1 allocs/op BenchmarkNoOpOTELCounter-24 1000000000 1.901 ns/op 0 B/op 0 allocs/op BenchmarkNoOpOTELCounterWithLabel-24 64543243 34.60 ns/op 16 B/op 1 allocs/op

BenchmarkOTELCounter-24 685085875 3.493 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 66929808 36.21 ns/op 16 B/op 1 allocs/op BenchmarkNoOpOTELCounter-24 1000000000 1.901 ns/op 0 B/op 0 allocs/op BenchmarkNoOpOTELCounterWithLabel-24 66596236 35.09 ns/op 16 B/op 1 allocs/op

dashpole added 11 commits June 25, 2024 15:09

benchmarks from jaeger

b3fb091

remove unneccessary registry

c0816d5

remove unneccessary otel prometheus exporter

3c5c699

disable exemplar Offer for sums

a1bead9

BenchmarkOTELCounter-24 42491396 56.06 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 10961318 220.1 ns/op 16 B/op 1 allocs/op

remove attribute limiting

e0bca4a

BenchmarkOTELCounter-24 42471098 55.09 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 11157889 214.9 ns/op 16 B/op 1 allocs/op

estimate bound instrument implementation performance

b8e2b15

BenchmarkOTELCounter-24 148504042 16.23 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 38502283 61.73 ns/op 16 B/op 1 allocs/op

show all non-measure overhead

13c9972

BenchmarkOTELCounter-24 299598176 7.974 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 43538810 55.06 ns/op 16 B/op 1 allocs/op

remove measure function

04574fc

BenchmarkOTELCounter-24 377620942 6.333 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 47813799 52.70 ns/op 16 B/op 1 allocs/op

assume single measure function (no multiple views)

f50c287

BenchmarkOTELCounter-24 469475085 5.119 ns/op 0 B/op 0 allocs/op BenchmarkOTELCounterWithLabel-24 46523139 51.32 ns/op 16 B/op 1 allocs/op

dashpole closed this Jun 25, 2024

dashpole mentioned this pull request Jun 25, 2024

Performance vs. Prometheus SDK #5542

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Metric benchmark investigation #5544

Metric benchmark investigation #5544

dashpole commented Jun 25, 2024 •

edited

Loading

Metric benchmark investigation #5544

Metric benchmark investigation #5544

Conversation

dashpole commented Jun 25, 2024 • edited Loading

dashpole commented Jun 25, 2024 •

edited

Loading