Configuring histograms in the API (hints?) #2229

mateuszrzeszutek · 2021-12-17T15:27:44Z

What are you trying to achieve?

I'm working on the micrometer->OTel bridge instrumentation in the javaagent. Some of the micrometer instruments translate to OTel concepts in a rather straightforward way, but some of them are pretty different. I had the most problems with histograms (used in e.g. micrometer Timer).
Histograms in micrometer are fully configurable via the micrometer API. Micrometer supports calculating percentiles on the client side, setting histogram buckets, rotating/expiring histogram buckets, and some other options that I don't fully comprehend yet. Meanwhile, you can't configure histograms in the OTel metrics API at all. You can use Views, but those live in the SDK, and pretty much the only thing you can configure in a view is the set of explicit bucket boundaries.
For now, in my instrumentation I can just fall back to the micrometer way of collecting histogram data (which basically is two sets of gauges with different tags for percentiles/count buckets) - but perhaps we should introduce a way to configure these things in the metrics API (hints)?

Additional context.

Micrometer bridge PR: open-telemetry/opentelemetry-java-instrumentation#4919

CC @jsuereth


bogdandrutu · 2022-01-03T23:14:17Z

See #2232 (comment)

reyang · 2022-01-11T00:50:30Z

Related to #1753 (Metrics Hint API).

weyert · 2022-06-21T09:20:18Z

The inability to define histogram bucket sizes in the API is quite an inconvenience. As the consumer of a library (e.g. express metrics, host metrics), you need to be aware that:

- The library has histograms
- You need to know the expected bucket sizes of the histograms
- Defining the buckets for a histogram in the SDK requires the name of the metric

That is quite a few steps when you just want to 'throw' in an Express middleware which creates Express metrics like http request/response size and http duration, or the host metrics.

I made this mistake and now I have a mismatch of bucket sizes for the same metrics in Prometheus. I'm not sure how to solve that at the moment.

jsuereth · 2022-11-29T16:38:21Z

Another user request for this: https://cloud-native.slack.com/archives/C014L2KCTE3/p1669739859064699?thread_ts=1669652480.479969&cid=C014L2KCTE3

jmacd · 2022-11-29T19:35:38Z

To get Lightstep onto using the OTel APIs I had to add several off-spec metrics features (mainly the synchronous gauge aggregation), which would ideally be expressed via hints. The powerful thing about an API-level hint is that it allows configuring things when the instrument is created as opposed to when the SDK is started. For the Lightstep metrics SDK we support configuring the exponential histogram size, e.g., https:/lightstep/otel-launcher-go/tree/main/lightstep/sdk/metric#metric-instrument-hints-api

asafm · 2022-12-04T14:39:24Z

I left a comment in Slack, and I'm also leaving it here.

When you create a bucket-based histogram, you use the API, in which you don’t specify it’s a bucket-based histogram.
You have to do that via a view, which in the Java SDK seems to be done in the initialization of your app.
So it seems that defining a bucket-based histogram, with the buckets you’d like, is split into two separate locations - the actual file where the logic using the histogram is and the app in a file in which you define the view and specify the name of the histogram.
I’m contrasting this with how I’m used to working with metrics in DropWizard / Prometheus Simple Client and other metrics libraries in which all is done in place.

I understand the power of overriding buckets/aggregation type for a given instrument if you do not own that instrument - i.e., in a library you use in your app. Yet, as a user - be it the library writer or just you, the app developer: you just create an instrument, like a histogram, and you would like to define its aggregation in the exact location when it's created - it's the natural place for this. My only guess is that 80% of the time, you'd be doing that, and only 20% of the time you'll override the aggregation type.
Moreover, as an application developer, you need to start creating constants to share the metric name, so that if the metric name is changed, the aggregation type associated with it (in the view declaration) still matches. It seems like there is incidental complexity here.

Therefore I think enabling the user to specify the aggregation type and configure it seems pretty important to exist in the API itself.

jack-berg · 2022-12-20T23:26:14Z

As I'm reading through the feedback I'm noticing that virtually all of it is related to the lack of ability to define histogram configuration options at the point of instrumentation.

What if we narrow the scope (as the PR title suggests) and instead of designing a hint API just fix the histogram instrument?

When you think about it, API users are actually already making "hints" to the SDK by their choice of instrument. By using a counter you're hinting that a sum is probably sufficient even though the SDK could keep a full histogram to track the distribution of measurements. By recording a set of attributes you're hinting at the set of dimensions that are likely to be useful, even though the SDK may drop some to reduce cardinality.

Today, histogram creation accepts name, description (optional), unit (optional), and value_type as arguments. We could add an additional optional argument called something like histogram_aggregation_options.

histogram_aggregation_options could consist of the type of histogram to use (exponential bucket or explicit bucket) as well as options for the selected type (max size for exponential, bucket boundaries for explicit).

The main question would be to determine how these options interact with the SDK's logic for determining aggregation, which involves MetricReader's default output aggregation and Views. I would suggest solving this by adjusting the definition of the Default Aggregation to incorporate the options specified by the API caller when creating the histogram instrument.

The result would be:

- If the instrument matches one or more view's selection criteria, use the aggregation specified by the view.
- Else if the metric reader (or associated metric exporter) specifies a default output aggregation for the instrument type, use it.
- Else use the default aggregation, which incorporates the options specified by the API caller when creating the instrument.
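The resolution order in the list above can be sketched in a few lines of plain Java. This is only an illustration under stated assumptions: `AggregationResolutionSketch`, `Resolved`, `resolve`, and `FALLBACK_BOUNDARIES` are hypothetical names, not part of any OpenTelemetry API or SDK, and the sketch reduces the proposed options to explicit bucket boundaries only.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Optional;

// Hypothetical model of the proposed resolution order. None of these
// types belong to the OpenTelemetry API or SDK.
final class AggregationResolutionSketch {

    /** The bucket boundaries that were chosen and the layer that supplied them. */
    static final class Resolved {
        final String source;
        final List<Double> boundaries;

        Resolved(String source, List<Double> boundaries) {
            this.source = source;
            this.boundaries = boundaries;
        }
    }

    // Placeholder fallback boundaries, chosen only for this illustration.
    static final List<Double> FALLBACK_BOUNDARIES = Arrays.asList(0.0, 5.0, 10.0, 25.0);

    static Resolved resolve(
            Optional<List<Double>> fromView,
            Optional<List<Double>> fromReader,
            Optional<List<Double>> fromInstrument) {
        // 1. A matching view always wins.
        if (fromView.isPresent()) {
            return new Resolved("view", fromView.get());
        }
        // 2. Otherwise the reader's default output aggregation applies.
        if (fromReader.isPresent()) {
            return new Resolved("reader", fromReader.get());
        }
        // 3. Otherwise the default aggregation applies, incorporating the
        //    options given at instrument creation when there are any.
        if (fromInstrument.isPresent()) {
            return new Resolved("instrument", fromInstrument.get());
        }
        return new Resolved("fallback", FALLBACK_BOUNDARIES);
    }

    public static void main(String[] args) {
        // No view and no reader preference: the instrument's own options are used.
        Resolved r = resolve(Optional.empty(), Optional.empty(),
                Optional.of(Arrays.asList(10.0, 20.0, 30.0)));
        System.out.println(r.source + " " + r.boundaries);
    }
}
```

In this model the options given at instrument creation only take effect when neither a view nor the reader expresses a preference, which is what keeps them a default rather than an override.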

What do folks think? To me this seems like a pretty tractable way to provide a lot of the value, without boiling the ocean.

tsloughter · 2022-12-21T20:29:01Z

> When you think about it, API users are actually already making "hints" to the SDK by their choice of instrument.

I just wanted to second this important point. There is no guarantee your histogram will even be aggregated as a histogram. Adding an additional option to instrument creation that is passed to the default aggregation for the instrument type seems clean and clear, and is what people are asking for.

asafm · 2022-12-22T10:21:20Z

> The result would be:
>
> - If the instrument matches one or more view's selection criteria, use the aggregation specified by the view.
> - Else if the metric reader (or associated metric exporter) specifies a default output aggregation for the instrument type, use it.
> - Else use the default aggregation, which incorporates the options specified by the API caller when creating the instrument.

The way I see it:
Specifying it on the instrumentation level presents the default - meaning, if no other override mechanism was specified, this should be used.
I guess my problem is with the next in the chain, as you mentioned: the reader. When you say, "This is the default for a certain instrumentation type", for me, it means: if no aggregation were defined, we'd default to what I'm configuring here in the reader. So I wouldn't see it as an override to what the user says but as a default.

Next is the view mechanism, which, by definition, is an override (if you use the same name, that is).

tsloughter · 2022-12-22T11:27:22Z

The reader basically defines what an instrument's aggregation is, so the defaults are what the reader considers defaults. The view is the only thing that can define a different aggregation.

asafm · 2022-12-26T14:35:41Z

One question popped into my mind: Will Hints be typed? If so, will the aggregations allowed be constrained only to the ones specified here, i.e., Explicit Bucket and Exponential Bucket? I'm asking because I was thinking of creating an SDK for my purposes which will require using the Summary type of aggregation, and I wanted to know if I can rely on the API for creating instruments for my needs.

jack-berg · 2022-12-27T14:53:51Z

> I'm asking because I was thinking of creating an SDK for my purposes which will require using the Summary type of aggregation, and I wanted to know if I can rely on the API for creating instruments for my needs.

If you're referring to an aggregation that produces summary type metrics, that wouldn't be possible because there currently is no summary aggregation. However, you can produce a histogram that looks very similar to a summary by configuring an empty list of bucket boundaries, putting all measurements in a single bucket and providing: min, max, sum, count.
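To make the empty-boundaries case concrete, here is a minimal sketch of an explicit bucket histogram in plain Java. It is not the SDK's implementation: `SketchHistogram` and its methods are hypothetical names, and it assumes upper-inclusive buckets, i.e. `(-inf, b0], (b0, b1], ..., (bn, +inf)`.

```java
import java.util.Collections;
import java.util.List;

// Hypothetical, simplified explicit bucket histogram (not the SDK's class).
final class SketchHistogram {
    private final double[] boundaries; // sorted, upper-inclusive bucket bounds
    private final long[] counts;       // always boundaries.length + 1 buckets
    private long count;
    private double sum;
    private double min = Double.POSITIVE_INFINITY;
    private double max = Double.NEGATIVE_INFINITY;

    SketchHistogram(List<Double> boundaries) {
        this.boundaries = boundaries.stream().mapToDouble(Double::doubleValue).toArray();
        this.counts = new long[this.boundaries.length + 1];
    }

    void record(double value) {
        // Find the first bucket whose upper bound is >= value.
        int i = 0;
        while (i < boundaries.length && value > boundaries[i]) {
            i++;
        }
        counts[i]++;
        count++;
        sum += value;
        min = Math.min(min, value);
        max = Math.max(max, value);
    }

    long[] bucketCounts() { return counts.clone(); }
    long count() { return count; }
    double sum() { return sum; }
    double min() { return min; }
    double max() { return max; }

    public static void main(String[] args) {
        // With no boundaries there is exactly one bucket, (-inf, +inf).
        SketchHistogram h = new SketchHistogram(Collections.emptyList());
        h.record(1.5);
        h.record(4.0);
        h.record(0.5);
        System.out.println(h.bucketCounts().length + " bucket, count=" + h.count()
                + ", sum=" + h.sum() + ", min=" + h.min() + ", max=" + h.max());
    }
}
```

With an empty boundary list every measurement lands in the single bucket, so the only information left is count, sum, min, and max.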

asafm · 2022-12-27T15:40:03Z

> If you're referring to an aggregation that produces summary type metrics, that wouldn't be possible because there currently is no summary aggregation.

My thinking was creating my own SDK (my flavor, so it doesn't match the SDK spec). The protocol supports a Summary, as can also be seen in the Java SDK io.opentelemetry.sdk.metrics.data.SummaryPointData.
What you suggest only supports min/max and lacks quantiles :)

asafm · 2023-01-04T11:06:47Z

@jack-berg How can we move your idea forward of amending the Histogram interface to have a way to configure aggregation and its options/configuration?

jack-berg · 2023-01-04T17:57:34Z

> @jack-berg How can we move your idea forward of amending the Histogram interface to have a way to configure aggregation and its options/configuration?

Someone needs to open a PR to the spec with the proposed changes. It often helps to have a prototype implementation to drive discussion. There will likely be a debate on whether the histogram API can be changed in isolation or should be considered within a wider (and still ambiguous) "hint API". Assuming the PR to update the spec is merged, language SDKs can go and implement the changes. I'm interested in this topic but am juggling quite a few things, so I can't commit to driving it on a particular timeline.

Fixes #2229. Related to #3061 (lays groundwork but does not resolve). Related to #2977, which may use this new API to have `http.server.duration` report in seconds instead of ms without changing / breaking default bucket boundaries.

Summary of the change:

- Proposes a new parameter to optionally include when creating instruments, called "advice".
- For the moment, advice only has one parameter for specifying the bucket boundaries of explicit bucket histogram.
- Advice can be expanded with additional parameters in the future (e.g. default attributes to retain). The parameters may be general (aka applicable to all instruments) or specific to a particular instrument kind, like bucket boundaries.
- Advice parameters can influence the [default aggregation](https:/open-telemetry/opentelemetry-specification/blob/main/specification/metrics/sdk.md#default-aggregation), which is used if there is no matching view and if the reader does not specify a preferred aggregation.
- Not clear that all advice will be oriented towards configuring aggregation, so I've intentionally left the scope of what they can influence open ended.

I've prototyped this in java [here](open-telemetry/opentelemetry-java#5217). Example usage:

```
DoubleHistogram doubleHistogram =
    meterProvider
        .get("meter")
        .histogramBuilder("histogram")
        .setUnit("foo")
        .setDescription("bar")
        .setAdvice(
            advice -> advice.setBoundaries(Arrays.asList(10.0, 20.0, 30.0)))
        .build();
```

Advice could easily be changed to "hint" with everything else being equal. I thought "advice" clearly described what we're trying to accomplish, which is to advise / recommend the implementation in providing useful output with minimal configuration.

---------

Co-authored-by: Reiley Yang <[email protected]>

asafm · 2023-04-16T13:28:36Z

🥳 Brilliant work @jack-berg

mateuszrzeszutek added the spec:metrics Related to the specification/metrics directory label Dec 17, 2021

github-actions bot assigned jmacd Dec 17, 2021

reyang added the area:api Cross language API specification issue label Jan 11, 2022

reyang added this to the Metrics Future Release milestone Jan 11, 2022

reyang added the enhancement New feature or request label Jan 11, 2022

lalitb mentioned this issue Jul 24, 2022

Add configuration options for Aggregation creation open-telemetry/opentelemetry-cpp#1513

Merged

3 tasks

This was referenced Aug 1, 2022

Add Summary to the Aggregation in the SDK specifications #2704

Closed

Improvements to the instrumentation/runtime package open-telemetry/opentelemetry-go-contrib#2624

Closed

legendecas mentioned this issue Sep 21, 2022

Meter metrics get override in instrumentations after setMeterProvider has called open-telemetry/opentelemetry-js#3249

Closed

2 tasks

mateuszrzeszutek mentioned this issue Dec 28, 2022

Specify default metric view attributes in the API (hints?) #3061

Closed

asafm mentioned this issue Dec 29, 2022

Add the metrics API Hint #1753

Closed

jack-berg mentioned this issue Feb 16, 2023

Propose histogram bucket boundary metric advice (aka hint API) #3216

Merged

Aneurysm9 mentioned this issue Mar 2, 2023

Specifying Histogram Buckets - without views open-telemetry/opentelemetry-go#3826

Open

jack-berg closed this as completed in #3216 Apr 8, 2023

asafm mentioned this issue Apr 27, 2023

PIP-264: Enhanced OTel-based metric system apache/pulsar#20197

Closed

dufferzafar mentioned this issue Feb 25, 2024

[Metrics API] Add support for histogram advice API open-telemetry/opentelemetry-cpp#2132

Open

asafm mentioned this issue Apr 1, 2024

REQUEST: New membership for asafm open-telemetry/community#2029

Closed

6 tasks
