Add drop_histogram_buckets function #40281

iblancasa · 2025-05-26T06:55:21Z

Description

Add drop_histogram_buckets function.

Link to tracking issue

Fixes #40280

Testing

Added new tests.

Documentation

Added documentation to README.md

processor/transformprocessor/README.md

processor/transformprocessor/internal/metrics/func_drop_bucket.go

processor/transformprocessor/README.md

processor/transformprocessor/internal/metrics/func_drop_bucket.go

processor/transformprocessor/README.md

processor/transformprocessor/internal/metrics/func_drop_bucket.go

Signed-off-by: Israel Blancas <[email protected]>

iblancasa · 2025-06-13T06:26:58Z

Ping @evan-bradley @TylerHelmuth @bogdandrutu @edmocosta

jsuereth · 2025-06-16T16:31:13Z

This seems to break the Histogram data model, particularly removing sum.

Why not (when removing buckets) place the removed data into some single bucket or 'overflow' bucket?

I'm not really sure I understand the use case at all, but I'd be afraid this would break many metric backends and generally be a large footgun. Can you describe that more either in this PR or in the issue?

iblancasa · 2025-06-16T16:38:04Z

Why not (when removing buckets) place the removed data into some single bucket or 'overflow' bucket?

I'm not sure about that solution, to be honest. One of the benefits is to reduce unused data.

I'm not really sure I understand the use case at all, but I'd be afraid this would break many metric backends and generally be a large footgun.

Just if they use it and don't know what they are doing (as the rest of features we have in the OpenTelemetry Collector). Don't see the issue adding more flexibility.

Can you describe that more either in this PR or in the issue?

It is common for Prometheus users doing this:

metric_relabel_configs:
  - source_labels: [__name__, le]
    regex: 'example_latency_seconds_bucket;0\.0.*'
    action: drop

I want to replicate this feature in OpenTelemetry Collector.

One ref, for instance https://www.robustperception.io/why-are-prometheus-histograms-cumulative/

jsuereth · 2025-06-16T18:19:49Z

Ah, so if you're removing buckets prometheus style, you have the benefit that buckets are just less than a threshod, I.e. they include counts from previous buckets.

If you do the same in opentelemetry you NEED to add the counts from previous buckets into future buckets or you have literally lost data. I.e. if you use this function as it is defined, you just have a broken histogram, you wouldn't wind up with the same value as prometheus dropped buckets gives you.

At a minimum you should:

Shift bucket counts to nearest bucket (either up or down)
Expand bucket boundaries or nearest bucket to incorporate the lost bucket.

Then you'd get the prometheus behavior.

iblancasa · 2025-06-16T18:39:14Z

Shift bucket counts to nearest bucket (either up or down)

Any idea about how to do this? Because per my understanding
#40281 (comment)

Thanks for your comments @jsuereth

iblancasa requested review from TylerHelmuth, evan-bradley, edmocosta and a team as code owners May 26, 2025 06:55

github-actions bot assigned bogdandrutu May 26, 2025

github-actions bot added the processor/transform Transform processor label May 26, 2025

bogdandrutu reviewed May 26, 2025

View reviewed changes

processor/transformprocessor/README.md Outdated Show resolved Hide resolved

bogdandrutu reviewed May 26, 2025

View reviewed changes

iblancasa force-pushed the 40280 branch from e96cf76 to cb3accd Compare May 30, 2025 17:43

iblancasa requested a review from bogdandrutu May 30, 2025 17:44

edmocosta reviewed Jun 2, 2025

View reviewed changes

iblancasa force-pushed the 40280 branch from cb3accd to 4c22298 Compare June 2, 2025 11:55

iblancasa requested a review from edmocosta June 3, 2025 09:14

iblancasa changed the title ~~Add drop_bucket function~~ Add drop_histogram_buckets function Jun 3, 2025

Add drop_histogram_buckets function

58926d8

Signed-off-by: Israel Blancas <[email protected]>

iblancasa force-pushed the 40280 branch from 4c22298 to 58926d8 Compare June 4, 2025 05:09

iblancasa added the waiting-for-code-owners label Jun 6, 2025

github-actions bot mentioned this pull request Jun 10, 2025

Weekly Report: 2025-06-03 - 2025-06-10 #40577

Open

This was referenced Jun 17, 2025

Weekly Report: 2025-06-10 - 2025-06-17 #40753

Open

Weekly Report: 2025-06-17 - 2025-06-24 #40879

Open

iblancasa marked this pull request as draft June 27, 2025 13:30

github-actions bot mentioned this pull request Jul 1, 2025

Weekly Report: 2025-06-24 - 2025-07-01 #41008

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add drop_histogram_buckets function #40281

Add drop_histogram_buckets function #40281

Uh oh!

iblancasa commented May 26, 2025 •

edited

Loading

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

iblancasa commented Jun 13, 2025

Uh oh!

jsuereth commented Jun 16, 2025

Uh oh!

iblancasa commented Jun 16, 2025

Uh oh!

jsuereth commented Jun 16, 2025

Uh oh!

iblancasa commented Jun 16, 2025

Uh oh!

Uh oh!

Add drop_histogram_buckets function #40281

Are you sure you want to change the base?

Add drop_histogram_buckets function #40281

Uh oh!

Conversation

iblancasa commented May 26, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Link to tracking issue

Testing

Documentation

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

iblancasa commented Jun 13, 2025

Uh oh!

jsuereth commented Jun 16, 2025

Uh oh!

iblancasa commented Jun 16, 2025

Uh oh!

jsuereth commented Jun 16, 2025

Uh oh!

iblancasa commented Jun 16, 2025

Uh oh!

Uh oh!

iblancasa commented May 26, 2025 •

edited

Loading