store the first raw value of a chunk during downsampling #1709

alfred-landrum · 2019-11-03T07:38:31Z

As discussed in #1568, storing only the last raw value
of a chunk will lose a counter reset when:
a) the reset occurs at a chunk boundary, and
b) the last raw value of the earlier chunk is less than
the first aggregated value of the later chunk.

This commit stores the first raw value of a chunk during
the initial raw aggregation, and retains it during
subsequent aggregations. This is similar to the existing
handling for the last raw value of a chunk.

With this change, when counterSeriesIterator iterates over
a chunk boundary, it will see the last raw value of the
earlier chunk, then the first raw value of the later chunk,
and then the first aggregated value of the later chunk. The
first raw value will always be less than or equal to the
first aggregated value, so the only difference in
counterSeriesIterator's output will be the possible detection
of a reset and an extra sample after the chunk boundary.

Fixes: #1568

Signed-off-by: Alfred Landrum alfred@leakybucket.org

I added CHANGELOG entry for this change.
Change is not relevant to the end user.

Changes

Verification

As discussed in thanos-io#1568, storing only the last raw value of a chunk will lose a counter reset when: a) the reset occurs at a chunk boundary, and b) the last raw value of the earlier chunk is less than the first aggregated value of the later chunk. This commit stores the first raw value of a chunk during the initial raw aggregation, and retains it during subsequent aggregations. This is similar to the existing handling for the last raw value of a chunk. With this change, when counterSeriesIterator iterates over a chunk boundary, it will see the last raw value of the earlier chunk, then the first raw value of the later chunk, and then the first aggregated value of the later chunk. The first raw value will always be less than or equal to the first aggregated value, so the only difference in counterSeriesIterator's output will be the possible detection of a reset and an extra sample after the chunk boundary. Fixes: thanos-io#1568 Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

bwplotka

Nice! I don't have a way to really e2e test this case, but I read through the algorithm, and it makes sense to me 👍 Thanks! Small nit only. And thanks for awesome explanations on both issue and PR!

Just curious, did you also observed that in your real system in the actual query? (: if yes, did Thanos with this PR returns expected result?

Small style nit only from my side.

@brian-brazil could you take a look as well? (:

pkg/compact/downsample/downsample.go

bwplotka · 2019-11-03T14:23:36Z

pkg/compact/downsample/downsample.go

@@ -289,7 +289,13 @@ func (b *aggrChunkBuilder) add(t int64, aggr *aggregator) {
 	b.added++
 }

-func (b *aggrChunkBuilder) finalizeChunk(lastT int64, trueSample float64) {
+func (b *aggrChunkBuilder) firstRawSample(firstT int64, trueSample float64) {


those functions are quite shallow, and really the same. Can we maybe just inline with the comment? We use it twice, sure, but if we would inline them it might be even clearer?

Maybe but IMHO this split up is clear too since literally the function's name tells you what's happening 😄 up to you.

I've handled this in the latest diff by inlining the actions of the functions, but pointing them to new explanatory comments at CounterSeriesIterator, please take a look.

brian-brazil

👍

pkg/compact/downsample/downsample.go

pkg/compact/downsample/downsample_test.go

alfred-landrum · 2019-11-04T17:21:01Z

@bwplotka : regarding your question: I didn't observe this directly: my colleague @aponjavic spotted the potential issue as we were studying how Thanos implements downsampling. So I don't have an easy setup that repros the issue & the fix.

GiedriusS

Maybe we should also update the comment around the type CounterSeriesIterator?

Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

GiedriusS

👍

bwplotka

LGTM, Thanks!

BTW great talk on PromCon (: We might want to link slides in downsampling doc even.

) * store the first raw value of a chunk during downsampling As discussed in thanos-io#1568, storing only the last raw value of a chunk will lose a counter reset when: a) the reset occurs at a chunk boundary, and b) the last raw value of the earlier chunk is less than the first aggregated value of the later chunk. This commit stores the first raw value of a chunk during the initial raw aggregation, and retains it during subsequent aggregations. This is similar to the existing handling for the last raw value of a chunk. With this change, when counterSeriesIterator iterates over a chunk boundary, it will see the last raw value of the earlier chunk, then the first raw value of the later chunk, and then the first aggregated value of the later chunk. The first raw value will always be less than or equal to the first aggregated value, so the only difference in counterSeriesIterator's output will be the possible detection of a reset and an extra sample after the chunk boundary. Fixes: thanos-io#1568 Signed-off-by: Alfred Landrum <alfred@leakybucket.org> * changelog for thanos-io#1709 Signed-off-by: Alfred Landrum <alfred@leakybucket.org> * adjust existing downsampling tests Signed-off-by: Alfred Landrum <alfred@leakybucket.org> * add counter aggregation comments to CounterSeriesIterator Signed-off-by: Alfred Landrum <alfred@leakybucket.org> Signed-off-by: Aleksey Sin <asin@ozon.ru>

alfred-landrum added 4 commits November 3, 2019 00:33

Merge branch 'master' into downsample-counter-reset

f25a23a

changelog for thanos-io#1709

e67e0eb

Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

adjust existing downsampling tests

b3bd5bd

Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

bwplotka approved these changes Nov 3, 2019

View reviewed changes

brian-brazil reviewed Nov 4, 2019

View reviewed changes

pkg/compact/downsample/downsample.go Outdated Show resolved Hide resolved

pkg/compact/downsample/downsample_test.go Outdated Show resolved Hide resolved

GiedriusS reviewed Nov 4, 2019

View reviewed changes

add counter aggregation comments to CounterSeriesIterator

0322a52

Signed-off-by: Alfred Landrum <alfred@leakybucket.org>

GiedriusS approved these changes Nov 7, 2019

View reviewed changes

GiedriusS requested a review from bwplotka November 7, 2019 12:22

bwplotka approved these changes Nov 9, 2019

View reviewed changes

bwplotka merged commit 3debaeb into thanos-io:master Nov 9, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

store the first raw value of a chunk during downsampling #1709

store the first raw value of a chunk during downsampling #1709

alfred-landrum commented Nov 3, 2019 •

edited

Loading

bwplotka left a comment

bwplotka Nov 3, 2019

GiedriusS Nov 4, 2019

alfred-landrum Nov 5, 2019

brian-brazil left a comment

alfred-landrum commented Nov 4, 2019

GiedriusS left a comment

GiedriusS left a comment

bwplotka left a comment

store the first raw value of a chunk during downsampling #1709

store the first raw value of a chunk during downsampling #1709

Conversation

alfred-landrum commented Nov 3, 2019 • edited Loading

Changes

Verification

bwplotka left a comment

Choose a reason for hiding this comment

bwplotka Nov 3, 2019

Choose a reason for hiding this comment

GiedriusS Nov 4, 2019

Choose a reason for hiding this comment

alfred-landrum Nov 5, 2019

Choose a reason for hiding this comment

brian-brazil left a comment

Choose a reason for hiding this comment

alfred-landrum commented Nov 4, 2019

GiedriusS left a comment

Choose a reason for hiding this comment

GiedriusS left a comment

Choose a reason for hiding this comment

bwplotka left a comment

Choose a reason for hiding this comment

alfred-landrum commented Nov 3, 2019 •

edited

Loading