Store-gateway: introduce chunkRangeReader #4209

Closed

Conversation

dimitarvdimitrov
Contributor

Signed-off-by: Dimitar Dimitrov dimitar.dimitrov@grafana.com

What this PR does

This is a follow-up to PR #4174 and upstreams another part of #3968.

This PR introduces the means to parse the raw byte slices of chunk ranges into storepb.AggrChunk. These are not used yet, but will be in a subsequent PR.

Loading the ranges happens via an alternative implementation of the existing bucketChunkReader and bucketChunkReaders. The new implementations are called bucketChunkRangesReader and bucketChunkRangesReaders.

The overall flow of loading chunk ranges is the following (a rough sketch in Go follows the list):

  • get the next seriesChunkRefsSet, same as now
  • construct an intermediate partialSeriesChunks data structure for each series, which holds both the raw byte ranges of the series' groups and the parsed chunks
    • partialSeriesChunks is introduced in this PR
  • fetch some ranges' bytes from the cache and put them in partialSeriesChunks.rawRanges
    • in a future PR
  • fetch the cache misses from the bucket and put them in partialSeriesChunks.rawRanges
    • in a future PR
  • call partialSeriesChunks.parse(), which reads all collected raw byte slices and tries to parse all the chunks from them into partialSeriesChunks.parsedChunks. parse() may not be able to parse a whole range because it was underfetched; in that case parse() will return an underfetchedChunksRangeIdx for each underfetched range
    • included in this PR
  • iterate over the underfetchedChunksRangeIdx for each series and refetch the ranges from the bucket using the bucketChunkRangesReaders
    • in a future PR
  • pass each underfetchedChunksRangeIdx back to partialSeriesChunks.reparse(), which attempts to parse only that individual range of chunks
    • included in this PR
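
To make the sequence easier to follow, here is a rough Go sketch of the parse/refetch/reparse loop. The interfaces and signatures below are simplified stand-ins for illustration only, not the actual partialSeriesChunks, bucketChunkRangesReaders, parse(), reparse() and underfetchedChunksRangeIdx APIs introduced in this and the follow-up PRs.

```go
// The real types live in pkg/storegateway; these interfaces are simplified
// stand-ins used only to illustrate the sequence of steps described above.
type partialSeries interface {
	// parse parses everything collected in rawRanges and returns the indexes
	// of ranges whose bytes were underfetched and could not be parsed yet.
	parse() (underfetched []int, err error)
	// reparse retries a single range after its bytes have been refetched.
	reparse(rangeIdx int) error
}

type rangesReader interface {
	// refetch loads the bytes of one range from the bucket into the partial series.
	refetch(s partialSeries, rangeIdx int) error
}

// loadChunks sketches the overall flow: parse what was fetched, refetch only
// the underfetched ranges, then reparse just those ranges.
func loadChunks(series []partialSeries, reader rangesReader) error {
	for _, s := range series {
		underfetched, err := s.parse()
		if err != nil {
			return err
		}
		for _, rangeIdx := range underfetched {
			if err := reader.refetch(s, rangeIdx); err != nil {
				return err
			}
			if err := s.reparse(rangeIdx); err != nil {
				return err
			}
		}
	}
	return nil
}
```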

Which issue(s) this PR fixes or relates to

Related to #3939

Checklist

  • Tests updated
  • Documentation added
  • CHANGELOG.md updated - the order of entries should be [CHANGE], [FEATURE], [ENHANCEMENT], [BUGFIX]


@pracucci (Collaborator) left a comment


Submitting a partial review. I still have to review pkg/storegateway/series_chunks.go.

rLen = int(diff)
}
}
rangeBuf := make([]byte, rLen)
Collaborator


[for a follow-up discussion] Have you experimented with a memory pool here?

Contributor Author


I ran some tests with a SlabPool, but I don't think the results were very reliable. The overall theme was that a 16K slab size was too small: about 1/4 of allocations still happened because the slab was too small for the range bytes. I also tried 1.6M, and that was too large, I suspect because of too few selected series.

I haven't tried any values between 16K and 1.6M; maybe the sweet spot is somewhere in between.

Collaborator


We can reconsider a pool here later, if we see this popping up in profiles.
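
For reference, this is a minimal sketch of the kind of slab pool discussed above (an illustration of the idea, not Mimir's actual pool type): sub-slices are handed out from large pre-allocated slabs, so a rangeBuf smaller than the slab avoids an individual heap allocation, while anything larger than the slab falls back to a plain make, which is what happened to roughly a quarter of the ranges with 16K slabs.

```go
// slabPool is an illustration only, not Mimir's actual pool implementation:
// it hands out sub-slices of a large pre-allocated slab and falls back to a
// direct allocation when the request doesn't fit in the slab size.
type slabPool struct {
	slabSize int
	current  []byte
}

func newSlabPool(slabSize int) *slabPool {
	return &slabPool{slabSize: slabSize}
}

func (p *slabPool) get(size int) []byte {
	if size > p.slabSize {
		// The range is larger than a whole slab: allocate it directly.
		return make([]byte, size)
	}
	if len(p.current) < size {
		// Not enough room left in the current slab: start a new one.
		p.current = make([]byte, p.slabSize)
	}
	buf := p.current[:size:size]
	p.current = p.current[size:]
	return buf
}
```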

rangeEntry int
}

type bucketChunksRangesReader struct {
Collaborator


A suggestion to unit test this. We don't have to go overboard, but a unit test for the basic case would be nice.

Instead of storing *bucketBlock, we keep (a) the partitioner, (b) a reference to the chunkRangeReader() function, (c) the logger, and (d) the block ID. In the test we override the partitioner and chunkRangeReader(). The overridden chunkRangeReader() returns a simple reader that yields the sequence of bytes 0-254 each time you read from it. When asserting the actual chunk data read, we check that the byte ranges fetched match the expected ones (you can test it with very small chunk ranges, so that the total size of data read is <= 254 bytes).
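
A minimal sketch of the fake reader described above, assuming the overridden chunkRangeReader() only needs to return an io.ReadCloser; the type name and package placement are hypothetical:

```go
package storegateway

import "io"

// fakeSequenceReader is a hypothetical stand-in for the reader returned by an
// overridden chunkRangeReader(): every read yields the repeating byte sequence
// 0, 1, ..., 254, so a test can predict the exact bytes for any offset and length.
type fakeSequenceReader struct {
	next byte
}

func (r *fakeSequenceReader) Read(p []byte) (int, error) {
	for i := range p {
		p[i] = r.next
		r.next = (r.next + 1) % 255 // wraps back to 0 after 254
	}
	return len(p), nil
}

func (r *fakeSequenceReader) Close() error { return nil }

var _ io.ReadCloser = &fakeSequenceReader{}
```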

// parse tries to parse the ranges in the partial set with the bytes it has in rawRanges.
// If the bytes for any of the ranges aren't enough, then parse will return an underfetchedChunksRangeIdx
// and will correct the length of all ranges which had an underestimated length.
// Currently, parse will only correct the length of the last chunk, since this is the only chunk
Collaborator


I don't think this is entirely true. We start from the assumption that chunks are sorted in the segment file, but in the logic filling in the length we clamp to 16000 bytes if the length we compute is longer than that. In the extreme case where the real chunk is even longer, we may have underfetched a chunk which is not necessarily the last one.

// An error is returned when rBytes are malformed or when more than the last chunk is incomplete.
//
//nolint:unused // dead code while we are working on PR 3968
func parseRange(rBytes []byte, chunks []storepb.AggrChunk) (allChunksComplete bool, lastChunkLen uint32, totalRead int, _ error) {
Collaborator


This function is self-contained and should be easy to unit test.

// if the data in rawRanges is invalid.
//
//nolint:unused // dead code while we are working on PR 3968
func (s partialSeriesChunksSet) parse() ([]underfetchedChunksRangeIdx, error) {
Collaborator


This looks testable, given partialSeriesChunksSet is self-contained and doesn't rely on any external dependency.
