kgo: reduce allocations when processing batches #3

ortuman · 2024-09-23T12:11:01Z

This PR introduces various changes in order to reduce GC overhead by decreasing the total number of allocations when the EnableRecordsPool option is set.

Among the changes introduced, the following are noteworthy:

A dedicated pool is used during message batch decompression, in order to reduce number of allocations and to avoid a potential pool poisoning scenario.
The possibility of reusing the final output buffers derived from the decompression of a batch has been introduced, after invoking *kgo.(*Record).Reuse on all the resulting records.
Similarly, once *kgo.(*Record).Reuse has been invoked on all the resulting records of a batch, the possibility of recycling the intermediate []kmsg.Record buffers generated during the batch processing has been introduced.

pkg/kgo/compression.go

pkg/kgo/source.go

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

dimitarvdimitrov

this is great! About 50% of allocations in Mimir with ingest storage come from decompressing the bytes. This should make a massive difference.

I left a couple of comments on the pool sizes and how to make them work with the changes in #4

pkg/kgo/compression.go

pkg/kgo/source.go

pkg/kgo/client.go

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

pkg/kgo/compression.go

pkg/kgo/source.go

pkg/kgo/client.go

pkg/kgo/internal/pool/bucketed_pool.go

pkg/kgo/compression.go

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

pkg/kgo/compression.go

dimitarvdimitrov

LGTM, only one comment about lz4 out size and a suggestion about copying the data; but otherwise nice work!

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

…tions Signed-off-by: Miguel Ángel Ortuño <[email protected]>

* fetching: export utilities for decompressing and parsing partition retch responses ### Background In grafana/mimir we are working towards making fetch requests ourselves. The primary reason behind that is that individual requests to the kafka backend are slow, so doing them sequentially per partition becomes the bottleneck in our application. So we want to fetch records in parallel to speed up the consumption. One difficulty I met when issuing `FetchRequest`s ourselves is that parsing the response is non-trivial. That's why I'm proposing to export these functions for downstream projects to use. Alternatively, I can also try contributing the concurrent fetching logic. But I believe that is much more nuanced and with more tradeoffs around fetched bytes and latency. So I wasn't sure whether it's a good fit for a general purpose library. I'm open to discuss this further. ### What this PR does Moves `(*kgo.cursorOffsetNext).processRespPartition` from being a method to being a standalone function - `kgo.processRespPartition`. There were also little changes necessary to make the interface suitable for public use (like removing the `*broker` parameter). ### Side effects To minimize the necessary changes and the API surface of the package I opted to use a single global decompressor for all messages. Previously, there would be one decompressor per client and that decompressor would be passed down to `(*cursorOffsetNext).processRespPartition`. My understanding is that using different pooled readers (lz4, zst, gzip) shouldn't have a negative impact on performance because usage patterns do not affect the behaviour of the reader (for example, a consistent size of decompressed data doesn't make the reader more or less efficient). I have not thoroughly verified or tested this - Let me know if you think that's important. An alternative to this is to also export the `decompressor` along with `newDecompressor()` and the auxiliary types for decompression. * Restore multiline processV0OuterMessage * `*kgo.Records` pooling support Signed-off-by: Miguel Ángel Ortuño <[email protected]> * Merge pull request #1 from grafana/ortuman/reduce-kgo-record-alloc `*kgo.Record` pooling support * fetching: export utilities for decompressing and parsing partition retch responses * Merge pull request #4 from dimitarvdimitrov/dimitar/grafana-master-with-export-partition-parsing-utils fetching: export utilities for decompressing and parsing partition fetch responses * Merge pull request #3 from ortuman/reduce-decompression-buffer-allocations Signed-off-by: Miguel Ángel Ortuño <[email protected]> --------- Signed-off-by: Miguel Ángel Ortuño <[email protected]> Co-authored-by: Dimitar Dimitrov <[email protected]>

ortuman force-pushed the ortuman/reduce-decompression-buffer-allocations branch 9 times, most recently from 94547af to 64d04d6 Compare September 24, 2024 10:53

ortuman marked this pull request as ready for review September 25, 2024 08:24

ortuman changed the title ~~kgo: allow reusing decompressor output buffers~~ kgo: reduce allocations when processing batches Sep 25, 2024

flxbk reviewed Sep 25, 2024

View reviewed changes

pkg/kgo/compression.go Outdated Show resolved Hide resolved

flxbk reviewed Sep 26, 2024

View reviewed changes

pkg/kgo/source.go Outdated Show resolved Hide resolved

ortuman added a commit that referenced this pull request Sep 30, 2024

addressed PR feedback

d507d35

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman added a commit that referenced this pull request Sep 30, 2024

addressed PR feedback

6d8cb09

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman added a commit that referenced this pull request Sep 30, 2024

reduce max pool buffer size from 32Mb to 8Mb

6c46643

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

dimitarvdimitrov reviewed Sep 30, 2024

View reviewed changes

pkg/kgo/compression.go Outdated Show resolved Hide resolved

pkg/kgo/source.go Outdated Show resolved Hide resolved

pkg/kgo/source.go Outdated Show resolved Hide resolved

pkg/kgo/client.go Outdated Show resolved Hide resolved

ortuman added 6 commits October 1, 2024 09:18

limit max pool buffer capacity for decompressor

565303c

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

kgo: allow reusing decompressor output buffers

ffd82dc

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

kgo: allow reusing kmsg.Record buffers when processing batch message.

f0f416f

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

addressed PR feedback

df5b140

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

addressed PR feedback

b87b500

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

reduce max pool buffer size from 32Mb to 8Mb

22dba64

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman force-pushed the ortuman/reduce-decompression-buffer-allocations branch 6 times, most recently from 9d508f2 to 5505346 Compare October 1, 2024 08:41

refactor after rebase

8d36414

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman force-pushed the ortuman/reduce-decompression-buffer-allocations branch from 5505346 to 8d36414 Compare October 1, 2024 08:43

addressed PR feedback

4eed844

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman requested a review from dimitarvdimitrov October 1, 2024 08:52

dimitarvdimitrov reviewed Oct 1, 2024

View reviewed changes

addressed PR feedback

47adbd1

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman force-pushed the ortuman/reduce-decompression-buffer-allocations branch from 13c9457 to 47adbd1 Compare October 1, 2024 12:31

ortuman added 2 commits October 1, 2024 14:43

addressed PR feedback

2df6656

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

addressed PR feedback

b92faef

ref: #3 (comment) Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman requested a review from dimitarvdimitrov October 1, 2024 12:50

cosmetic change

4131b9d

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

dimitarvdimitrov reviewed Oct 1, 2024

View reviewed changes

pkg/kgo/compression.go Outdated Show resolved Hide resolved

dimitarvdimitrov reviewed Oct 1, 2024

View reviewed changes

pkg/kgo/compression.go Outdated Show resolved Hide resolved

dimitarvdimitrov reviewed Oct 1, 2024

View reviewed changes

pkg/kgo/compression.go Outdated Show resolved Hide resolved

dimitarvdimitrov approved these changes Oct 1, 2024

View reviewed changes

ortuman added 6 commits October 1, 2024 15:56

addressed PR feedback

88058a0

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

renamed function

a394414

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

cosmetic change

3900eec

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

fix buffer reuse bug

28dd742

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

fixed unit test

13eaa20

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

rough estimate lz4 decompressed buffer

420f45b

Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman merged commit 675fbbb into master Oct 3, 2024
1 check passed

ortuman added a commit that referenced this pull request Oct 3, 2024

Merge pull request #3 from ortuman/reduce-decompression-buffer-alloca…

835b5cb

…tions Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman added a commit that referenced this pull request Oct 3, 2024

Merge pull request #3 from ortuman/reduce-decompression-buffer-alloca…

d5fa5fe

…tions Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman added a commit that referenced this pull request Oct 3, 2024

Merge pull request #3 from ortuman/reduce-decompression-buffer-alloca…

6010ccd

…tions Signed-off-by: Miguel Ángel Ortuño <[email protected]>

ortuman mentioned this pull request Oct 3, 2024

use grafana/franz-go fork grafana/mimir#9511

Merged

4 tasks

twmb mentioned this pull request Oct 8, 2024

high number of allocations in kgo.recordToRecord function twmb/franz-go#823

Open

ortuman deleted the ortuman/reduce-decompression-buffer-allocations branch October 17, 2024 08:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kgo: reduce allocations when processing batches #3

kgo: reduce allocations when processing batches #3

ortuman commented Sep 23, 2024 •

edited

Loading

dimitarvdimitrov left a comment

dimitarvdimitrov left a comment

kgo: reduce allocations when processing batches #3

kgo: reduce allocations when processing batches #3

Conversation

ortuman commented Sep 23, 2024 • edited Loading

dimitarvdimitrov left a comment

Choose a reason for hiding this comment

dimitarvdimitrov left a comment

Choose a reason for hiding this comment

ortuman commented Sep 23, 2024 •

edited

Loading