Kinesis: More robust default fetch settings. #13539

gianm · 2022-12-09T09:43:44Z

The current defaults can cause issues for people because they do not take into account the generally large difference in size between aggregated and nonaggregated records, and they do not take into account the amount of available memory. This patch improves the defaults such that they do take these things into account and would be appropriate in a wider variety of situations.

Default recordsPerFetch and recordBufferSize based on available memory
rather than using hardcoded numbers. For this, we need an estimate
of record size. Use 10 KB for regular records and 1 MB for aggregated
records. With 1 GB heaps, 2 processors per task, and nonaggregated
records, recordBufferSize comes out to the same as the old
default (10000), and recordsPerFetch comes out slightly lower (1250
instead of 4000).
Default maxRecordsPerPoll based on whether records are aggregated
or not (100 if not aggregated, 1 if aggregated). Prior default was 100.
Default fetchThreads based on processors divided by task count on
Indexers, rather than overall processor count.
Additionally clean up the serialized JSON a bit by adding various
JsonInclude annotations.

1) Default recordsPerFetch and recordBufferSize based on available memory rather than using hardcoded numbers. For this, we need an estimate of record size. Use 10 KB for regular records and 1 MB for aggregated records. With 1 GB heaps, 2 processors per task, and nonaggregated records, recordBufferSize comes out to the same as the old default (10000), and recordsPerFetch comes out slightly lower (1250 instead of 4000). 2) Default maxRecordsPerPoll based on whether records are aggregated or not (100 if not aggregated, 1 if aggregated). Prior default was 100. 3) Default fetchThreads based on processors divided by task count on Indexers, rather than overall processor count. 4) Additionally clean up the serialized JSON a bit by adding various JsonInclude annotations.

AmatyaAvadhanula · 2022-12-09T10:49:28Z

...dexing-service/src/main/java/org/apache/druid/indexing/kinesis/KinesisIndexTaskIOConfig.java

+   * Together with {@link KinesisIndexTaskTuningConfig#RECORD_BUFFER_MEMORY_MAX_HEAP_FRACTION}, don't take up more
+   * than 15% of the heap.
+   */
+  private static final double RECORD_FETCH_MEMORY_MAX_HEAP_FRACTION = 0.05;


Seems like a typo since 15% is mentioned above

I believe the intention here is that RECORD_BUFFER_MEMORY_MAX_HEAP_FRACTION + RECORD_FETCH_MEMORY_MAX_HEAP_FRACTION makes up the 15% threshold.

Sorry, I misread it.

digitalpoetry · 2022-12-09T12:24:42Z

...dexing-service/src/main/java/org/apache/druid/indexing/kinesis/KinesisIndexTaskIOConfig.java

+                                    ? KinesisIndexTaskTuningConfig.ASSUMED_RECORD_SIZE_AGGREGATE
+                                    : KinesisIndexTaskTuningConfig.ASSUMED_RECORD_SIZE;
+
+      return Ints.checkedCast(Math.max(1, memoryToUse / assumedRecordSize / fetchThreads));


Am I right to assume that for a 1GB-heap task with deaggregate=true and 1 available processor, the default recordsPerFetch from this calculation is 25?

memoryToUse = min(100_000_000, 1_000_000_000 * 0.05) = 50_000_000 assumedRecordSize = 1_000_000 recordsPerFetch = 50_000_000 / 1_000_000 / 2 = 25

If we carry the assumption that each record is 1MB, I think this result conflicts with the maximum size of data that GetRecords can return, which is 10 MB.

My understanding of the Kinesis API is that in that case, the GetRecords API will return fewer records, using our limit as a cap. That would be OK if so; nothing wrong with getting less than this from the API.

AmatyaAvadhanula · 2023-01-13T05:33:40Z

Merging since IT failure was unrelated

PR apache#13539 refactored record supplier creation and introduced a bug: this method would throw NPE when recordsPerFetch was not provided by the user. recordsPerFetch isn't needed in this context at all, since the supervisor-side supplier doesn't fetch records. So this patch sets it to zero.

* Fix NPE in KinesisSupervisor#setupRecordSupplier. PR #13539 refactored record supplier creation and introduced a bug: this method would throw NPE when recordsPerFetch was not provided by the user. recordsPerFetch isn't needed in this context at all, since the supervisor-side supplier doesn't fetch records. So this patch sets it to zero. * Remove unused imports.

gianm added the AWS Kinesis For changes in Kinesis ingestion label Dec 9, 2022

AmatyaAvadhanula reviewed Dec 9, 2022

View reviewed changes

digitalpoetry reviewed Dec 9, 2022

View reviewed changes

gianm added 2 commits December 30, 2022 10:56

Merge branch 'master' into kinesis-limits

9c41561

Updates for tests.

eb4d7a6

AmatyaAvadhanula approved these changes Dec 31, 2022

View reviewed changes

Additional important verify.

164dff8

AmatyaAvadhanula merged commit 182c4fa into apache:master Jan 13, 2023

gianm deleted the kinesis-limits branch February 13, 2023 19:38

gianm mentioned this pull request Feb 27, 2023

Fix NPE in KinesisSupervisor#setupRecordSupplier. #13859

Merged

clintropolis added this to the 26.0 milestone Apr 10, 2023

techdocsmith mentioned this pull request Apr 12, 2023

[DRAFT] 26.0.0 release notes #14064

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Kinesis: More robust default fetch settings. #13539

Kinesis: More robust default fetch settings. #13539

gianm commented Dec 9, 2022 •

edited

Loading

AmatyaAvadhanula Dec 9, 2022

digitalpoetry Dec 9, 2022

AmatyaAvadhanula Dec 9, 2022

digitalpoetry Dec 9, 2022

gianm Dec 9, 2022

digitalpoetry Dec 10, 2022

AmatyaAvadhanula commented Jan 13, 2023

Kinesis: More robust default fetch settings. #13539

Kinesis: More robust default fetch settings. #13539

Conversation

gianm commented Dec 9, 2022 • edited Loading

AmatyaAvadhanula Dec 9, 2022

Choose a reason for hiding this comment

digitalpoetry Dec 9, 2022

Choose a reason for hiding this comment

AmatyaAvadhanula Dec 9, 2022

Choose a reason for hiding this comment

digitalpoetry Dec 9, 2022

Choose a reason for hiding this comment

gianm Dec 9, 2022

Choose a reason for hiding this comment

digitalpoetry Dec 10, 2022

Choose a reason for hiding this comment

AmatyaAvadhanula commented Jan 13, 2023

gianm commented Dec 9, 2022 •

edited

Loading