-
Notifications
You must be signed in to change notification settings - Fork 4.3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[KafkaIO] Fix per-split metric updates for KafkaUnboundedReader and ReadFromKafkaDoFn #32921
Conversation
Checks are failing. Will not request review until checks are succeeding. If you'd like to override that behavior, comment |
ebb1a96
to
49f48e8
Compare
Run Java PreCommit |
7d1a399
to
b14590a
Compare
This LGTM, can you fix the conflicts? |
Done, thanks for reviewing this! |
Can you check on the RAT test? |
ce8ac6e
to
180f70c
Compare
…pache#31281)" This reverts commit fd4368f.
32de1c2
to
a0951af
Compare
@johnjcasey rebased and updated where required. |
Run Java_Examples_Dataflow PreCommit |
Assigning reviewers. If you would like to opt out of this review, comment R: @kennknowles for label java. Available commands:
The PR bot will only process comments in the main thread (not review comments). |
Merge this as this was approved and merge conflict/test were cleared |
This reverts #31137 and #31281. The changes in the former PR overwrite the per-split metric
backlog_bytes.${SPLIT}
with the per-partition metric rather than the accumulated value for the split. The latter PR introduced a map to store metrics for all past and current splits (1 partition) of theReadFromKafkaDoFn
instance and may repeatedly overwrite non-current splits with stale values. The map used to store these values is not thread-safe and may trigger aConcurrentModificationException
sinceGetSize
and other SDF methods may concurrently attempt to read and write the map.Finally, the per-split caches kept by the instance are keyed on
TopicPartition
, which is not unique among all splits since the split may override the bootstrap server. This has been fixed for this cache, all other caches will be patched in a separate PR.Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:
addresses #123
), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, commentfixes #<ISSUE NUMBER>
instead.CHANGES.md
with noteworthy changes.See the Contributor Guide for more tips on how to make review process smoother.
To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md
GitHub Actions Tests Status (on master branch)
See CI.md for more information about GitHub Actions CI or the workflows README to see a list of phrases to trigger workflows.