Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Migrate portable runners to second version of data channel code #25104

Merged
merged 1 commit into from
Jan 24, 2023

Conversation

lukecwik
Copy link
Member

@lukecwik lukecwik commented Jan 20, 2023

The deletion of Dataflow JRH allows us to follow-up with further clean-up as the majority of what was preventing this migration was an awkward re-use of the first version of the data channel code in the JRH.


Thank you for your contribution! Follow this checklist to help us incorporate your contribution quickly and easily:

  • Mention the appropriate issue in your description (for example: addresses #123), if applicable. This will automatically add a link to the pull request in the issue. If you would like the issue to automatically close on merging the pull request, comment fixes #<ISSUE NUMBER> instead.
  • Update CHANGES.md with noteworthy changes.
  • If this contribution is large, please file an Apache Individual Contributor License Agreement.

See the Contributor Guide for more tips on how to make review process smoother.

To check the build health, please visit https://github.com/apache/beam/blob/master/.test-infra/BUILD_STATUS.md

GitHub Actions Tests Status (on master branch)

Build python source distribution and wheels
Python tests
Java tests
Go tests

See CI.md for more information about GitHub Actions CI.

@codecov
Copy link

codecov bot commented Jan 20, 2023

Codecov Report

Merging #25104 (4dad3c6) into master (b3aa2e8) will increase coverage by 0.00%.
The diff coverage is n/a.

❗ Current head 4dad3c6 differs from pull request most recent head fe7f9c9. Consider uploading reports for the commit fe7f9c9 to get more accurate results

@@           Coverage Diff           @@
##           master   #25104   +/-   ##
=======================================
  Coverage   73.12%   73.12%           
=======================================
  Files         735      735           
  Lines       98161    98161           
=======================================
+ Hits        71782    71783    +1     
+ Misses      25016    25015    -1     
  Partials     1363     1363           
Flag Coverage Δ
python 82.67% <0.00%> (+<0.01%) ⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Impacted Files Coverage Δ
...python/apache_beam/runners/worker/worker_status.py 74.66% <0.00%> (-0.67%) ⬇️
...hon/apache_beam/runners/worker/bundle_processor.py 93.30% <0.00%> (-0.25%) ⬇️
...on/apache_beam/runners/dataflow/dataflow_runner.py 81.74% <0.00%> (-0.15%) ⬇️
...ks/python/apache_beam/runners/worker/sdk_worker.py 89.24% <0.00%> (+0.16%) ⬆️
...che_beam/runners/interactive/interactive_runner.py 91.77% <0.00%> (+1.26%) ⬆️
.../python/apache_beam/testing/test_stream_service.py 92.85% <0.00%> (+4.76%) ⬆️

📣 We’re building smart automated test selection to slash your CI/CD build times. Learn more

@lukecwik lukecwik force-pushed the beam13015.runnercleanup branch from 057e1ee to 007b6f9 Compare January 21, 2023 00:09
@lukecwik lukecwik force-pushed the beam13015.runnercleanup branch from 007b6f9 to fe7f9c9 Compare January 23, 2023 19:17
@lukecwik lukecwik changed the title [WIP] Migrate portable runners to second version of data channel code Migrate portable runners to second version of data channel code Jan 23, 2023
@github-actions
Copy link
Contributor

Assigning reviewers. If you would like to opt out of this review, comment assign to next reviewer:

R: @kennknowles for label java.

Available commands:

  • stop reviewer notifications - opt out of the automated review tooling
  • remind me after tests pass - tag the comment author after tests pass
  • waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).

…e on the runner side

This allows us to cleanup the existing code base now that the Dataflow JRH is gone and should improve the stability and performance for Java based portable runners built on top of this code since we have had a lot more use of the SDK harness variants at scale.
@lukecwik lukecwik force-pushed the beam13015.runnercleanup branch from fe7f9c9 to 22ff99b Compare January 23, 2023 21:56
@lukecwik
Copy link
Member Author

Run XVR_Flink PostCommit

@lukecwik
Copy link
Member Author

Run Go Flink ValidatesRunner

@lukecwik
Copy link
Member Author

Run Python Flink ValidatesRunner

@lukecwik
Copy link
Member Author

Run Python Samza ValidatesRunner

@lukecwik
Copy link
Member Author

Run Python Spark ValidatesRunner

@lukecwik
Copy link
Member Author

Run Java Flink PortableValidatesRunner Streaming

@lukecwik
Copy link
Member Author

Run Java Samza PortableValidatesRunner

@lukecwik
Copy link
Member Author

Run Java Spark PortableValidatesRunner Batch

@lukecwik
Copy link
Member Author

Run Go Samza ValidatesRunner

@lukecwik
Copy link
Member Author

Run Go Spark ValidatesRunner

@apache apache deleted a comment from github-actions bot Jan 23, 2023
@lukecwik
Copy link
Member Author

R: @y1chi @apilloud

@github-actions
Copy link
Contributor

Stopping reviewer notifications for this pull request: review requested by someone other than the bot, ceding control

Copy link
Contributor

@y1chi y1chi left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

}

@Override
public void onCompleted() {
LOG.warn(
"Hanged up for {}.",
apiServiceDescriptor == null ? "unknown endpoint" : apiServiceDescriptor);
outboundObserver.onCompleted();
}
}
}
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe we can rename this class to just BeamFnDataGrpcMultiplexer with a follow up?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, I wanted to do that in a separate CL since the commit history would be easier to understand.

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

#25144 is the follow-up

@lukecwik lukecwik merged commit 3d9318b into apache:master Jan 24, 2023
lukecwik added a commit to lukecwik/incubator-beam that referenced this pull request Jan 24, 2023
This is a follow-up for a comment in apache#25104

Also remove InboundDataClient which was missed in apache#25104
lukecwik added a commit that referenced this pull request Jan 24, 2023
…r2 (#25144)

This is a follow-up for a comment in #25104

Also remove InboundDataClient which was missed in #25104
@Abacn
Copy link
Contributor

Abacn commented Jan 25, 2023

python postcommit tests apache_beam.io.external.xlang_debeziumio_it_test.CrossLanguageDebeziumIOTest.test_xlang_debezium_read start failing likely related to this change:

https://ci-beam.apache.org/view/PostCommit/job/beam_PostCommit_Python310/393/
https://ci-beam.apache.org/view/PostCommit/job/beam_PostCommit_Python39/1381/
https://ci-beam.apache.org/view/PostCommit/job/beam_PostCommit_Python38/3658/
https://ci-beam.apache.org/view/PostCommit/job/beam_PostCommit_Python37/ (same test failed though not shown in jenkins, can be checked from log)

Error Message
RuntimeError: Service failed to start up with error 1
Stacktrace
self = <apache_beam.io.external.xlang_debeziumio_it_test.CrossLanguageDebeziumIOTest testMethod=test_xlang_debezium_read>

looks like it fails to start the expansion service.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants