-
Notifications
You must be signed in to change notification settings - Fork 590
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
bug: hash agg state fetching panics at vnode should not be accessed
in longevity test
#6642
Comments
Is there concrete SQL here? |
Have we ever seen other occurrence after #6833 ? |
Seems not. But I guess they might be unrelated. 😰 |
Trying to run |
Some additional info, thanks to @sushantgupta for confirming:
Some ideas I will try for debugging:
|
So far q15,q16,q17 pass: https://buildkite.com/risingwave-test/longevity-test/builds/244#018552e7-ed47-46c1-bc84-dd14911e8866/578. Perhaps other queries can reproduce this? Try again when https://github.com/risingwavelabs/risingwave-test/issues/158 is resolved. Next steps:
|
Currently nexmark longevity fails pretty often: https://buildkite.com/risingwave-test/longevity-test. Continue investigating when it is resolved. See: https://app.slack.com/client/T030LTU38S2/C0423G2NUF8/thread/C0423G2NUF8-1672652465.607889 for context. |
Some further updates, longevity test is still blocked, progress will be tracked here: https://github.com/risingwavelabs/risingwave-test/issues/179. Still need it, want to use it to pinpoint which nexmark query/queries is causing this error. |
Cannot reproduce |
Full logs here: https://ap-southeast-1.console.aws.amazon.com/cloudwatch/home?region=ap-southeast-1#logsV2:log-groups/log-group/$252Frisingwave$252Frls-apse1-eks-a$252Frwc-3-longevity-20221128-044939/log-events/ip-10-0-6-144.ap-southeast-1.compute.internal-risingwave.var.log.containers.risingwave-compute-0_rwc-3-longevity-20221128-044939_compute-384e31312989248ef0f98db5e6e3e9e4178827011352cb7f3361e5b5d3092807.log
This is really strange, as the dispatcher should already ensure that the group key has a correct vnode.
The text was updated successfully, but these errors were encountered: