Open iterators when cancelling queries #2881

Aklakan · 2024-12-06T21:24:30Z

Version

5.3.0-SNAPSHOT

What happened?

Sometimes when cancelling queries warnings are logged such as:

Open iterator: QueryIterPeek/123
Open iterator: QueryIterSingleton/456

So far I noticed this to happen when BGPs and UNIONs are involved, but this may not be exhaustive.

It turns out that not all the iterator construction in ARQ is performed truly lazy, which causes some parts of the code to leave iterators open if an exception happens during the iterator construction.

After investigation it seems that the warnings should (usually) be harmless, because the QueryIters are tracked in the ExecutionContext and thus eventually become closed without leaking resources.

However, seeing these warnings suggests that resources might be leaked - so this is an issue that IMO should be fixed.

See the PR for a test case and the proposed fixes.

Relevant output and stacktrace

No response

Are you interested in making a pull request?

Yes

The text was updated successfully, but these errors were encountered:

afs · 2024-12-07T08:45:38Z

I took the test case from the PR and have run it multiple times. I haven't got any test failures.

resources might be leaked

QueryIteratorCheck is 2015 (!!) and was then for development/internal consistency checking for non-cancelled queries.

One thing to consider is invert the responsibilities have specific management of external resources - for example, add to a specific manager, one per query execution, pass in the context. End of execution clears up. This would also be good for per-execution caches.

Let's enumerate the possibilities for such managed resources.

For in-memory all resources will eventually be garbage collected so there aren't any.

For TDB, the transaction mechanism clears up resources needing explicit work (ThreadLocal variables). Query execution resources are garbage collected.

Aklakan · 2024-12-07T09:12:25Z

The test case is not failing - but it produces warnings non-deterministically:

To make the test fail one could introduce a symbol ARQ.failOnOpenIterators which gets picked up by QueryIteratorCheck.

So its an issue that's quite hard to track down because (at least) OpUnion and StageGeneratorGeneric both start to construct QueryIters (which are tracked in the execution context), but the process may fail midway, leading to dangling QueryIters in the execution context which are disconnected from the 'input' iterator - and QueryIterCheck will warn in those cases - which looks as if something went wrong.

I think its good that QueryIterCheck warns in those cases - because dangling unclosed iterators shouldn't happen.

Aklakan · 2024-12-07T09:48:54Z

One thing to consider is invert the responsibilities have specific management of external resources - for example, add to a specific manager, one per query execution, pass in the context. End of execution clears up. This would also be good for per-execution caches.

Maybe I am misunderstanding, how would that differ from the current design?
Currently:

QueryIter registers itself at the execution context - the resource manager.
Something fails midway, resources are still tracked in the resource manager (although they could have been closed earlier).
End of execution closes all tracked but unclosed resources - but warns because this shouldn't normally happen.

afs · 2024-12-07T11:02:24Z

The current design does not have a way to track item that must have clear-up applied. The resource manager would a new thing - not based on the close iterator mechanism. It is only things that need clear-up that need to be handled carefully. The GC will deal with the rest.

If there are declared resources, we don't have to put in code for less-common situations on the performance path and we don't have to code in one place (general execution) so connected to code in another place (spacial resources).

afs · 2024-12-07T11:06:14Z

I don't get warnings. How many times do they need to be run?

Tests that pass-don't-pass are difficult to handle in the overall build. It really is error prone to rely on spotting warnings in a maven build. I've been trying to fix the current ones which (I hope) are spurious cases where the test code is by design hitting a warning case.

Aklakan · 2024-12-07T12:03:03Z

I updated the code to your comments and added an OpenIteratorException to QueryIteratorCheck that should now cause the test TestQueryExecutionCancel.test_cancel_concurrent_2 to fail.

The screenshot is created by removing this PR's try-catch block from QueryIterPeek.peek().

How many times do they need to be run?

The system I am working on has 6+7 physical and 6+(7*2)=20 virtual cores. The longest streak without warnings was maybe 10 times. Usually I get it roughly every 3 to 5 runs.

Aklakan added the bug label Dec 6, 2024

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 6, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

54440a4

Aklakan linked a pull request Dec 6, 2024 that will close this issue

GH-2881: Mitigate cases of open iterators when cancelling queries. #2882

Open

3 tasks

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 6, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

cf30474

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 7, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

1e9878c

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 7, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

4d485a7

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 7, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

db77f93

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 7, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

9f1564a

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 11, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

d3af02f

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 11, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

3f73972

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 11, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

7221065

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 11, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

e9293d1

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 13, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

e153b53

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 13, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

d6e4a2e

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 13, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

bc317ce

Aklakan added a commit to Aklakan/jena that referenced this issue Dec 14, 2024

apacheGH-2881: Mitigate cases of open iterators when cancelling queries.

de0e047

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Open iterators when cancelling queries #2881

Open iterators when cancelling queries #2881

Aklakan commented Dec 6, 2024 •

edited

Loading

afs commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading

afs commented Dec 7, 2024

afs commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading

Open iterators when cancelling queries #2881

Open iterators when cancelling queries #2881

Comments

Aklakan commented Dec 6, 2024 • edited Loading

Version

What happened?

Relevant output and stacktrace

Are you interested in making a pull request?

afs commented Dec 7, 2024 • edited Loading

Aklakan commented Dec 7, 2024 • edited Loading

Aklakan commented Dec 7, 2024 • edited Loading

afs commented Dec 7, 2024

afs commented Dec 7, 2024 • edited Loading

Aklakan commented Dec 7, 2024 • edited Loading

Aklakan commented Dec 6, 2024 •

edited

Loading

afs commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading

afs commented Dec 7, 2024 •

edited

Loading

Aklakan commented Dec 7, 2024 •

edited

Loading