Fix: Recover boolean test flag after visiting subexpressions #13909

Lokejoke · 2024-10-24T12:40:35Z

Summary

This PR corrects the logic for handling the boolean test flag during expression analysis.

Since the flag can be modified during subexpression traversal, it must be restored before analyzing the full expression.

Additionally, this change allows the in_bool_test() method in crates/ruff_python_semantic/src/model.rs to be used outside of boolean operations, such as in conditional tests within Stmt::If, Stmt::While, Stmt::Assert, and others.

github-actions · 2024-10-24T12:54:38Z

`ruff-ecosystem` results

Linter (stable)

✅ ecosystem check detected no linter changes.

Linter (preview)

✅ ecosystem check detected no linter changes.

MichaReiser · 2024-10-24T12:57:38Z

Thanks. Would you mind adding a test demonstrating that restoring the flag is necessary?

Lokejoke · 2024-10-25T12:04:49Z

Thanks. Would you mind adding a test demonstrating that restoring the flag is necessary?

@MichaReiser Could you direct me to where this test would fit?

MichaReiser · 2024-10-25T12:10:27Z

We unfortunately don't have any semantic model tests. So the best you can do is to extend the tests of a rule that uses the flag. One such rule is RUF019 and the tests are defined here

ruff/crates/ruff_linter/src/rules/ruff/mod.rs

Line 46 in e37bde4

#[test_case(Rule::UnnecessaryKeyCheck, Path::new("RUF019.py"))]

(in RUF019.py)

MichaReiser

Could you explain me why restoring the flags here is necessary.

To me, it doesn't seem necessary because restoring the semantic.flags here means that the BOOLEAN_TEST flag won't be set for the analyze::expression phase.

We do reset the `semantic.flags on line 1480 so that it is correctly restored for its ancestor and sibling expressions

Lokejoke · 2024-11-01T18:21:21Z

Take for example this simple code:

if not f(a):
    pass

Currently, when visiting the expression f(a), it loses its BOOLEAN_TEST flag. This happens because it doesn’t match the conditions for BoolOp or Not, following the logic noted in the comment (lines 1032-1033):

// If we're in a boolean test (e.g., the test of a Stmt::If), but now within a
// subexpression (e.g., a in f(a)), then we're no longer in a boolean test.

But the f(a) it self should be in bool test so the flag should be recovered before analysis.

Example with Current vs. Proposed Behavior

To see the difference, here’s a comparison using various test cases:

if not a(b):
    pass

assert c(d)

x = e(f) or g(h)

while i == j:
    pass

Current BOOLEAN_TEST Flag Behavior (restoring the flag after analysis):
Expressions with BOOLEAN_TEST flag:

not a(b)

Proposed BOOLEAN_TEST Flag Behavior (restoring the flag before analysis):
Expressions with BOOLEAN_TEST flag:

not a(b)
a(b)
c(d)
i == j

Testing the Change

Testing this change with current rules is difficult, as they cover only Expr::BoolOp. However, implementing rule use-implicit-booleaness-not-len / C1802 would reveal the issue, as it checks for cases where BOOLEAN_TEST is essential to detect rule violations (in process).

dhruvmanila · 2024-11-04T03:45:37Z

However, implementing rule use-implicit-booleaness-not-len / C1802 would reveal the issue, as it checks for cases where BOOLEAN_TEST is essential to detect rule violations (in process).

Can you expand on why do you think it's an issue for this specific rule? I think that rule would need to be implemented at the ExprCall level for which the BOOLEAN_TEST flag should be test. It's only when the checker is visiting a sub-expression like an argument to the function call when the flag is being reset.

Lokejoke · 2024-11-04T11:11:34Z

I intended to illustrate with the rule example that the BOOLEAN_TEST flag is not limited to BoolOp and Not expressions, even though it isn’t used elsewhere in the project.

I agree with your interpretation of how the BOOLEAN_TEST flag should function, but the current implementation doesn’t reflect this expected behavior.

Store the initial flags.
Remove the BOOLEAN_TEST flag for any expressions that are not BoolOp or Not.
Visit subexpressions.
Analyze the expression.
Restore the original flags.

BOOLEAN_TEST flag should be restored between steps 3 and 4.

MichaReiser · 2024-11-04T11:21:07Z

crates/ruff_linter/src/checkers/ast/mod.rs


+        // Restore boolean test flag after examining subexpressions
+        if flags_snapshot.intersects(SemanticModelFlags::BOOLEAN_TEST) {
+            self.semantic.flags |= SemanticModelFlags::BOOLEAN_TEST;
+        }
+
        // Step 4: Analysis
        analyze::expression(expr, self);
        match expr {


Thanks for the explanation. The change now makes sense to me. However, I do think that this problem isn't specific to BOOLEAN_TEST but applies to all flags.
The flags should be restored when visiting the expression itself. They're only relevant when visiting the sub-expressions.

That's why I suggest that we change the code to:

// Step 4: Analysis match expr { Expr::StringLiteral(string_literal) => { analyze::string_like(string_literal.into(), self); } Expr::BytesLiteral(bytes_literal) => analyze::string_like(bytes_literal.into(), self), Expr::FString(f_string) => analyze::string_like(f_string.into(), self), _ => {} } self.semantic.flags = flags_snapshot; analyze::expression(expr, self);

I'm not sure about this. Could there be cases where a subexpression changes the semantics of its parent expression?
If so, this approach might not be correct.

I skimmed through the SemanticFlags and I didn't spot any expression level flag that would change the flags for the parent or a sibling.

MichaReiser · 2024-11-05T19:55:46Z

Thanks and nice find!

Fix: recover boolean test flag after visiting subexpressions

ef8e9db

MichaReiser requested changes Oct 29, 2024

View reviewed changes

MichaReiser reviewed Nov 4, 2024

View reviewed changes

MichaReiser added the internal An internal refactor or improvement label Nov 4, 2024

Fixed semantic flags logic

b4ea81e

MichaReiser merged commit abafeb4 into astral-sh:main Nov 5, 2024
20 checks passed

Lokejoke mentioned this pull request Dec 8, 2024

Aggregating All Rules Implemented for the Thesis (Do not merge!) Lokejoke/ruff#22

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix: Recover boolean test flag after visiting subexpressions #13909

Fix: Recover boolean test flag after visiting subexpressions #13909

Lokejoke commented Oct 24, 2024

github-actions bot commented Oct 24, 2024 •

edited

Loading

MichaReiser commented Oct 24, 2024

Lokejoke commented Oct 25, 2024

MichaReiser commented Oct 25, 2024

MichaReiser left a comment

Lokejoke commented Nov 1, 2024

dhruvmanila commented Nov 4, 2024

Lokejoke commented Nov 4, 2024

MichaReiser Nov 4, 2024 •

edited

Loading

Lokejoke Nov 4, 2024

MichaReiser Nov 4, 2024

MichaReiser commented Nov 5, 2024

Fix: Recover boolean test flag after visiting subexpressions #13909

Fix: Recover boolean test flag after visiting subexpressions #13909

Conversation

Lokejoke commented Oct 24, 2024

Summary

github-actions bot commented Oct 24, 2024 • edited Loading

ruff-ecosystem results

Linter (stable)

Linter (preview)

MichaReiser commented Oct 24, 2024

Lokejoke commented Oct 25, 2024

MichaReiser commented Oct 25, 2024

MichaReiser left a comment

Choose a reason for hiding this comment

Lokejoke commented Nov 1, 2024

Example with Current vs. Proposed Behavior

Testing the Change

dhruvmanila commented Nov 4, 2024

Lokejoke commented Nov 4, 2024

MichaReiser Nov 4, 2024 • edited Loading

Choose a reason for hiding this comment

Lokejoke Nov 4, 2024

Choose a reason for hiding this comment

MichaReiser Nov 4, 2024

Choose a reason for hiding this comment

MichaReiser commented Nov 5, 2024

github-actions bot commented Oct 24, 2024 •

edited

Loading

`ruff-ecosystem` results

MichaReiser Nov 4, 2024 •

edited

Loading