[test] failures #15631

danielxiangzl · 2024-12-18T19:32:11Z

Description

How Has This Been Tested?

Key Areas to Review

Type of Change

Which Components or Systems Does This Change Impact?

Checklist

I have read and followed the CONTRIBUTING doc
I have performed a self-review of my own code
I have commented my code, particularly in hard-to-understand areas
I identified and added all stakeholders and component owners affected by this change as reviewers
I tested both happy and unhappy path of the functionality
I have made corresponding changes to the documentation

trunk-io · 2024-12-18T19:32:14Z

⏱️ 3h 36m total CI duration on this PR

Slowest 15 Jobs	Cumulative Duration	Recent Runs
test-target-determinator	35m	🟩 🟩 🟩 🟩 🟩 (+5 more)
adhoc-forge-test / forge	29m	🟥
adhoc-forge-test / forge	24m	⬜
rust-cargo-deny	17m	🟩 🟩 🟩 🟩 🟩 (+5 more)
check-dynamic-deps	16m	🟩 🟩 🟩 🟩 🟩 (+6 more)
rust-move-tests	9m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥
rust-move-tests	7m	🟥

🚨 1 job on the last run was significantly faster/slower than expected

Job	Duration	vs 7d avg	Delta
check-dynamic-deps	3m	1m

_{settings ⋅ feedback ⋅ docs ⋅ learn more about trunk.io}

graphite-app · 2024-12-18T21:10:48Z

testsuite/smoke-test/src/consensus/consensus_fault_tolerance.rs

+    )
+    .await
+    .unwrap();
+    panic!("test_fault_tolerance_of_leader_equivocation");


This panic!() statement causes the test to fail before it can validate the leader equivocation behavior. Consider removing it to allow the test to complete and verify the fault tolerance mechanisms are working as expected.

Spotted by Graphite Reviewer

Is this helpful? React 👍 or 👎 to let us know.

graphite-app · 2024-12-18T21:58:33Z

testsuite/testcases/src/performance_test.rs

+    async fn test(
+        &self,
+        swarm: Arc<tokio::sync::RwLock<Box<dyn Swarm>>>,
+        _report: &mut TestReport,
+        duration: Duration,
+    ) -> Result<()> {
+        let validators = { swarm.read().await.get_validator_clients_with_names() };
+        // 10 vals, test 1,2,3 failures
+        let num_bad_leaders = 3;
+        for (name, validator)  in validators[..num_bad_leaders].iter() {
+            validator
+                    .set_failpoint(
+                        "consensus::leader_equivocation".to_string(),
+                        "return".to_string(),
+                    )
+                    .await
+                    .map_err(|e| {
+                        anyhow!(
+                            "set_failpoint to set consensus leader equivocation on {} failed, {:?}",
+                            name,
+                            e
+                        )
+                    })?;
+        };
+        Ok(())
+    }


The test currently sets up the failpoints but returns immediately without running for the specified test duration. Consider adding tokio::time::sleep(duration).await before returning to ensure the test runs for the full duration after the failpoints are configured. This will provide more realistic test coverage of how the system behaves under sustained equivocation conditions.

Spotted by Graphite Reviewer

Is this helpful? React 👍 or 👎 to let us know.

graphite-app · 2024-12-19T00:29:17Z

testsuite/testcases/src/performance_test.rs

+    async fn test(
+        &self,
+        swarm: Arc<tokio::sync::RwLock<Box<dyn Swarm>>>,
+        _report: &mut TestReport,
+        duration: Duration,
+    ) -> Result<()> {
+        let validators = { swarm.read().await.get_validator_clients_with_names() };
+        // 10 vals, test 1,2,3 failures
+        let num_bad_leaders = 1;
+        for (name, validator)  in validators[..num_bad_leaders].iter() {
+            validator
+                    .set_failpoint(
+                        "consensus::leader_equivocation".to_string(),
+                        "return".to_string(),
+                    )
+                    .await
+                    .map_err(|e| {
+                        anyhow!(
+                            "set_failpoint to set consensus leader equivocation on {} failed, {:?}",
+                            name,
+                            e
+                        )
+                    })?;
+        };
+        Ok(())
+    }


The test currently returns immediately after setting the failpoint, without waiting for the specified duration. This means the test may complete before the system has had time to exhibit the failure behavior being tested. Consider adding tokio::time::sleep(duration).await before returning to ensure the test runs for the full duration and properly exercises the equivocation scenario.

Spotted by Graphite Reviewer

Is this helpful? React 👍 or 👎 to let us know.

danielxiangzl added 2 commits December 18, 2024 11:28

test leader equivocation

740b95e

test

a1ba033

danielxiangzl added the CICD:build-failpoints-images Build failpoints docker image label Dec 18, 2024

graphite-app bot reviewed Dec 18, 2024

View reviewed changes

use rotating leader

2dd9b35

danielxiangzl force-pushed the daniel-paper-failures branch from f523873 to 2dd9b35 Compare December 18, 2024 21:57

graphite-app bot reviewed Dec 18, 2024

View reviewed changes

danielxiangzl added 4 commits December 18, 2024 14:34

1 failure

da3739d

tps

21240ee

tps

834bd77

tps

bc7a2da

graphite-app bot reviewed Dec 19, 2024

View reviewed changes

danielxiangzl added 2 commits December 18, 2024 17:06

3 faults

6ad6679

tps

4c6b8a5

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[test] failures #15631

[test] failures #15631

danielxiangzl commented Dec 18, 2024

trunk-io bot commented Dec 18, 2024 •

edited

Loading

graphite-app bot Dec 18, 2024

graphite-app bot Dec 18, 2024

graphite-app bot Dec 19, 2024

[test] failures #15631

Are you sure you want to change the base?

[test] failures #15631

Conversation

danielxiangzl commented Dec 18, 2024

Description

How Has This Been Tested?

Key Areas to Review

Type of Change

Which Components or Systems Does This Change Impact?

Checklist

trunk-io bot commented Dec 18, 2024 • edited Loading

graphite-app bot Dec 18, 2024

Choose a reason for hiding this comment

graphite-app bot Dec 18, 2024

Choose a reason for hiding this comment

graphite-app bot Dec 19, 2024

Choose a reason for hiding this comment

trunk-io bot commented Dec 18, 2024 •

edited

Loading