Use more platform-independent random test ordering #39441

jbytheway · 2020-04-10T14:26:03Z

Summary

SUMMARY: Infrastructure "Make test failures from other platforms more easily reproducible"

Purpose of change

Currently we have difficulty reproducing test failures across platforms. One reason is that we are running the tests in declaration ordr, which depends on the build system & linker. We could use lexicographic order, but even better would be random order.

To that end, it would be helpful if random order (for the same seed) was the same across platforms. This is an attempt to achieve that.

Furthermore, with this change, when a subset of the tests are run, they run in the same order as they would have when more (or all) tests are run. This makes inter-test dependency bugs easier to track down by finding the smallest set of tests which reproduces them.

Describe the solution

Rather than randomly shuffling the tests, we now sort them be an integer value associated with each test. That value is derived from the random seed and test name in a deterministic manner.

Describe alternatives you've considered

std::uniform_int_distribution might also introduce platform-dependence, so this might need further refinement. Considered fixing that now, but decided to wait and see if it's a real problem.

Testing

Ran various subsets of tests with different seeds and observed the above properties.

Additional context

I've opened a similar PR on Catch2 directly, but I wanted to backport the change here because @wapcaplet has been working on resolving failures related to randomly ordered tests, and this should help.

Serves two purposes: - Should be consistent across platforms. - When fewer tests are run, the remainder run in the same order. This makes inter-test dependency bugs easier to track down.

kevingranade · 2020-04-10T21:24:46Z

I see upstream has some concerns, but they don't apply to us, we can just merge the solution you agree on when it happens.

This includes the upstreamed version of CleverRaven#39441, with some improvements. And various other improvements too.

Use platform-independent random ordering

4130511

Serves two purposes: - Should be consistent across platforms. - When fewer tests are run, the remainder run in the same order. This makes inter-test dependency bugs easier to track down.

kevingranade merged commit 80e93bf into CleverRaven:master Apr 10, 2020

jbytheway deleted the catch_random_order branch April 11, 2020 01:14

jbytheway added a commit to jbytheway/Cataclysm-DDA that referenced this pull request Apr 21, 2020

Update Catch2 to v2.12.0 (from 2.9.1)

0135827

This includes the upstreamed version of CleverRaven#39441, with some improvements. And various other improvements too.

jbytheway mentioned this pull request Apr 21, 2020

Update Catch2 to v2.12.0 (from 2.9.1) #39797

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use more platform-independent random test ordering #39441

Use more platform-independent random test ordering #39441

jbytheway commented Apr 10, 2020

kevingranade commented Apr 10, 2020

Use more platform-independent random test ordering #39441

Use more platform-independent random test ordering #39441

Conversation

jbytheway commented Apr 10, 2020

Summary

Purpose of change

Describe the solution

Describe alternatives you've considered

Testing

Additional context

kevingranade commented Apr 10, 2020