
Implement expected value via integration over probability thresholds #1734

Merged
merged 9 commits into from
Jun 30, 2022

Conversation

tjtg
Contributor

@tjtg tjtg commented Jun 3, 2022

As a follow-up to #1719, this implements processing of threshold data by numerical integration over probability thresholds, replacing the quick-to-implement but poorly performing approach that used ConvertProbabilitiesToPercentiles.

The acceptance test data is unchanged for this PR.
The time to run the acceptance test on threshold data dropped from ~1.5 seconds to ~0.2 seconds on my workstation. The performance improvement is even larger on bigger datasets, such as whole-country sized grids.

Testing:

  • Ran tests and they passed OK
  • Added new tests for the new feature(s)
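The core idea described above can be sketched in a few lines of NumPy. This is a hypothetical standalone function for illustration, not the actual IMPROVER implementation: given exceedance probabilities P(X > t) at a set of thresholds, the drop in exceedance probability between consecutive thresholds is the probability mass of that interval, and the expected value is approximated by summing interval midpoints weighted by that mass.

```python
import numpy as np

def expected_value_from_exceedance(thresholds, exceedance_probs):
    """Estimate E[X] from exceedance probabilities P(X > t) at fixed thresholds.

    The decrease in exceedance probability between consecutive thresholds is
    the probability mass of that interval; each interval contributes its
    midpoint weighted by that mass.
    """
    t = np.asarray(thresholds, dtype=float)
    p = np.asarray(exceedance_probs, dtype=float)
    cdf = 1.0 - p                        # cumulative distribution at each threshold
    mass = np.diff(cdf)                  # probability mass in each interval
    midpoints = 0.5 * (t[:-1] + t[1:])
    return float(np.sum(midpoints * mass))
```

For a uniform distribution on [0, 10] sampled at five evenly spaced thresholds this returns exactly 5.0. Note that any mass lying outside the outermost thresholds is ignored, which is why the choice of endpoints raised in review matters.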

@tjtg tjtg marked this pull request as draft June 3, 2022 08:56
@codecov

codecov bot commented Jun 3, 2022

Codecov Report

Merging #1734 (7762939) into master (730ed80) will increase coverage by 0.04%.
The diff coverage is 95.12%.

@@            Coverage Diff             @@
##           master    #1734      +/-   ##
==========================================
+ Coverage   98.17%   98.22%   +0.04%     
==========================================
  Files         113      114       +1     
  Lines       10336    10678     +342     
==========================================
+ Hits        10147    10488     +341     
- Misses        189      190       +1     
Impacted Files Coverage Δ
improver/expected_value.py 96.00% <95.12%> (-4.00%) ⬇️
improver/constants.py 100.00% <0.00%> (ø)
improver/regrid/landsea.py 99.21% <0.00%> (ø)
improver/utilities/solar.py 100.00% <0.00%> (ø)
improver/developer_tools/metadata_interpreter.py 99.35% <0.00%> (ø)
improver/wind_calculations/vertical_updraught.py 100.00% <0.00%> (ø)
improver/calibration/reliability_calibration.py 98.84% <0.00%> (+<0.01%) ⬆️
...ometric_calculations/psychrometric_calculations.py 99.22% <0.00%> (+0.02%) ⬆️
improver/utilities/spatial.py 98.91% <0.00%> (+0.04%) ⬆️
improver/utilities/cube_manipulation.py 99.14% <0.00%> (+0.07%) ⬆️
... and 2 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 730ed80...7762939.

@tjtg tjtg marked this pull request as ready for review June 7, 2022 03:24
@tjtg tjtg requested a review from btrotta-bom June 7, 2022 23:35
Contributor

@btrotta-bom btrotta-bom left a comment


The integration logic looks ok to me. I made a suggestion about the choice of endpoints; this won't affect results much provided the original thresholds are sensibly chosen, but I think it would be good to have consistency.
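The endpoint question can be illustrated with a small sketch. One way to handle tail mass (a hypothetical illustration, not necessarily the endpoint choice the PR adopts) is to extend the threshold axis by one interval on each side, assuming full exceedance below the lowest threshold and zero exceedance above the highest:

```python
import numpy as np

def pad_thresholds(thresholds, exceedance_probs):
    """Extend the threshold axis by one interval on each side.

    The added lower threshold is assumed to always be exceeded (probability 1)
    and the added upper threshold never exceeded (probability 0), so residual
    tail mass is assigned to the padded intervals. The spacing of the nearest
    original interval is reused for the padding.
    """
    t = np.asarray(thresholds, dtype=float)
    p = np.asarray(exceedance_probs, dtype=float)
    lower = t[0] - (t[1] - t[0])
    upper = t[-1] + (t[-1] - t[-2])
    t_ext = np.concatenate([[lower], t, [upper]])
    p_ext = np.concatenate([[1.0], p, [0.0]])
    return t_ext, p_ext
```

As the review notes, with sensibly chosen original thresholds the tail mass is small, so the exact padding rule changes the result very little; the value of a consistent rule is mainly reproducibility.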

Review threads on improver/expected_value.py (resolved)
btrotta-bom
btrotta-bom previously approved these changes Jun 23, 2022
Contributor

@btrotta-bom btrotta-bom left a comment


Thanks, this looks good now.

Contributor

@benowen-bom benowen-bom left a comment


This PR provides a good implementation for evaluating expected value directly from the probability data. Overall I am happy with the method, but I have put forward a couple of questions; I expect these should be easy enough to address.

Review threads on improver/expected_value.py and improver_tests/expected_value/test_expected_value.py (resolved)
benowen-bom
benowen-bom previously approved these changes Jun 29, 2022
Contributor

@benowen-bom benowen-bom left a comment


Thanks for responding to my comments. As I expected, the resolution was likely to be "leave as is", but I figured the questions were worth raising.

I've left one suggestion about adding a comment on the +/- np.inf bounds, but I don't think it is necessary given the issue you mentioned in your reply, so I would be happy to leave it out (I'll leave this to your discretion).

Either way, I'm happy with what is here. Great work!

Contributor

@benowen-bom benowen-bom left a comment


Thanks for adding in the comment. I'm happy for this to be merged in now.

@tjtg tjtg merged commit 98691c8 into metoppv:master Jun 30, 2022
MoseleyS added a commit to MoseleyS/improver that referenced this pull request Jul 1, 2022
* master:
  Calc temperature after latent heat release (metoppv#1739)
  Fixed broken links (metoppv#1745)
  Vicinity processing CLI (metoppv#1749)
  Rainforest minor fixes (metoppv#1751)
  Implement expected value via integration over probability thresholds (metoppv#1734)
  Exclude hidden directories and their sub-directories from the init check test. This accommodates IDEs that store information in hidden directories, i.e. vscode. (metoppv#1748)

# Conflicts:
#	improver/psychrometric_calculations/psychrometric_calculations.py
#	improver_tests/acceptance/test_vicinity.py
@tjtg tjtg deleted the expectedvalue3 branch July 25, 2022 02:29
MoseleyS pushed a commit to MoseleyS/improver that referenced this pull request Aug 22, 2024
…etoppv#1734)

* Implement expected value over thresholds

* Add tests for non-monotonic data and thresholds lt/gt

* Update acceptance test docstring

* min/max of threshold spacing and ECC bounds

* Fix interpolation mismatched with thresholds

Also add parameterized tests to pick up this issue

* Add unit test for unequally spaced thresholds

* Fix black

* Remove duplicated data equality check

* Update comment explaining extra thresholds