Verification of ensemble relative frequencies without binning #1742
-
What is the best way to produce verification measures for probabilistic forecasts for an ensemble of size MET's definition of verification measures for probabilistic forecasts uses the midpoint of bins specified by user-defined thresholds to define the probabilities used in the formulae. I want to use the relative frequencies themselves, i.e. the values output by Gen-Ens_Prod without any binning. Although it seems possible to specify the bins in such a way as to make their midpoints the relative frequencies, it gets rather awkward: For example for an ensemble size of 6, the ensemble relative frequencies are [0, 0.16667, 0.33333, 0.5, 0.66667, 0.83333, 1]. If I set the thresholds as: This is tedious to set up by hand for large ensemble sizes though, and the two empty bins seem a bit clumsy. It would be easier if MET had an option to use full set of probabilities found in the forecast without binning, equivalent to Thanks for your time, Roger |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 3 replies
-
Hi Roger, And thank you for your question. As you've indicated well in your question, the quickest way to obtain an uncalibrated probability forecast is through the use of the gen-ens-prod tool by turning on the Unfortunately, from my experience with MET and METplus (especially with probability forecasts), there is no way that I'm aware of to dynamically set the categorical thresholds for any subsequent analytical tools (grid-stat, point-stat, series-analysis, etc.). As background, a score like Brier Score where the probability of the forecast (f) and observation (o) fields are traditionally directly reviewed for the calculation (e.g. Sum of f_t - o_t squared divided by total pairs), MET still relies on the use of nx2 contingency tables and thus requires thresholds. As you correctly linked and provided an example of, the best way to make these two approaches equivalent is to make the midpoint of each bin the value you desire for f_t. But this isn't without its drawbacks, as it's finicky (finding the exact half of a repeating decimal value) and becomes tedious with larger threshold quantities. I'm not sure how easy it would be for METplus to be coded to dynamically set the bin width based on the input/ a set value, or the (potentially) larger task of allowing direct sampling of probabilities for BSS calculation. @JohnHalleyGotway, do you have any insight into how this could be done in MET, to dynamically set the |
Beta Was this translation helpful? Give feedback.
Hi Roger,
And thank you for your question. As you've indicated well in your question, the quickest way to obtain an uncalibrated probability forecast is through the use of the gen-ens-prod tool by turning on the
frequency
ensemble_flag setting.Unfortunately, from my experience with MET and METplus (especially with probability forecasts), there is no way that I'm aware of to dynamically set the categorical thresholds for any subsequent analytical tools (grid-stat, point-stat, series-analysis, etc.). As background, a score like Brier Score where the probability of the forecast (f) and observation (o) fields are traditionally directly reviewed for the calculation (e.g. Sum of f_t - o_t squa…