Timezone: »
Multivariate probabilistic time series forecasts are commonly evaluated via proper scoring rules, i.e., functions that are minimal in expectation for the ground-truth distribution. However, this property is not sufficient to guarantee good discrimination in the non-asymptotic regime. In this paper, we provide the first systematic finite-sample study of proper scoring rules for time series forecasting evaluation. Through a power analysis, we identify the ``region of reliability'' of a scoring rule, i.e., the set of practical conditions where it can be relied on to identify forecasting errors. We carry out our analysis on a comprehensive synthetic benchmark, specifically designed to test several key discrepancies between ground-truth and forecast distributions, and we gauge the generalizability of our findings to real-world tasks with an application to an electricity production problem. Our results reveal critical shortcomings in the evaluation of multivariate probabilistic forecasts as commonly performed in the literature.
Author Information
Étienne Marcotte (ServiceNow Research)
Valentina Zantedeschi (INRIA, UCL)
Alexandre Drouin (ServiceNow Research)
Nicolas Chapados (ServiceNow Research)
More from the Same Authors
-
2021 : Typing assumptions improve identification in causal discovery »
Philippe Brouillard · Perouz Taslakian · Alexandre Lacoste · Sébastien Lachapelle · Alexandre Drouin -
2023 : Invariant Causal Set Covering Machines »
Thibaud Godon · Baptiste Bauvin · Pascal Germain · Jacques Corbeil · Alexandre Drouin -
2023 : Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation »
Chris Emezue · Alexandre Drouin · Tristan Deleu · Stefan Bauer · Yoshua Bengio -
2023 : Causal Discovery with Language Models as Imperfect Experts »
Stephanie Long · Alex Piche · Valentina Zantedeschi · Tibor Schuster · Alexandre Drouin -
2022 Poster: TACTiS: Transformer-Attentional Copulas for Time Series »
Alexandre Drouin · Étienne Marcotte · Nicolas Chapados -
2022 Spotlight: TACTiS: Transformer-Attentional Copulas for Time Series »
Alexandre Drouin · Étienne Marcotte · Nicolas Chapados -
2021 Poster: Learning Binary Decision Trees by Argmin Differentiation »
Valentina Zantedeschi · Matt J. Kusner · Vlad Niculae -
2021 Spotlight: Learning Binary Decision Trees by Argmin Differentiation »
Valentina Zantedeschi · Matt J. Kusner · Vlad Niculae