Replies: 4 comments 1 reply
-
You could simply run Can you elaborate on your use case a bit more, i.e., what do you want to achieve here? Because I'm wondering whether it is more of a mixed metric that you want, or more of a benchmarking tool. |
Beta Was this translation helpful? Give feedback.
-
would this PR by @hazrulakmal suit your needs? |
Beta Was this translation helpful? Give feedback.
-
Hi @fkiraly, first off, thank you for your quick replies and your interest. What I have in mind, which I don't believe the PR you referenced solves, is something like Figure 4, p. 16, of https://www.federalreserve.gov/econres/feds/files/2021014pap.pdf. I say something like, because here the statistics are all relative to a baseline model(here, the AR(1), Gap model, which is the first one on the x-axis), whereas in my original question I was after the raw RMSEs, although in practice RMSEs are reported relative to a baseline. In any case, in this figure they are reporting (relative) RMSEs for forecasts of a quarterly inflation variable, and the results are reported separately for horizons 1, 2, 4, and 8. So if we look at the Tealbook forecasts for example (third to last on the x-axis), we see that (relative to the baseline) it is very accurate for 1-quarter-ahead forecasts, however this performance deteriorates for 2-quarter-ahead forecasts, and deteriorates again for 4-quarter-ahead forecasts. Let me know if this still doesn't help explain. Also, you are correct in that one can just run |
Beta Was this translation helpful? Give feedback.
-
Interesting evaluation case. I hope I understood your message correctly. Based on your explanation and the code you provided, let's assume we have 2.5 years of data, with the last 6 months being the test set. CV will output 6 fold evaluation because the window moves every 30 days and you want to compute metrics, say RMSE, on those 6 points of each forecast horizon, fh |
Beta Was this translation helpful? Give feedback.
-
Hello, I was wondering if sktime supports computing a performance metric, e.g. RMSE, for one-step ahead forecasts, two-step ahead forecasts, etc? For example, the following code gives the MAPE across each collection of 30 forecasts (y is daily data here):
So it looks like at each step it generates 30 forecasts (since fh = [1,...,30]) and computes a single MAPE over those 30 forecasts. What I have in mind is getting all the fh=1 forecasts across all the steps and then computing say the RMSE over all of those, then getting all the fh=2 forecasts across all the steps and computing the RMSE over all of those, and so on.
Is this supported?
Beta Was this translation helpful? Give feedback.
All reactions