Evaluating Long-Form Forecasts by Their Effect on Downstream Predictions

Published in ICML 2026 AI Forecasting Workshop, 2026

We argue that the value of a long-form forecast lies in how it updates the world model of a downstream predictor. We measure how conditioning on a long-form forecast improves the prediction accuracy of a weaker model across a sample of real-world events, testing seven frontier models under this framework.

Recommended citation: Qin, J. et al. (2026). "Evaluating Long-Form Forecasts by Their Effect on Downstream Predictions." ICML 2026 AI Forecasting Workshop.
Download Paper

Share on

Bluesky Facebook LinkedIn X (formerly Twitter)

Jeremy Qin

Share on