Online Learning with Optimism and Delay

Genevieve Flaspohler, Francesco Orabona, Judah Cohen, Soukayna Mouatadid, Miruna Oprescu, Paulo Orenstein, Lester Mackey

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

13 Scopus citations

Abstract

Inspired by the demands of real-time climate and weather forecasting, we develop optimistic online learning algorithms that require no parameter tuning and have optimal regret guarantees under delayed feedback. Our algorithms (DORM, DORM+, and AdaHedgeD) arise from a novel reduction of delayed online learning to optimistic online learning that reveals how optimistic hints can mitigate the regret penalty caused by delay. We pair this delay-as-optimism perspective with a new analysis of optimistic learning that exposes its robustness to hinting errors and a new meta-algorithm for learning effective hinting strategies in the presence of delay. We conclude by benchmarking our algorithms on four subseasonal climate forecasting tasks, demonstrating low regret relative to state-of-the-art forecasting models.
Original language: English (US)
Title of host publication: Proceedings of Machine Learning Research
Publisher: ML Research Press
Pages: 3363-3373
Number of pages: 11
ISBN (Print): 9781713845065
State: Published - Jan 1 2021
Externally published: Yes
