A machine learning methodology for real-time forecasting of the 2019-2020 COVID-19 outbreak using Internet searches, news alerts, and estimates from mechanistic models
Liu, Dianbo, Clemente, Leonardo, Poirier, Canelle, Ding, Xiyu, Chinazzi, Matteo, Davis, Jessica T, Vespignani, Alessandro, Santillana, Mauricio
We present a timely and novel methodology that combines disease estimates from mechanistic models with digital traces, via interpretable machine-learning methodologies, to reliably forecast COVID-19 activity in Chinese provinces in real-time. Specifically, our method is able to produce stable and accurate forecasts 2 days ahead of current time, and uses as inputs (a) official health reports from Chinese Center Disease for Control and Prevention (China CDC), (b) COVID-19-related internet search activity from Baidu, (c) news media activity reported by Media Cloud, and (d) daily forecasts of COVID-19 activity from GLEAM, an agent-based mechanistic model. Our machine-learning methodology uses a clustering technique that enables the exploitation of geo-spatial synchronicities of COVID-19 activity across Chinese provinces, and a data augmentation technique to deal with the small number of historical disease activity observations, characteristic of emerging outbreaks. Our model's predictive power outperforms a collection of baseline models in 27 out of the 32 Chinese provinces, and could be easily extended to other geographies currently affected by the COVID-19 outbreak to help decision makers.
Apr-8-2020
- Country:
- South America (0.04)
- Africa (0.04)
- North America
- Central America (0.04)
- United States > Massachusetts
- Suffolk County > Boston (0.05)
- Mexico > Nuevo León
- Monterrey (0.04)
- Europe > Italy
- Piedmont > Turin Province > Turin (0.04)
- Asia
- Taiwan (0.06)
- Mongolia (0.05)
- Macao (0.04)
- Vietnam (0.04)
- China
- Hubei Province > Wuhan (0.06)
- Hong Kong (0.06)
- Shanghai > Shanghai (0.05)
- Beijing > Beijing (0.05)
- Inner Mongolia (0.05)
- Tianjin Province > Tianjin (0.05)
- Chongqing Province > Chongqing (0.05)
- Tibet Autonomous Region (0.04)
- Gansu Province (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology: