Matsui, Akira
Hybrid Forecasting of Geopolitical Events
Benjamin, Daniel M., Morstatter, Fred, Abbas, Ali E., Abeliuk, Andres, Atanasov, Pavel, Bennett, Stephen, Beger, Andreas, Birari, Saurabh, Budescu, David V., Catasta, Michele, Ferrara, Emilio, Haravitch, Lucas, Himmelstein, Mark, Hossain, KSM Tozammel, Huang, Yuzhong, Jin, Woojeong, Joseph, Regina, Leskovec, Jure, Matsui, Akira, Mirtaheri, Mehrnoosh, Ren, Xiang, Satyukov, Gleb, Sethi, Rajiv, Singh, Amandeep, Sosic, Rok, Steyvers, Mark, Szekely, Pedro A, Ward, Michael D., Galstyan, Aram
Sound decision-making relies on accurate prediction for tangible outcomes ranging from military conflict to disease outbreaks. To improve crowdsourced forecasting accuracy, we developed SAGE, a hybrid forecasting system that combines human and machine generated forecasts. The system provides a platform where users can interact with machine models and thus anchor their judgments on an objective benchmark. The system also aggregates human and machine forecasts weighting both for propinquity and based on assessed skill while adjusting for overconfidence. We present results from the Hybrid Forecasting Competition (HFC) - larger than comparable forecasting tournaments - including 1085 users forecasting 398 real-world forecasting problems over eight months. Our main result is that the hybrid system generated more accurate forecasts compared to a human-only baseline which had no machine generated predictions. We found that skilled forecasters who had access to machine-generated forecasts outperformed those who only viewed historical data. We also demonstrated the inclusion of machine-generated forecasts in our aggregation algorithms improved performance, both in terms of accuracy and scalability. This suggests that hybrid forecasting systems, which potentially require fewer human resources, can be a viable approach for maintaining a competitive level of accuracy over a larger number of forecasting questions.
Detecting multi-timescale consumption patterns from receipt data: A non-negative tensor factorization approach
Matsui, Akira, Kobayashi, Teruyoshi, Moriwaki, Daisuke, Ferrara, Emilio
Understanding consumer behavior is an important task, not only for developing marketing strategies but also for the management of economic policies. Detecting consumption patterns, however, is a high-dimensional problem in which various factors that would affect consumers' behavior need to be considered, such as consumers' demographics, circadian rhythm, seasonal cycles, etc. Here, we develop a method to extract multi-timescale expenditure patterns of consumers from a large dataset of scanned receipts. We use a non-negative tensor factorization (NTF) to detect intra- and inter-week consumption patterns at one time. The proposed method allows us to characterize consumers based on their consumption patterns that are correlated over different timescales.
Unbiased Lift-based Bidding System
Moriwaki, Daisuke, Hayakawa, Yuta, Munemasa, Isshu, Saito, Yuta, Matsui, Akira
Conventional bidding strategies for online display ad auction heavily relies on observed performance indicators such as clicks or conversions. A bidding strategy naively pursuing these easily observable metrics, however, fails to optimize the profitability of the advertisers. Rather, the bidding strategy that leads to the maximum revenue is a strategy pursuing the performance lift of showing ads to a specific user. Therefore, it is essential to predict the lift-effect of showing ads to each user on their target variables from observed log data. However, there is a difficulty in predicting the lift-effect, as the training data gathered by a past bidding strategy may have a strong bias towards the winning impressions. In this study, we develop Unbiased Lift-based Bidding System, which maximizes the advertisers' profit by accurately predicting the lift-effect from biased log data. Our system is the first to enable high-performing lift-based bidding strategy by theoretically alleviating the inherent bias in the log. Real-world, large-scale A/B testing successfully demonstrates the superiority and practicability of the proposed system.