Learning to Predict from Textual Data
Radinsky, K., Davidovich, S., Markovitch, S.
–Journal of Artificial Intelligence Research
Given a current news event, we tackle the problem of generating plausible predictions of future events it might cause. We present a new methodology for modeling and predicting such future news events using machine learning and data mining techniques. Our Pundit algorithm generalizes examples of causality pairs to infer a causality predictor. To obtain precisely labeled causality examples, we mine 150 years of news articles and apply semantic natural language modeling techniques to headlines containing certain predefined causality patterns. For generalization, the model uses a vast number of world knowledge ontologies. Empirical evaluation on real news articles shows that our Pundit algorithm performs as well as non-expert humans.
Journal of Artificial Intelligence Research
Dec-26-2012
- Country:
- South America > Brazil (0.04)
- Antarctica (0.04)
- Pacific Ocean (0.04)
- Indian Ocean (0.04)
- Oceania
- Australia (0.04)
- Solomon Islands (0.04)
- North America
- Mexico (0.04)
- United States
- Texas (0.04)
- North Carolina (0.04)
- Louisiana (0.04)
- New Jersey (0.04)
- California (0.04)
- New York (0.04)
- Arizona (0.04)
- Florida > Brevard County (0.04)
- Nebraska (0.04)
- Massachusetts > Plymouth County
- Norwell (0.04)
- Haiti > Ouest
- Port-au-Prince (0.04)
- Europe
- Germany (0.14)
- Middle East > Cyprus (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Netherlands > South Holland
- Delft (0.04)
- Iceland > Capital Region
- Reykjavik (0.04)
- Asia
- Pakistan (0.04)
- North Korea (0.04)
- India (0.04)
- China (0.04)
- Middle East
- Republic of Türkiye > Ankara Province
- Ankara (0.04)
- Lebanon > Beirut Governorate
- Beirut (0.04)
- Israel > Haifa District
- Haifa (0.04)
- Iraq > Baghdad Governorate
- Baghdad (0.04)
- Iran > Tehran Province
- Tehran (0.04)
- Republic of Türkiye > Ankara Province
- Japan > Honshū
- Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Afghanistan > Kabul Province
- Kabul (0.04)
- Genre:
- Research Report > New Finding (1.00)
- Industry:
- Technology:
- Information Technology
- Data Science > Data Mining (1.00)
- Artificial Intelligence
- Representation & Reasoning
- Rule-Based Reasoning (1.00)
- Ontologies (1.00)
- Uncertainty > Bayesian Inference (0.67)
- Natural Language
- Text Processing (1.00)
- Information Extraction (1.00)
- Grammars & Parsing (1.00)
- Machine Learning
- Statistical Learning (1.00)
- Pattern Recognition (0.92)
- Learning Graphical Models
- Undirected Networks > Markov Models (1.00)
- Directed Networks > Bayesian Learning (1.00)
- Representation & Reasoning
- Information Technology