Learning to Predict from Textual Data
Radinsky, K., Davidovich, S., Markovitch, S.
Journal of Artificial Intelligence Research
Given a current news event, we tackle the problem of generating plausible predictions of future events it might cause. We present a new methodology for modeling and predicting such future news events using machine learning and data mining techniques. Our Pundit algorithm generalizes examples of causality pairs to infer a causality predictor. To obtain precisely labeled causality examples, we mine 150 years of news articles and apply semantic natural language modeling techniques to headlines containing certain predefined causality patterns. For generalization, the model uses a vast number of world knowledge ontologies. Empirical evaluation on real news articles shows that our Pundit algorithm performs as well as non-expert humans.
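As a rough illustration of what mining headlines for "predefined causality patterns" could look like, the sketch below pulls (cause, effect) pairs out of headlines with a few regular expressions. The connectives, the `CAUSAL_PATTERNS` list, and the `extract_causality_pair` function are assumptions made for this example; the abstract does not list the actual patterns, and the semantic modeling and ontology-based generalization steps are not reproduced here.

```python
import re

# Hypothetical causality templates (assumed for illustration only; the
# abstract does not specify which patterns Pundit uses).
CAUSAL_PATTERNS = [
    # "<effect> after <cause>"
    re.compile(r"^(?P<effect>.+?)\s+after\s+(?P<cause>.+)$", re.IGNORECASE),
    # "<effect> because of <cause>"
    re.compile(r"^(?P<effect>.+?)\s+because of\s+(?P<cause>.+)$", re.IGNORECASE),
    # "<cause> causes / leads to <effect>"
    re.compile(
        r"^(?P<cause>.+?)\s+(?:causes|caused|leads to|led to)\s+(?P<effect>.+)$",
        re.IGNORECASE,
    ),
]


def extract_causality_pair(headline):
    """Return (cause, effect) if the headline matches a template, else None."""
    text = headline.strip()
    for pattern in CAUSAL_PATTERNS:
        match = pattern.match(text)
        if match:
            return match.group("cause").strip(), match.group("effect").strip()
    return None


if __name__ == "__main__":
    for headline in [
        "Troops deployed after earthquake hits city",
        "Flooding leads to evacuations",
        "Markets open higher",
    ]:
        print(headline, "->", extract_causality_pair(headline))
```

Surface templates like these only yield raw text pairs. A learner in the spirit of the abstract would still need to map each side to structured events before generalizing across examples.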
Dec-26-2012
- Country:
  - Antarctica (0.04)
  - Asia
    - Afghanistan > Kabul Province
      - Kabul (0.04)
    - China (0.04)
    - India (0.04)
    - Japan > Honshū
      - Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
    - Middle East
      - Iran > Tehran Province
        - Tehran (0.04)
      - Iraq > Baghdad Governorate
        - Baghdad (0.04)
      - Israel > Haifa District
        - Haifa (0.04)
      - Lebanon > Beirut Governorate
        - Beirut (0.04)
      - Republic of Türkiye > Ankara Province
        - Ankara (0.04)
    - North Korea (0.04)
    - Pakistan (0.04)
  - Europe
    - Germany (0.14)
    - Iceland > Capital Region
      - Reykjavik (0.04)
    - Middle East > Cyprus (0.04)
    - Netherlands > South Holland
      - Delft (0.04)
    - United Kingdom > England
      - Cambridgeshire > Cambridge (0.04)
  - Indian Ocean (0.04)
  - North America
    - Haiti > Ouest
      - Port-au-Prince (0.04)
    - Mexico (0.04)
    - United States
      - California (0.04)
      - Nebraska (0.04)
      - Louisiana (0.04)
      - New Jersey (0.04)
      - North Carolina (0.04)
      - Florida > Brevard County (0.04)
      - Arizona (0.04)
      - New York (0.04)
      - Texas (0.04)
      - Massachusetts > Plymouth County
        - Norwell (0.04)
  - Oceania
    - Australia (0.04)
    - Solomon Islands (0.04)
  - Pacific Ocean (0.04)
  - South America > Brazil (0.04)
- Genre:
  - Research Report > New Finding (1.00)
- Industry:
- Technology:
  - Information Technology
    - Artificial Intelligence
      - Machine Learning
        - Learning Graphical Models
          - Directed Networks > Bayesian Learning (1.00)
          - Undirected Networks > Markov Models (1.00)
        - Pattern Recognition (0.92)
        - Statistical Learning (1.00)
      - Natural Language
        - Grammars & Parsing (1.00)
        - Information Extraction (1.00)
        - Text Processing (1.00)
      - Representation & Reasoning
        - Ontologies (1.00)
        - Rule-Based Reasoning (1.00)
        - Uncertainty > Bayesian Inference (0.67)
    - Data Science > Data Mining (1.00)