Cause Identification from Aviation Safety Incident Reports via Weakly Supervised Semantic Lexicon Construction
Abedin, M. A., Ng, V., Khan, L.
Journal of Artificial Intelligence Research
The Aviation Safety Reporting System collects voluntarily submitted reports on aviation safety incidents to facilitate research work aiming to reduce such incidents. To effectively reduce these incidents, it is vital to accurately identify why these incidents occurred. More precisely, given a set of possible causes, or shaping factors, this task of cause identification involves identifying all and only those shaping factors that are responsible for the incidents described in a report. We investigate two approaches to cause identification. Both approaches exploit information provided by a semantic lexicon, which is automatically constructed via Thelen and Riloff's Basilisk framework augmented with our linguistic and algorithmic modifications. The first approach labels a report using a simple heuristic, which looks for the words and phrases acquired during the semantic lexicon learning process in the report. The second approach recasts cause identification as a text classification problem, employing supervised and transductive text classification algorithms to learn models from incident reports labeled with shaping factors and using the models to label unseen reports. Our experiments show that both the heuristic-based approach and the learning-based approach (when given sufficient training data) outperform the baseline system significantly.
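The first approach described in the abstract, labeling a report by looking for lexicon entries in its text, can be illustrated with a minimal sketch. This is not the authors' implementation: the lexicon contents, the shaping-factor names, the `label_report` function, and the `min_hits` threshold are all hypothetical and chosen only for illustration. In the paper the lexicon is learned with a modified Basilisk procedure, which is not reproduced here.

```python
import re
from typing import Dict, List, Set

# Hypothetical semantic lexicon: shaping factor -> lowercase words/phrases.
# These entries are invented for illustration, not taken from the paper.
LEXICON: Dict[str, Set[str]] = {
    "Physical Environment": {"fog", "turbulence", "icing"},
    "Fatigue": {"tired", "fatigued", "long duty day"},
}


def label_report(report: str,
                 lexicon: Dict[str, Set[str]],
                 min_hits: int = 1) -> List[str]:
    """Return every shaping factor with at least `min_hits` distinct
    lexicon entries occurring in the report as whole words or phrases."""
    # Normalize: lowercase, keep alphanumeric tokens, join with single spaces.
    tokens = re.findall(r"[a-z0-9']+", report.lower())
    padded = " " + " ".join(tokens) + " "

    labels = []
    for factor, entries in lexicon.items():
        # Padding with spaces makes "fog" match "fog" but not "foggy".
        hits = sum(1 for entry in entries if f" {entry} " in padded)
        if hits >= min_hits:
            labels.append(factor)
    return sorted(labels)


if __name__ == "__main__":
    text = ("We encountered heavy fog on approach and the captain "
            "was tired after a long duty day.")
    print(label_report(text, LEXICON))
```

Because a report may receive several labels or none, the function returns a list, which matches the task statement of identifying all and only the responsible shaping factors. The `min_hits` parameter is one assumed way to trade precision against recall.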
Aug-26-2010
- Country:
- Asia > Middle East
- Jordan (0.04)
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.04)
- North America > United States
- Illinois
- Cook County > Chicago (0.04)
- Lake County > Waukegan (0.04)
- Michigan (0.04)
- Texas > Dallas County
- Richardson (0.04)
- Genre:
- Research Report
- Experimental Study (0.67)
- New Finding (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Machine Learning
- Inductive Learning (0.93)
- Learning Graphical Models > Directed Networks
- Bayesian Learning (0.92)
- Performance Analysis > Accuracy (1.00)
- Statistical Learning > Support Vector Machines (0.68)
- Supervised Learning (0.93)
- Natural Language
- Grammars & Parsing (1.00)
- Text Classification (0.68)
- Text Processing (1.00)