Disambiguation and Filtering Methods in Using Web Knowledge for Coreference Resolution
Uryupina, Olga (CiMEC, University of Trento) | Poesio, Massimo (CiMEC, University of Trento) | Giuliano, Claudio (Fondazione Bruno Kessler) | Tymoshenko, Kateryna (Fondazione Bruno Kessler)
We investigate two publicly available web knowledge bases, Wikipedia and Yago, in an attempt to leverage semantic information and increase the performance level of a state-of-the-art coreference resolution (CR) engine. We extract semantic compatibility and aliasing information from Wikipedia and Yago, and incorporate it into a CR system. We show that using such knowledge with no disambiguation and filtering does not bring any improvement over the baseline, mirroring the previous findings. We propose, therefore, a number of solutions to reduce the amount of noise coming from web resources: using disambiguation tools for Wikipedia, pruning Yago to eliminate the most generic categories and imposing additional constraints on affected mentions. Our evaluation experiments on the ACE-02 corpus show that the knowledge, extracted from Wikipedia and Yago, improves our system's performance by 2-3 percentage points.
May-18-2011
- Country:
- North America > United States
- Wisconsin (0.04)
- New York (0.04)
- Connecticut (0.04)
- Massachusetts > Middlesex County
- Cambridge (0.04)
- California
- Los Angeles County > Los Angeles (0.04)
- San Francisco County > San Francisco (0.04)
- San Diego County > San Diego (0.04)
- Europe
- Czechia (0.04)
- Poland (0.04)
- Hungary (0.04)
- Central Europe (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Asia
- Afghanistan (0.05)
- South Korea (0.04)
- Pakistan (0.04)
- India (0.04)
- North America > United States
- Genre:
- Research Report (0.47)
- Industry:
- Government (0.68)
- Technology: