Semantic Representation and Inference for NLP
–arXiv.org Artificial Intelligence
Semantic representation and inference is essential for Natural Language Processing (NLP). The state of the art for semantic representation and inference is deep learning, and particularly Recurrent Neural Networks (RNNs), Convolutional Neural Networks (CNNs), and transformer Self-Attention models. This thesis investigates the use of deep learning for novel semantic representation and inference, and makes contributions in the following three areas: creating training data, improving semantic representations and extending inference learning. In terms of creating training data, we contribute the largest publicly available dataset of real-life factual claims for the purpose of automatic claim verification (MultiFC), and we present a novel inference model composed of multi-scale CNNs with different kernel sizes that learn from external sources to infer fact checking labels. In terms of improving semantic representations, we contribute a novel model that captures non-compositional semantic indicators. By definition, the meaning of a non-compositional phrase cannot be inferred from the individual meanings of its composing words (e.g., hot dog). Motivated by this, we operationalize the compositionality of a phrase contextually by enriching the phrase representation with external word embeddings and knowledge graphs. Finally, in terms of inference learning, we propose a series of novel deep learning architectures that improve inference by using syntactic dependencies, by ensembling role guided attention heads, incorporating gating layers, and concatenating multiple heads in novel and effective ways. This thesis consists of seven publications (five published and two under review).
arXiv.org Artificial Intelligence
Jun-15-2021
- Country:
- Oceania > Australia
- Queensland (0.04)
- North America
- Mexico (0.14)
- United States
- Florida (0.04)
- Wisconsin (0.04)
- Ohio (0.04)
- District of Columbia > Washington (0.04)
- Texas > Travis County
- Austin (0.04)
- Michigan > Washtenaw County
- Ann Arbor (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Washington > King County
- Seattle (0.04)
- California
- San Francisco County > San Francisco (0.14)
- Santa Clara County > Palo Alto (0.04)
- San Diego County > San Diego (0.04)
- New York > New York County
- New York City (0.04)
- Trinidad and Tobago > Trinidad
- Canada > Quebec
- Montreal (0.04)
- Europe
- Germany > Berlin (0.04)
- Czechia > Prague (0.04)
- Russia (0.04)
- Italy > Tuscany
- Florence (0.04)
- France > Occitanie
- Haute-Garonne > Toulouse (0.04)
- Denmark
- Capital Region > Copenhagen (0.04)
- North Jutland > Aalborg (0.04)
- Bulgaria > Varna Province
- Varna (0.04)
- Belgium > Brussels-Capital Region
- Brussels (0.04)
- Asia
- Russia (0.04)
- Thailand > Chiang Mai
- Chiang Mai (0.04)
- Middle East
- China
- Africa > Ethiopia
- Addis Ababa > Addis Ababa (0.04)
- Oceania > Australia
- Genre:
- Research Report
- Promising Solution (1.00)
- New Finding (1.00)
- Research Report
- Industry:
- Information Technology (1.00)
- Media > News (0.92)
- Government > Regional Government