EmbNum: Semantic labeling for numerical values with deep metric learning
Nguyen, Phuc, Nguyen, Khai, Ichise, Ryutaro, Takeda, Hideaki
Semantic labeling is a task of matching unknown data source to labeled data sources. The semantic labels could be properties, classes in knowledge bases or labeled data are manually annotated by domain experts. In this paper, we present EmbNum, a novel approach to match numerical columns from different table data sources. We use a representation network architecture consisting of triplet network and convolutional neural network to learn a mapping function from numerical columns to a transformed space. In this space, the Euclidean distance can be used to measure "semantic similarity" of two columns. Our experiments on City-Data and Open-Data demonstrate that EmbNum achieves considerable improvements in comparison with the state-of-the-art methods on effectiveness and efficiency.
Jun-25-2018
- Country:
- Oceania > Australia (0.14)
- Europe > Ireland (0.04)
- North America
- Canada (0.04)
- United States
- District of Columbia > Washington (0.04)
- New York > New York County
- New York City (0.04)
- Asia > Japan
- Honshū > Kantō
- Tokyo Metropolis Prefecture > Tokyo (0.14)
- Kanagawa Prefecture (0.04)
- Honshū > Kantō
- Genre:
- Research Report
- Promising Solution (0.54)
- New Finding (0.46)
- Research Report
- Technology: