EmbNum: Semantic labeling for numerical values with deep metric learning

Nguyen, Phuc, Nguyen, Khai, Ichise, Ryutaro, Takeda, Hideaki

arXiv.org Machine Learning 

Semantic labeling is a task of matching unknown data source to labeled data sources. The semantic labels could be properties, classes in knowledge bases or labeled data are manually annotated by domain experts. In this paper, we presentEmbNum, a novel approach to match numerical columns from different table data sources. We use a representation network architecture consisting of triplet network and convolutional neural network to learn a mapping function from numerical columns toa transformed space. In this space, the Euclidean distance can be used to measure "semantic similarity" of two columns. Our experiments onCity-Data and Open-Data demonstrate thatEmbNumachieves considerable improvements in comparison with the state-of-the-art methods in effectiveness and efficiency.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found