Exploiting Semantics for Big Data Integration

AI Magazine 

An equally important dimension of big data is variety, where the focus is to process highly heterogeneous data sets. We describe how we use semantics to address the problem of big data variety. We also describe Karma, a system that implements our approach and show how Karma can be applied to integrate data in the cultural heritage domain. In this use case, Karma integrates data across many museums even though the data sets from different museums are highly heterogeneous. Volume refers to the problem of how to deal with very large data sets, which typically requires execution in a distributed cloud-based infrastructure.