Data-gov Wiki: Towards Linking Government Data
Ding, Li (Rensselaer Polytechnic Institute) | Difranzo, Dominic (Rensselaer Polytechnic Institute) | Graves, Alvaro (Rensselaer Polytechnic Institute) | Michaelis, James R (Rensselaer Polytechnic Institute) | Li, Xian (Rensselaer Polytechnic Institute) | McGuinness, Deborah L (Rensselaer Polytechnic Institute) | Hendler, Jim (Rensselaer Polytechnic Institute)
Data.gov is a website that provides US Government data to the general public to ensure better accountability and transparency. Our recent work on the Data-gov Wiki, which attempts to integrate the datasets published at Data.gov into the Linking Open Data (LOD) cloud (yielding "linked government data"), has produced 5 billion triples – covering a range of topics including: government spending, environmental records, and statistics on the cost and usage of public services. In this paper, we investigate the role of Semantic Web technologies in converting, enhancing and using linked government data. In particular, we show how government data can be (i) inter-linked by sharing the same terms and URIs, (ii) linked to existing data sources ranging from the LOD cloud (e.g. DBpedia) to the conventional web (e.g. the New York Times), and (iii) cross-linked by their knowledge provenance (which captures, among other things, derivation and revision histories).
Mar-22-2010