Ontologies
Understanding Semantic Web and Ontologies: Theory and Applications
One of the most interesting inventions, in recent decades, is that of Web Services [36]. These are computer program "applications": self-describing, selfcontained applications whose function is to automatically share information over the Internet with other applications. Some weaknesses such as browsing information without taking its meaning into account have recently appeared in Web Services. This creates a need for a new Web with more relevance to the user. Semantic Web is actually an extension of the current one in that it represents information more meaningfully for humans and computers alike. It enables the description of contents and services in machine-readable form, and enables annotating, discovering, publishing, advertising and composing services to be automated. It was developed based on Ontology, which is considered as the backbone of the Semantic Web. In other words, the current Web is transformed from being machine-readable to machineunderstandable. One function of the Web is to build a source of reference for information on several subjects, while the Semantic Web is designed to build a web of meaning.
sTeX+ - a System for Flexible Formalization of Linked Data
Kohlhase, Andrea, Kohlhase, Michael, Lange, Christoph
We present the sTeX+ system, a user-driven advancement of sTeX - a semantic extension of LaTeX that allows for producing high-quality PDF documents for (proof)reading and printing, as well as semantic XML/OMDoc documents for the Web or further processing. Originally sTeX had been created as an invasive, semantic frontend for authoring XML documents. Here, we used sTeX in a Software Engineering case study as a formalization tool. In order to deal with modular pre-semantic vocabularies and relations, we upgraded it to sTeX+ in a participatory design process. We present a tool chain that starts with an sTeX+ editor and ultimately serves the generated documents as XHTML+RDFa Linked Data via an OMDoc-enabled, versioned XML database. In the final output, all structural annotations are preserved in order to enable semantic information retrieval services.
An Efficient Technique for Similarity Identification between Ontologies
Farooq, Amjad, Ahsan, Syed, Shah, Abad
Ontologies usually suffer from the semantic heterogeneity when simultaneously used in information sharing, merging, integrating and querying processes. Therefore, the similarity identification between ontologies being used becomes a mandatory task for all these processes to handle the problem of semantic heterogeneity. In this paper, we propose an efficient technique for similarity measurement between two ontologies. The proposed technique identifies all candidate pairs of similar concepts without omitting any similar pair. The proposed technique can be used in different types of operations on ontologies such as merging, mapping and aligning. By analyzing its results a reasonable improvement in terms of completeness, correctness and overall quality of the results has been found.
Embedding Non-Ground Logic Programs into Autoepistemic Logic for Knowledge Base Combination
de Bruijn, Jos, Eiter, Thomas, Polleres, Axel, Tompits, Hans
Adopting a layered architecture, a number of building blocks have been proposed that serve different purposes, from low-level data encoding to high-level semantic representation. In this architecture, the building blocks for ontologies, rules, and query languages play a prominent role. Furthermore, to ensure interoperability and wide applicability, standard representation formalisms are propagated by the World Wide Web Consortium(W3C), including the Resource Description Framework (RDF) [RDF Concepts 2004; RDF Semantics 2004], the Web Ontology Language (OWL) [OWL Semantics 2004; OWL 2 2009], and the recent Rule Interchange Format Basic Logic Dialect (RIF BLD) [RIF BLD 2009]. In addition, the RIF logical framework [Kifer 2008] lays the foundation for Web rule languages extending RIF BLD with nonmonotonic negation. Each of these formalisms has a formal semantics, which is either expressible in terms of classical logic or logic programming [de Bruijn and Heymans 2007; Horrocks and Patel-Schneider 2003; Kifer 2008]. There is a need for combining these formalisms, which is illustrated by the following simple example.
From RESTful Services to RDF: Connecting the Web and the Semantic Web
RESTful services on the Web expose information through retrievable resource representations that represent self-describing descriptions of resources, and through the way how these resources are interlinked through the hyperlinks that can be found in those representations. This basic design of RESTful services means that for extracting the most useful information from a service, it is necessary to understand a service's representations, which means both the semantics in terms of describing a resource, and also its semantics in terms of describing its linkage with other resources. Based on the Resource Linking Language (ReLL), this paper describes a framework for how RESTful services can be described, and how these descriptions can then be used to harvest information from these services. Building on this framework, a layered model of RESTful service semantics allows to represent a service's information in RDF/OWL. Because REST is based on the linkage between resources, the same model can be used for aggregating and interlinking multiple services for extracting RDF data from sets of RESTful services.
The Dilated Triple
Rodriguez, Marko A., Pepe, Alberto, Shinavier, Joshua
The basic unit of meaning on the Semantic Web is the RDF statement, or triple, which combines a distinct subject, predicate and object to make a definite assertion about the world. A set of triples constitutes a graph, to which they give a collective meaning. It is upon this simple foundation that the rich, complex knowledge structures of the Semantic Web are built. Yet the very expressiveness of RDF, by inviting comparison with real-world knowledge, highlights a fundamental shortcoming, in that RDF is limited to statements of absolute fact, independent of the context in which a statement is asserted. This is in stark contrast with the thoroughly context-sensitive nature of human thought. The model presented here provides a particularly simple means of contextualizing an RDF triple by associating it with related statements in the same graph. This approach, in combination with a notion of graph similarity, is sufficient to select only those statements from an RDF graph which are subjectively most relevant to the context of the requesting process.
General-Purpose Computing on a Semantic Network Substrate
A semantic network is a directed labeled graph (Sowa, 1991). The thesis of this article is that the state of a computing machine, its low-level instructions, and the executing program can be represented as a semantic network. The computational model that is presented can be instantiated using any semantic network representation. However, given the existence of the Resource Description Framework (RDF) (Manola & Miller, 2004) and the popular Web Ontology Language (OWL) (McGuinness & Harmelen, 2004), this article presents the theory and the application in terms of these constructs. The computing model that is proposed is perhaps simple in theory, but in application, requires a relatively strong background in the computer sciences.
Constructing Reference Sets from Unstructured, Ungrammatical Text
Michelson, M., Knoblock, C. A.
Vast amounts of text on the Web are unstructured and ungrammatical, such as classified ads, auction listings, forum postings, etc. We call such text posts. Despite their inconsistent structure and lack of grammar, posts are full of useful information. This paper presents work on semi-automatically building tables of relational information, called reference sets, by analyzing such posts directly. Reference sets can be applied to a number of tasks such as ontology maintenance and information extraction. Our reference-set construction method starts with just a small amount of background knowledge, and constructs tuples representing the entities in the posts to form a reference set. We also describe an extension to this approach for the special case where even this small amount of background knowledge is impossible to discover and use. To evaluate the utility of the machine-constructed reference sets, we compare them to manually constructed reference sets in the context of reference-set-based information extraction. Our results show the reference sets constructed by our method outperform manually constructed reference sets. We also compare the reference-set-based extraction approach using the machine-constructed reference set to supervised extraction approaches using generic features. These results demonstrate that using machine-constructed reference sets outperforms the supervised methods, even though the supervised methods require training data.
Semantics for Digital Engineering Archives Supporting Engineering Design Education
Regli, William C. (Drexel University) | Kopena, Joseph B. (Drexel University) | Grauer, Michael (Drexel University) | Simpson, Timothy W. (Penn State University) | Stone, Robert B. (Oregon State University) | Lewis, Kemper (University at Buffalo - SUNY) | Bohm, Matt R. (Oregon State University) | Wilkie, David (Drexel University) | Piecyk, Martin (Drexel University) | Osecki, Jordan (Drexel University)
This article introduces the challenge of digital preservation in the area of engineering design and manufacturing and presents a methodology to apply knowledge representation and semantic techniques to develop Digital Engineering Archives. This work is part of an ongoing, multiuniversity, effort to create cyber infrastructure-based engineering repositories for undergraduates (CIBER-U) to support engineering design education. The technical approach is to use knowledge representation techniques to create formal models of engineering data elements, workflows and processes. With these formal engineering knowledge and processes can be captured and preserved with some guarantee of long-term interpretability. The article presents examples of how the techniques can be used to encode specific engineering information packages and workflows. These techniques are being integrated into a semantic wiki that supports the CIBER-U engineering education activities across nine universities and involving over 3500 students since 2006.
Towards a Logic of Feature-Based Semantic Science Theories
Poole, David (University of British Columbia,)
The aim of semantic science is to allow for the publications of ontologies, observation data, and hypotheses/theories. Hypotheses make predictions on data and on new cases. Those hypotheses that fit the available evidence are called theories. This paper considers how thoeries can be used for predictions in new cases. Theories are typically very narrow and not all of the inputs to a theory are observed, so to make predictions on a particular case, many theories need to be used. Without any global design, the available theories do not necessarily fit together nicely. This paper explains how theories can be combined into theory ensembles to make predictions on a particular case. This is needed to evaluate theories, and to make useful predictions. We motivate and give desiderata for theory ensembles for level 1, feature-based, semantic science, which assumes that the data and the theories can be described in terms of features (random variables).