Ontologies
On the Satisfiability Problem for SPARQL Patterns
Zhang, Xiaowang, Van den Bussche, Jan, Picalausa, François
The satisfiability problem for SPARQL 1.0 patterns is undecidable in general, since the relational algebra can be emulated using such patterns. The goal of this paper is to delineate the boundary of decidability of satisfiability in terms of the constraints allowed in filter conditions. The classes of constraints considered are bound-constraints, negated bound- constraints, equalities, nonequalities, constant-equalities, and constant-nonequalities. The main result of the paper can be summarized by saying that, as soon as inconsistent filter conditions can be formed, satisfiability is undecidable. The key insight in each case is to find a way to emulate the set difference operation. Undecidability can then be obtained from a known undecidability result for the algebra of binary relations with union, composition, and set difference. When no inconsistent filter conditions can be formed, satisfiability is decidable by syntactic checks on bound variables and on the use of literals. Although the problem is shown to be NP-complete, it is experimentally shown that the checks can be implemented efficiently in practice. The paper also points out that satisfiability for the so-called ‘well-designed’ patterns can be decided by a check on bound variables and a check for inconsistent filter conditions.
ConferenceCall 2016 03 17 - OntologPSMW
Phone (US): 1 (425) 440-5100 ... (long distance cost may apply) (1C4A) Unfamiliar with how to do this on Skype? Add the contact "join.conference" to your skype contact list first. To participate in the teleconference, make a skype call to "join.conference", then open the dial pad (see platform-specific instructions below) and enter the Conference ID: 843758# when prompted. You can indicate that you want to ask a question verbally by clicking on the "hand" button, and wait for the moderator to call on you; or, type and send your question into the chat window at the bottom of the screen. Just add the room as a buddy - (in our case here) summit_20160317@soaphub.org ... Handy for mobile devices!
Helping Novices Avoid the Hazards of Data: Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources
Janpuangtong, Sasin (Texas A&M University) | Shell, Dylan A. (Texas A&M University)
This article describes an end-to-end learning framework that allows a novice to create models from data easily by helping structure the model building process and capturing extended aspects of domain knowledge. By treating the whole modeling process interactively and exploiting high-level knowledge in the form of an ontology, the framework is able to aid the user in a number of ways, including in helping to avoid pitfalls such as data dredging. We describe how the framework automatically exploits structured knowledge in an ontology to identify relevant concepts, and how a data extraction component can make use of online data sources to find measurements of those concepts so that their relevance can be evaluated. Prediction error on unseen examples of these models show that our framework, making use of the ontology, helps to improve model generalization.
Helping Novices Avoid the Hazards of Data: Leveraging Ontologies to Improve Model Generalization Automatically with Online Data Sources
Janpuangtong, Sasin (Texas A&M University) | Shell, Dylan A. (Texas A&M University)
The infrastructure and tools necessary for large-scale data analytics, formerly the exclusive purview of experts, are increasingly available. Whereas a knowledgeable data-miner or domain expert can rightly be expected to exercise caution when required (for example, around fallacious conclusions supposedly supported by the data), the nonexpert may benefit from some judicious assistance. This article describes an end-to-end learning framework that allows a novice to create models from data easily by helping structure the model building process and capturing extended aspects of domain knowledge. By treating the whole modeling process interactively and exploiting high-level knowledge in the form of an ontology, the framework is able to aid the user in a number of ways, including in helping to avoid pitfalls such as data dredging. Prudence must be exercised to avoid these hazards as certain conclusions may only be supported if, for example, there is extra knowledge which gives reason to trust a narrower set of hypotheses. This article adopts the solution of using higher-level knowledge to allow this sort of domain knowledge to be used automatically, selecting relevant input attributes, and thence constraining the hypothesis space. We describe how the framework automatically exploits structured knowledge in an ontology to identify relevant concepts, and how a data extraction component can make use of online data sources to find measurements of those concepts so that their relevance can be evaluated. To validate our approach, models of four different problem domains were built using our implementation of the framework. Prediction error on unseen examples of these models show that our framework, making use of the ontology, helps to improve model generalization.
THE PAST (Entity-Attribute-Value) vs THE FUTURE (Sign, Signifier, Signified)
In both semantic model standards Topic Maps and RDF/OWL and in many other NoSQL approaches to solve efficiently the problem of how to represent relations and relationships one major stumbling block is raised beyond all efforts: the namespace. It is a language problem, the babel we have in our civilized world is transferred into our IT systems. But machines do not have to understand our language, we do. The problem here is that from a semantic point of view, similar diagrams are in need from users that want to express business processes but when we reach the implementation stage software engineers have to marry business requirements with the technical constrains of the database system hence the ER diagram you see. Generally speaking this is known as "The Model", a conceptual view of the user on data. The ER version of the model has several limitations, due to the architecture of RDBMS.
On the satisfiability problem for SPARQL patterns
Zhang, Xiaowang, Bussche, Jan Van den, Picalausa, François
The satisfiability problem for SPARQL patterns is undecidable in general, since the expressive power of SPARQL 1.0 is comparable with that of the relational algebra. The goal of this paper is to delineate the boundary of decidability of satisfiability in terms of the constraints allowed in filter conditions. The classes of constraints considered are bound-constraints, negated bound-constraints, equalities, nonequalities, constant-equalities, and constant-nonequalities. The main result of the paper can be summarized by saying that, as soon as inconsistent filter conditions can be formed, satisfiability is undecidable. The key insight in each case is to find a way to emulate the set difference operation. Undecidability can then be obtained from a known undecidability result for the algebra of binary relations with union, composition, and set difference. When no inconsistent filter conditions can be formed, satisfiability is efficiently decidable by simple checks on bound variables and on the use of literals. The paper also points out that satisfiability for the so-called `well-designed' patterns can be decided by a check on bound variables and a check for inconsistent filter conditions.
Query and Predicate Emptiness in Ontology-Based Data Access
Baader, Franz, Bienvenu, Meghyn, Lutz, Carsten, Wolter, Frank
In ontology-based data access (OBDA), database querying is enriched with an ontology that provides domain knowledge and additional vocabulary for query formulation. We identify query emptiness and predicate emptiness as two central reasoning services in this context. Query emptiness asks whether a given query has an empty answer over all databases formulated in a given vocabulary. Predicate emptiness is defined analogously, but quantifies universally over all queries that contain a given predicate. In this paper, we determine the computational complexity of query emptiness and predicate emptiness in the EL, DL-Lite, and ALC-families of description logics, investigate the connection to ontology modules, and perform a practical case study to evaluate the new reasoning services.
Cognitive Architect/siliconarmada.com
Job Description In this role, you'll be part of our European consulting team that is helping clients to design and deliver innovative solutions based on Cognitive Computing approaches - in particular based on IBM WATSON technology. We're looking for experienced professionals who have proven expertise in one or multiple of the areas of Artificial Intelligence, Natural Language Processing, Semantic Technologies, Information Retrieval or Machine Learning. You'll provide advisory and implementation expertise to our clients including: use case and business case development for Cognitive Computing solutions; proof of concept execution to prove the value of Cognitive Computing use cases; solution outline and design of Cognitive Computing systems; as well as supporting business development activities. You'll have strong experience in designing and building innovative solutions based on the above technologies but you will also be have the expertise and architectural mindset to relate and integrate such solutions with existing client system infrastructures, such as e.g. Proven hands-on experience in conducting analyses on unstructured as well as structured / semi-structured data and in working with state-of the art technologies in Cognitive Computing will be expected.
AI and cognitive computing: how to distinguish the real value proposition
Google has developed some awesome mobile applications that realize visual and audio recognition. Google just recently announced an open source Natural Language Understanding (NLU) system called SyntaxNet. This NLU system is built upon Google's TensorFlow, an open source neural network framework. Google has been able to achieve an overall 90 percent accuracy rate with their system. This is quite an accomplishment from just ten years ago, where part of speech tagging consisted of simply identifying entity extraction (verbs, nouns, etc.).