Goto

Collaborating Authors

Ambiverse - an amazing open-source suite for natural language understanding

#artificialintelligence

While doing performance benchmarks for Named Entity Linking solutions for our AI/FinTech start-up Risklio, I stumbled upon a very powerful, only just open-sourced framework called AmbiverseNLU. It was developed by Ambiverse and is based on work previously done at the Max Planck Institute¹. The components it uses are more well-known: entity recognition from KnowNER², open information extraction using ClausIE³ and AIDA, an entity detection and disambiguation tool⁴. You can have a look at the demo here. For the former one you can choose whether to use Apache Cassandra or PostgreSQL as a backend, while the last one uses Neo4j.


Ambiverse - an amazing open-source suite for natural language understanding

#artificialintelligence

While doing performance benchmarks for Named Entity Linking solutions for our AI/FinTech start-up Risklio, I stumbled upon a very powerful, only just open-sourced framework called AmbiverseNLU. It was developed by Ambiverse and is based on work previously done at the Max Planck Institute¹. The components it uses are more well-known: entity recognition from KnowNER², open information extraction using ClausIE³ and AIDA, an entity detection and disambiguation tool⁴. You can have a look at the demo here. For the former one you can choose whether to use Apache Cassandra or PostgreSQL as a backend, while the last one uses Neo4j.


The Enterprise Computing Conference (23d edition) - Sciencesconf.org

#artificialintelligence

Abstract: The phenomenal growth of social media, mobile applications, sensor based technologies and the Internet of Things is generating a flood of "Big Data" and disrupting our world in many ways. Simultaneously, we are seeing many interesting developments in machine learning and Artificial Intelligence (AI) technologies and methods. In this talk I will examine the paradigm shift caused by recent developments in AI and Big Data and ways to harness their power to create a smarter enterprise computing environment. Using examples from health care, smart cities, education, and businesses in general, I will highlight challenges and research opportunities for developing an enterprise of the future. Bio: Sudha Ram is Anheuser-Busch Endowed Professor of MIS, Entrepreneurship & Innovation in the Eller College of Management at the University of Arizona.


Graph integration of structured, semistructured and unstructured data for data journalism

arXiv.org Artificial Intelligence

Nowadays, journalism is facilitated by the existence of large amounts of digital data sources, including many Open Data ones. Such data sources are extremely heterogeneous, ranging from highly struc-tured (relational databases), semi-structured (JSON, XML, HTML), graphs (e.g., RDF), and text. Journalists (and other classes of users lacking advanced IT expertise, such as most non-governmental-organizations, or small public administrations) need to be able to make sense of such heterogeneous corpora, even if they lack the ability to de ne and deploy custom extract-transform-load work ows. These are di cult to set up not only for arbitrary heterogeneous inputs , but also given that users may want to add (or remove) datasets to (from) the corpus. We describe a complete approach for integrating dynamic sets of heterogeneous data sources along the lines described above: the challenges we faced to make such graphs useful, allow their integration to scale, and the solutions we proposed for these problems. Our approach is implemented within the ConnectionLens system; we validate it through a set of experiments.


Commonsense Properties from Query Logs and Question Answering Forums

arXiv.org Artificial Intelligence

Commonsense knowledge about object properties, human behavior and general concepts is crucial for robust AI applications. However, automatic acquisition of this knowledge is challenging because of sparseness and bias in online sources. This paper presents Quasimodo, a methodology and tool suite for distilling commonsense properties from non-standard web sources. We devise novel ways of tapping into search-engine query logs and QA forums, and combining the resulting candidate assertions with statistical cues from encyclopedias, books and image tags in a corroboration step. Unlike prior work on commonsense knowledge bases, Quasimodo focuses on salient properties that are typically associated with certain objects or concepts. Extensive evaluations, including extrinsic use-case studies, show that Quasimodo provides better coverage than state-of-the-art baselines with comparable quality.