AITopics | Manolescu, Ioana

Integrating connection search in graph queries

Anadiotis, Angelos Christos, Manolescu, Ioana, Mohanty, Madhulika

arXiv.org Artificial Intelligence, Aug-9-2022

Graph data management and querying has many practical applications. When graphs are very heterogeneous and/or users are unfamiliar with their structure, they may need to find how two or more groups of nodes are connected in a graph, even when users are not able to describe the connections. This is only partially supported by existing query languages, which allow searching for paths, but not for trees connecting three or more node groups. The latter is related to the NP-hard Group Steiner Tree problem, and has been previously considered for keyword search in databases. In this work, we formally show how to integrate connecting tree patterns (CTPs, in short) within a graph query language such as SPARQL or Cypher, leading to an Extended Query Language (or EQL, in short). We then study a set of algorithms for evaluating CTPs; we generalize prior keyword search work, most importantly by (i) considering bidirectional edge traversal and (ii) allowing users to select any score function for ranking CTP results. To cope with very large search spaces, we propose an efficient pruning technique and formally establish a large set of cases where our algorithm, MOLESP, is complete even with pruning. Our experiments validate the performance of our CTP and EQL evaluation algorithms on a large set of synthetic and real-world workloads.
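To make the idea of connecting three or more node groups concrete, here is a minimal Python sketch. It is not the MOLESP algorithm from the paper and makes no optimality claim: it is a simple heuristic that runs one breadth-first search per group, traverses every edge in both directions, and joins the groups at the node with the smallest total distance. The function names (`bfs_from_group`, `connect_groups`) and the toy data are hypothetical, chosen only for illustration.

```python
from collections import deque


def bfs_from_group(adj, group):
    """Multi-source BFS; parent pointers lead back toward the group."""
    dist = {n: 0 for n in group}
    parent = {n: None for n in group}
    queue = deque(group)
    while queue:
        u = queue.popleft()
        for v in adj.get(u, ()):
            if v not in dist:
                dist[v] = dist[u] + 1
                parent[v] = u
                queue.append(v)
    return dist, parent


def connect_groups(directed_edges, groups):
    """Return (root, edges) linking one node of each group, or None.

    Heuristic sketch only: the result is a union of shortest paths
    meeting at `root`, not a guaranteed minimal connecting tree.
    """
    # Assumption: every directed edge may be traversed in both directions.
    adj = {}
    for s, t in directed_edges:
        adj.setdefault(s, set()).add(t)
        adj.setdefault(t, set()).add(s)

    searches = [bfs_from_group(adj, g) for g in groups]

    # Candidate roots are the nodes reachable from every group.
    common = set(searches[0][0])
    for dist, _ in searches[1:]:
        common &= set(dist)
    if not common:
        return None

    # Pick the node with the smallest total distance (ties broken by name).
    root = min(common, key=lambda n: (sum(d[n] for d, _ in searches), str(n)))

    # Walk from the root back to each group, collecting undirected edges.
    edges = set()
    for _, parent in searches:
        node = root
        while parent[node] is not None:
            edges.add(frozenset((node, parent[node])))
            node = parent[node]
    return root, edges


# Hypothetical toy graph: no directed path links alice to carol,
# but a connection exists once edges are traversed in both directions.
toy_edges = [("alice", "acme"), ("bob", "acme"),
             ("acme", "paris"), ("carol", "paris")]
result = connect_groups(toy_edges, [["alice"], ["bob"], ["carol"]])
```

In this toy graph a path-only search that respects edge direction would find nothing between `alice` and `carol`, which is why the sketch ignores direction. A score function for ranking alternative results, as the abstract mentions, could replace the plain distance sum used here.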

information retrieval, natural language, node, (18 more...)

arXiv:2208.04802

Country:

Europe (1.00)
Asia (1.00)
North America > United States > California (0.67)
North America > Canada (0.67)

Genre: Research Report (0.50)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.87)

Graph integration of structured, semistructured and unstructured data for data journalism

Anadiotis, Angelos-Christos, Balalau, Oana, Conceicao, Catarina, Galhardas, Helena, Haddad, Mhd Yamen, Manolescu, Ioana, Merabti, Tayeb, You, Jingmao

arXiv.org Artificial Intelligence, Dec-16-2020

Such a query can currently be answered only at a high human effort cost, by inspecting, e.g., a JSON list of Assemblée elected officials (available from NosDeputes.fr) and manually connecting the names with those found in a national registry of companies. This considerable effort may still miss connections that could be found if one added information about politicians' and business people's spouses, information sometimes available in public knowledge bases such as DBPedia, or journalists' notes. No single query language can be used on such heterogeneous data; instead, we study methods to query the corpus by specifying some keywords and asking for all the connections that exist, in one or across several data sources, between these keywords. This problem has been studied under the name of keyword search over structured data, in particular for relational databases [49, 27], XML documents [24, 33], and RDF graphs [30, 16]. However, most of these works assumed a single source of data, in which connections among nodes are clearly identified. When authors considered several data sources [31], they still assumed that each query answer comes from a single data source. In contrast, the ConnectionLens system [10] answers keyword search queries over arbitrary combinations of datasets and heterogeneous data models, independently produced by actors unaware of each other's existence.

information management, node, text processing, (21 more...)

arXiv:2012.0883

Country:

Europe > France (0.28)
Africa > Middle East > Morocco (0.14)

Genre: Research Report > New Finding (0.46)

Industry:

Media > News (1.00)
Government (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.88)

Graph integration of structured, semistructured and unstructured data for data journalism

Balalau, Oana, Conceição, Catarina, Galhardas, Helena, Manolescu, Ioana, Merabti, Tayeb, You, Jingmao, Youssef, Youssr

arXiv.org Artificial Intelligence, Jul-23-2020

Nowadays, journalism is facilitated by the existence of large amounts of digital data sources, including many Open Data ones. Such data sources are extremely heterogeneous, ranging from highly structured (relational databases), semi-structured (JSON, XML, HTML), graphs (e.g., RDF), and text. Journalists (and other classes of users lacking advanced IT expertise, such as most non-governmental-organizations, or small public administrations) need to be able to make sense of such heterogeneous corpora, even if they lack the ability to define and deploy custom extract-transform-load workflows. These are difficult to set up not only for arbitrary heterogeneous inputs, but also given that users may want to add (or remove) datasets to (from) the corpus. We describe a complete approach for integrating dynamic sets of heterogeneous data sources along the lines described above: the challenges we faced to make such graphs useful, allow their integration to scale, and the solutions we proposed for these problems. Our approach is implemented within the ConnectionLens system; we validate it through a set of experiments.

dataset, information management, text processing, (21 more...)

arXiv:2007.12488

Country:

Europe > France > Île-de-France (0.14)
Africa > Middle East > Morocco (0.14)

Genre: Research Report (0.50)

Industry: Media > News (1.00)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(3 more...)

RDFViewS: A Storage Tuning Wizard for RDF Applications

Goasdoué, François, Karanasos, Konstantinos, Leblay, Julien, Manolescu, Ioana

arXiv.org Artificial Intelligence, Aug-12-2010

In recent years, the significant growth of RDF data used in numerous applications has made its efficient and scalable manipulation an important issue. In this paper, we present RDFViewS, a system capable of choosing the most suitable views to materialize, in order to minimize the query response time for a specific SPARQL query workload, while taking into account the view maintenance cost and storage space constraints. Our system employs practical algorithms and heuristics to navigate through the search space of potential view configurations, and exploits the possibly available semantic information - expressed via an RDF Schema - to ensure the completeness of the query evaluation.
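The trade-off described above (query time saved versus view maintenance cost, under a storage limit) can be illustrated with a small Python sketch. This is not the RDFViewS search strategy, which the abstract does not detail; it is a generic greedy heuristic, and the `CandidateView` fields, the `pick_views` function, and all numbers are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidateView:
    name: str
    time_saved: float        # assumed estimate: workload time saved if materialized
    maintenance_cost: float  # assumed estimate: cost of keeping the view up to date
    size: float              # assumed estimate: storage space the view occupies


def pick_views(candidates, storage_budget):
    """Greedily pick views by net benefit per unit of storage.

    Sketch only: a greedy choice is not guaranteed to be optimal.
    """
    # Discard views whose upkeep outweighs the time they save.
    useful = [v for v in candidates
              if v.time_saved > v.maintenance_cost and v.size > 0]
    useful.sort(key=lambda v: (v.time_saved - v.maintenance_cost) / v.size,
                reverse=True)

    chosen, used = [], 0.0
    for view in useful:
        if used + view.size <= storage_budget:
            chosen.append(view)
            used += view.size
    return chosen


# Hypothetical candidates with made-up estimates.
views = [
    CandidateView("v1", time_saved=10, maintenance_cost=2, size=4),
    CandidateView("v2", time_saved=6, maintenance_cost=1, size=1),
    CandidateView("v3", time_saved=3, maintenance_cost=5, size=1),
    CandidateView("v4", time_saved=15, maintenance_cost=0, size=10),
]
selected = pick_views(views, storage_budget=5)
```

A real tuning wizard would derive these estimates from the SPARQL workload and the data, and would likely explore combinations of views rather than ranking them one at a time.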

artificial intelligence, query, rdfviews, (13 more...)

arXiv:1008.2186

Country: North America > Canada > Ontario (0.15)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
