AITopics | Information Retrieval

Collaborating Authors

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer fill the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but still usually requires time-consuming serial searching of one database after the other, and then moving on to other methods of searching for internet sources. And what if the information being sought is a sound byte? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where widely different formats contain the information being sought, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

News Overviews Instructional Materials AI-Alerts Classics

Two-Layer Generalization Analysis for Ranking Using Rademacher Average

Chen, Wei, Liu, Tie-yan, Ma, Zhi-ming

Neural Information Processing SystemsDec-31-2010

This paper is concerned with the generalization analysis on learning to rank for information retrieval (IR). In IR, data are hierarchically organized, i.e., consisting of queries and documents per query. Previous generalization analysis for ranking, however, has not fully considered this structure, and cannot explain how the simultaneous change of query number and document number in the training data will affect the performance of algorithms. In this paper, we propose performing generalization analysis under the assumption of two-layer sampling, i.e., the i.i.d. sampling of queries and the conditional i.i.d sampling of documents per query. Such a sampling can better describe the generation mechanism of real data, and the corresponding generalization analysis can better explain the real behaviors of learning to rank algorithms. However, it is challenging to perform such analysis, because the documents associated with different queries are not identically distributed, and the documents associated with the same query become no longer independent if represented by features extracted from the matching between document and query. To tackle the challenge, we decompose the generalization error according to the two layers, and make use of the new concept of two-layer Rademacher average. The generalization bounds we obtained are quite intuitive and are in accordance with previous empirical studies on the performance of ranking algorithms.

information retrieval, machine learning, natural language, (19 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)

Add feedback

Ontology-based Queries over Cancer Data

Gonzalez-Beltran, Alejandra, Tagger, Ben, Finkelstein, Anthony

arXiv.org Artificial IntelligenceDec-26-2010

The ever-increasing amount of data in biomedical research, and in cancer research in particular, needs to be managed to support efficient data access, exchange and integration. Existing software infrastructures, such caGrid, support access to distributed information annotated with a domain ontology. However, caGrid's current querying functionality depends on the structure of individual data resources without exploiting the semantic annotations. In this paper, we present the design and development of an ontology-based querying functionality that consists of: the generation of OWL2 ontologies from the underlying data resources metadata and a query rewriting and translation process based on reasoning, which converts a query at the domain ontology level into queries at the software infrastructure level. We present a detailed analysis of our approach as well as an extensive performance evaluation. While the implementation and evaluation was performed for the caGrid infrastructure, the approach could be applicable to other model and metadata-driven environments for data sharing.

artificial intelligence, information retrieval query processing, natural language, (19 more...)

arXiv.org Artificial Intelligence

1012.5506

Country: Europe > United Kingdom (0.14)

Genre: Research Report (1.00)

Industry: Health & Medicine > Therapeutic Area > Oncology (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.46)

Add feedback

How Quantum Theory Is Developing the Field of Information Retrieval

AAAI ConferencesNov-5-2010

This position paper provides an overview of work conducted and an outlook of future directions within the field of Information Retrieval (IR) that aims to develop novel models, methods and frameworks inspired by Quantum Theory (QT).

information, representation, van rijsbergen, (14 more...)

AAAI Conferences

2010 AAAI Fall Symposium Series

Country:

Europe > Spain (0.05)
Oceania > Australia > Queensland (0.04)
Europe > Italy (0.04)
Asia > China > Tianjin Province > Tianjin (0.04)

Genre:

Overview (0.89)
Research Report (0.89)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

Explanation of Relevance Judgement Discrepancy with Quantum Interference

Wang, Jun (Robert Gordon University) | Song, Dawei (Robert Gordon University) | Zhang, Peng (Robert Gordon University) | Hou, Yuexian (Tianjin University) | Bruza, Peter (Queensland University of Techonology )

AAAI ConferencesNov-5-2010

A key concept in many Information Retrieval (IR) tasks, e.g. document indexing, query language modelling, aspect and diversity retrieval, is the relevance measurement of topics, i.e. to what extent an information object (e.g. a document or a query) is about the topics. This paper investigates the interference of relevance measurement of a topic caused by another topic. For example, consider that two user groups are required to judge whether a topic q is relevant to a document d, and q is presented together with another topic (referred to as a companion topic). If different companion topics are used for different groups, interestingly different relevance probabilities of q given d can be reached. In this paper, we present empirical results showing that the relevance of a topic to a document is greatly affected by the companion topic’s relevance to the same document, and the extent of the impact differs with respect to different companion topics. We further analyse the phenomenon from classical and quantum-like interference perspectives, and connect the phenomenon to nonreality and contextuality in quantum mechanics. We demonstrate that quantum like model fits in the empirical data, could be potentially used for predicting the relevance when interference exists.

artificial intelligence, information retrieval, natural language, (15 more...)

AAAI Conferences

2010 AAAI Fall Symposium Series

Country:

Oceania > Australia > Queensland (0.04)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > Scotland (0.04)
(6 more...)

Genre: Research Report (0.74)

Industry: Leisure & Entertainment (0.46)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)

Add feedback

Logical Leaps and Quantum Connectives: Forging Paths through Predication Space

Cohen, Trevor (Center for Cognitive Informatics and Decision Making, School of Biomedical Informatics, University of Texas Health Science Center at Houston) | Widdows, Dominic (Google, Inc.) | Schvaneveldt, Roger W. (Arizona State University) | Rindflesch, Thomas C. (National Library of Medicine)

AAAI ConferencesNov-5-2010

The Predication-based Semantic Indexing (PSI) approach encodes both symbolic and distributional information into a semantic space using a permutation-based variant of Random Indexing. In this paper, we develop and evaluate a computational model of abductive reasoning based on PSI. Using distributional information, we identify pairs of concepts that are likely to be predicated about a common third concept, or middle term. As this occurs without the explicit identification of the middle term concerned, we refer to this process as a “logical leap”. Subsequently, we use further operations in the PSI space to retrieve this middle term and identify the predicate types involved. On evaluation using a set of 1000 randomly selected cue concepts, the model is shown to retrieve with accuracy concepts that can be connected to a cue concept by a middle term, as well as the middle term concerned, using nearest-neighbor search in the PSI space. The utility of quantum logical operators as a means to identify alternative paths through this space is also explored.

artificial intelligence, information retrieval, natural language, (19 more...)

AAAI Conferences

2010 AAAI Fall Symposium Series

Country:

North America > United States > Texas (0.04)
North America > United States > Arizona (0.04)

Genre: Research Report (0.68)

Industry: Health & Medicine > Therapeutic Area (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.48)
Information Technology > Artificial Intelligence > Representation & Reasoning > Abductive Reasoning (0.35)

Add feedback

An Agent based Approach towards Metadata Extraction, Modelling and Information Retrieval over the Web

Ahmed, Zeeshan, Gerhard, Detlef

arXiv.org Artificial IntelligenceAug-7-2010

Web development is a challenging research area for its creativity and complexity. The existing raised key challenge in web technology technologic development is the presentation of data in machine read and process able format to take advantage in knowledge based information extraction and maintenance [4]. Currently it is not possible to search and extract optimized results using full text queries because there is no such mechanism exists which can fully extract the semantic from full text queries and then look for particular knowledge based information. Mechanism of presenting information over the web in a format so that the humans as well as machines can understand the context leads to the concept of Semantic Web introduced by Tim Berners Lee [4]. Semantic web is a linked mesh of information to produce technologies capable of reasoning on semi structured information and processed by machines [4].

data mining, information, information retrieval, (15 more...)

arXiv.org Artificial Intelligence

1008.1333

Country:

Europe > Austria > Vienna (0.21)
Oceania > Australia > Victoria > Melbourne (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Communications > Web > Semantic Web (0.62)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.47)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.43)
(2 more...)

Add feedback

Semantic Oriented Agent based Approach towards Engineering Data Management, Web Information Retrieval and User System Communication Problems

Ahmed, Zeeshan, Gerhard, Detlef

arXiv.org Artificial IntelligenceAug-7-2010

The four intensive problems to the software rose by the software industry .i.e., User System Communication / Human Machine Interface, Meta Data extraction, Information processing & management and Data representation are discussed in this research paper. To contribute in the field we have proposed and described an intelligent semantic oriented agent based search engine including the concepts of intelligent graphical user interface, natural language based information processing, data management and data reconstruction for the final user end information representation.

artificial intelligence, information retrieval, natural language, (13 more...)

arXiv.org Artificial Intelligence

1008.1328

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.05)
Europe > Austria > Vienna (0.05)
Asia > South Korea > Busan > Busan (0.05)
Asia > Pakistan (0.05)

Genre: Research Report (0.40)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.64)

Add feedback

Keyword Extraction and Headline Generation Using Novel Word Features

Xu, Songhua (Yale University) | Yang, Shaohui (University of Hong Kong) | Lau, Francis (University of Hong Kong)

AAAI ConferencesJul-15-2010

We introduce several novel word features for keyword extraction and headline generation. These new word features are derived according to the background knowledge of a document as supplied by Wikipedia. Given a document, to acquire its background knowledge from Wikipedia, we first generate a query for searching the Wikipedia corpus based on the key facts present in the document. We then use the query to find articles in the Wikipedia corpus that are closely related to the contents of the document. With the Wikipedia search result article set, we extract the inlink, outlink, category and infobox information in each article to derive a set of novel word features which reflect the document's background knowledge. These newly introduced word features offer valuable indications on individual words' importance in the input document. They serve as nice complements to the traditional word features derivable from explicit information of a document. In addition, we also introduce a word-document fitness feature to charcterize the influence of a document's genre on the keyword extraction and headline generation process. We study the effectiveness of these novel word features for keyword extraction and headline generation by experiments and have obtained very encouraging results.

wikipedia, wikipedia article, word feature, (13 more...)

AAAI Conferences

Twenty-Fourth AAAI Conference on Artificial Intelligence

Country:

Asia > China > Hong Kong (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Connecticut > New Haven County > New Haven (0.04)
(4 more...)

Genre:

Overview (0.49)
Research Report (0.47)

Industry: Government (0.94)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

Natural Language Aided Visual Query Building for Complex Data Access

Pan, Shimei (IBM Watson Research Center) | Zhou, Michelle (IBM Almaden Research Center) | Houck, Keith (IBM Watson Research Center) | Kissa, Peter (IBM Watson Research Center)

AAAI ConferencesJul-15-2010

Over the past decades, there have been significant efforts on developing robust and easy-to-use query interfaces to databases. So far, the typical query interfaces are GUI-based visual query interfaces. Visual query interfaces however, have limitations especially when they are used for accessing large and complex datasets. Therefore, we are developing a novel query interface where users can use natural language expressions to help author visual queries. Our work enhances the usability of a visual query interface by directly addressing the "knowledge gap" issue in visual query interfaces. We have applied our work in several real-world applications. Our preliminary evaluation demonstrates the effectiveness of our approach.

artificial intelligence, natural language, text processing, (22 more...)

AAAI Conferences

Twenty-Second IAAI Conference

Genre: Research Report > Experimental Study (0.46)

Industry:

Education (1.00)
Banking & Finance > Real Estate (0.69)

Technology:

Information Technology > Human Computer Interaction > Interfaces (1.00)
Information Technology > Databases (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (1.00)

Add feedback

Ontological Reasoning with F-logic Lite and its Extensions

Cali, Andrea (University of Oxford) | Gottlob, Georg (University of Oxford) | Kifer, Michael (SUNY Stony Brook) | Lukasiewicz, Thomas (University of Oxford) | Pieris, Andreas (University of Oxford)

AAAI ConferencesJul-15-2010

Answering queries posed over knowledge bases is a central problem in knowledge representation and database theory. In the database area, checking query containment is an important query optimization and schema integration technique. In knowledge representation it has been used for object classification, schema integration, service discovery, and more. In the presence of a knowledge base, the problem of query containment is strictly related to that of query answering; indeed, the two are reducible to each other; we focus on the latter, and our results immediately extend to the former.

artificial intelligence, information retrieval query processing, natural language, (20 more...)

AAAI Conferences

Twenty-Fourth AAAI Conference on Artificial Intelligence

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
North America > United States > Pennsylvania (0.04)
North America > United States > New York > Suffolk County > Stony Brook (0.04)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Ontologies (0.83)
Information Technology > Artificial Intelligence > Representation & Reasoning > Information Fusion (0.69)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.54)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.54)

Add feedback