AITopics | Information Retrieval

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer meet the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but it still usually requires time-consuming serial searching of one database after another, followed by other methods of searching for internet sources. And what if the information being sought is a sound bite? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where the information being sought is held in widely different formats, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

InPars-Light: Cost-Effective Unsupervised Training of Efficient Rankers

Boytsov, Leonid, Patel, Preksha, Sourabh, Vivek, Nisar, Riddhi, Kundu, Sayani, Ramanathan, Ramya, Nyberg, Eric

arXiv.org Artificial Intelligence, Jan-8-2023

We carried out a reproducibility study of the InPars recipe for unsupervised training of neural rankers. As a by-product of this study, we developed a simple yet effective modification of InPars, which we call InPars-light. Unlike InPars, InPars-light uses only the freely available language model BLOOM and 7x-100x smaller ranking models. On all five English retrieval collections used in the original InPars study, we obtained substantial (7-30%) and statistically significant improvements over BM25 in nDCG or MRR using only a 30M-parameter six-layer MiniLM ranker. In contrast, in the InPars study only the 100x larger MonoT5-3B model consistently outperformed BM25, whereas their smaller MonoT5-220M model (which is still 7x larger than our MiniLM ranker) outperformed BM25 only on MS MARCO and TREC DL 2020. In a purely unsupervised setting, our 435M-parameter DeBERTa v3 ranker was roughly on par with the 7x larger MonoT5-3B; in fact, on three out of five datasets, it slightly outperformed MonoT5-3B. Finally, these good results were achieved by re-ranking only 100 candidate documents, compared to the 1000 used in InPars. We believe that InPars-light is the first truly cost-effective prompt-based unsupervised recipe to train and deploy neural ranking models that outperform BM25.
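The abstract reports gains in nDCG and MRR. As a rough illustration of what those two ranking metrics measure, here is a minimal sketch; it is not the paper's evaluation code. The function names and the toy document ids are hypothetical, and the DCG variant shown uses linear gains (some evaluation tools use an exponential gain, 2^g - 1, instead).

```python
import math


def reciprocal_rank(ranked_ids, relevant_ids):
    """Return 1/rank of the first relevant document, or 0.0 if none is retrieved."""
    for rank, doc_id in enumerate(ranked_ids, start=1):
        if doc_id in relevant_ids:
            return 1.0 / rank
    return 0.0


def dcg(gains):
    """Discounted cumulative gain with linear gains and a log2 position discount."""
    return sum(g / math.log2(i + 2) for i, g in enumerate(gains))


def ndcg_at_k(ranked_ids, gain_by_id, k):
    """DCG of the top-k ranking divided by the DCG of an ideal top-k ranking."""
    gains = [gain_by_id.get(doc_id, 0) for doc_id in ranked_ids[:k]]
    ideal_gains = sorted(gain_by_id.values(), reverse=True)[:k]
    ideal = dcg(ideal_gains)
    return dcg(gains) / ideal if ideal > 0 else 0.0


# Toy example: the only relevant document, "d1", is ranked second.
ranking = ["d3", "d1", "d2"]
rr = reciprocal_rank(ranking, {"d1"})
ndcg = ndcg_at_k(ranking, {"d1": 1}, k=3)
```

MRR is the mean of `reciprocal_rank` over all queries, and the reported nDCG is likewise averaged over queries.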

information retrieval, large language model, machine learning, (21 more...)

arXiv: 2301.02998

Country:

North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
Europe > Italy > Tuscany > Florence (0.04)

Genre: Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.96)

Micro, Macro & Weighted Averages of F1 Score, Clearly Explained - KDnuggets

#artificialintelligence, Jan-4-2023, 20:30:53 GMT

The F1 score (aka F-measure) is a popular metric for evaluating the performance of a classification model. In the case of multi-class classification, we adopt averaging methods for F1 score calculation, resulting in a set of different average scores (macro, weighted, micro) in the classification report. This article looks at the meaning of these averages, how to calculate them, and which one to choose for reporting. It opens with a refresher on precision, recall, and the F1 score, which readers already familiar with these concepts can skip. Precision, in lay terms: of all the positive predictions I made, how many of them are truly positive?
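To make the three averages concrete, the following is a minimal sketch for single-label multi-class predictions, written from the standard definitions rather than taken from the article. The function name `f1_averages` and the toy labels are hypothetical.

```python
from collections import Counter


def f1_averages(y_true, y_pred):
    """Compute per-class, macro, weighted, and micro F1 for single-label data."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class gains a false positive
            fn[t] += 1  # true class gains a false negative

    def f1(tp_c, fp_c, fn_c):
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        denom = 2 * tp_c + fp_c + fn_c
        return 2 * tp_c / denom if denom else 0.0

    per_class = {c: f1(tp[c], fp[c], fn[c]) for c in labels}
    support = Counter(y_true)
    return {
        "per_class": per_class,
        # Macro: unweighted mean of per-class F1.
        "macro": sum(per_class.values()) / len(labels),
        # Weighted: per-class F1 weighted by the number of true instances.
        "weighted": sum(per_class[c] * support[c] for c in labels) / len(y_true),
        # Micro: F1 computed from counts pooled over all classes.
        "micro": f1(sum(tp.values()), sum(fp.values()), sum(fn.values())),
    }


y_true = ["a", "a", "a", "b", "b", "c"]
y_pred = ["a", "a", "b", "b", "c", "c"]
scores = f1_averages(y_true, y_pred)
```

In this single-label setting the micro average equals plain accuracy (here 4 correct out of 6), while the macro average treats every class equally regardless of how many instances it has.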

information retrieval, machine learning, natural language, (19 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.47)

Follow the Timeline! Generating Abstractive and Extractive Timeline Summary in Chronological Order

Chen, Xiuying, Li, Mingzhe, Gao, Shen, Chan, Zhangming, Zhao, Dongyan, Gao, Xin, Zhang, Xiangliang, Yan, Rui

arXiv.org Artificial Intelligence, Jan-2-2023

Nowadays, time-stamped web documents related to a general news query flood the Internet, and timeline summarization aims to concisely summarize the evolution trajectory of events along the timeline. Unlike traditional document summarization, timeline summarization needs to model the time series information of the input events and summarize important events in chronological order. To tackle this challenge, in this paper, we propose a Unified Timeline Summarizer (UTS) that can generate abstractive and extractive timeline summaries in time order. Concretely, in the encoder part, we propose a graph-based event encoder that relates multiple events according to their content dependency and learns a global representation of each event. In the decoder part, to ensure the chronological order of the abstractive summary, we propose to extract the feature of event-level attention in its generation process with sequential information retained and use it to simulate the evolutionary attention of the ground truth summary. The event-level attention can also be used to assist in extracting the summary, where the extracted summary also comes in time sequence. We augment the previous Chinese large-scale timeline summarization dataset and collect a new English timeline dataset. Extensive experiments conducted on these datasets and on the out-of-domain Timeline 17 dataset show that UTS achieves state-of-the-art performance in terms of both automatic and human evaluations.

information retrieval, machine learning, natural language, (18 more...)

arXiv: 2301.00867

Country:

Asia > China (0.04)
North America > United States > Indiana > Lake County > Gary (0.04)
North America > United States > California (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.93)
Research Report > Experimental Study (0.68)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)
Media > News (0.66)

Technology:

Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(3 more...)

Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques

Suleman, Muhammad, Asif, Muhammad, Zamir, Tayyab, Mehmood, Ayaz, Khan, Jebran, Ahmad, Nasir, Ahmad, Kashif

arXiv.org Artificial Intelligence, Dec-31-2022

This paper presents our solutions for the MediaEval 2022 task on DisasterMM. The task is composed of two subtasks, namely (i) Relevance Classification of Twitter Posts (RCTP), and (ii) Location Extraction from Twitter Texts (LETT). The RCTP subtask aims at differentiating flood-related and non-relevant social posts, while LETT is a Named Entity Recognition (NER) task that aims at extracting location information from the text. For RCTP, we proposed four different solutions based on BERT, RoBERTa, DistilBERT, and ALBERT, obtaining F1-scores of 0.7934, 0.7970, 0.7613, and 0.7924, respectively. For LETT, we used three models, namely BERT, RoBERTa, and DistilBERT, obtaining F1-scores of 0.6256, 0.6744, and 0.6723, respectively.

artificial intelligence, information retrieval, natural language, (4 more...)

arXiv: 2301.00321

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.87)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.73)

TA-DA: Topic-Aware Domain Adaptation for Scientific Keyphrase Identification and Classification (Student Abstract)

Smădu, Răzvan-Alexandru, Zaharia, George-Eduard, Avram, Andrei-Marius, Cercel, Dumitru-Clementin, Dascalu, Mihai, Pop, Florin

arXiv.org Artificial Intelligence, Dec-30-2022

Keyphrase identification and classification is a Natural Language Processing and Information Retrieval task that involves extracting relevant groups of words from a given text related to the main topic. In this work, we focus on extracting keyphrases from scientific documents. We introduce TA-DA, a Topic-Aware Domain Adaptation framework for keyphrase extraction that integrates Multi-Task Learning with Adversarial Training and Domain Adaptation. Our approach improves performance over baseline models by up to 5% in exact-match F1-score.
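For readers unfamiliar with exact-match scoring, here is a minimal sketch of one common way to compute an exact-match F1 for a single document: a predicted keyphrase counts only if it matches a gold keyphrase exactly after normalization. This is an assumption about the general metric, not the paper's evaluation script; the function names, the normalization (lowercasing and whitespace collapsing), and the sample phrases are all hypothetical.

```python
def normalize(phrase):
    """Lowercase and collapse whitespace (an assumed normalization)."""
    return " ".join(phrase.lower().split())


def exact_match_f1(predicted, gold):
    """F1 over sets of keyphrases, counting only exact matches after normalization."""
    pred_set = {normalize(p) for p in predicted}
    gold_set = {normalize(g) for g in gold}
    matches = len(pred_set & gold_set)
    if matches == 0:
        return 0.0
    precision = matches / len(pred_set)
    recall = matches / len(gold_set)
    return 2 * precision * recall / (precision + recall)


predicted = ["Neural Networks", "domain adaptation", "topic model"]
gold = ["domain adaptation", "neural  networks", "adversarial training", "keyphrase extraction"]
score = exact_match_f1(predicted, gold)  # 2 matches: precision 2/3, recall 2/4
```

Partial overlaps such as "topic model" versus "topic modeling" earn no credit under this scheme, which is why exact-match scores tend to be lower than token-level ones.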

information retrieval, machine learning, natural language, (12 more...)

arXiv: 2301.06902

Country:

Europe > Romania > București - Ilfov Development Region > Municipality of Bucharest > Bucharest (0.06)
Asia > Middle East > Jordan (0.05)

Genre: Research Report (0.65)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.71)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Modified Query Expansion Through Generative Adversarial Networks for Information Extraction in E-Commerce

Cakir, Altan, Gurkan, Mert

arXiv.org Artificial Intelligence, Dec-30-2022

This work addresses an alternative approach for query expansion (QE) using a generative adversarial network (GAN) to enhance the effectiveness of information search in e-commerce. We propose a modified QE conditional GAN (mQE-CGAN) framework, which resolves keywords by expanding the query with a synthetically generated query that proposes semantic information from text input. We train a sequence-to-sequence transformer model as the generator to produce keywords and use a recurrent neural network model as the discriminator to classify an adversarial output with the generator. With the modified CGAN framework, various forms of semantic insights gathered from the query document corpus are introduced to the generation process. We leverage these insights as conditions for the generator model and discuss their effectiveness for the query expansion task. Our experiments demonstrate that the use of condition structures within the mQE-CGAN framework can increase the semantic similarity between generated sequences and reference documents by up to nearly 10% compared to baseline models.

generative adversarial network, machine learning, natural language, (5 more...)

arXiv: 2301.00036

Genre: Research Report (0.40)

Industry: Information Technology > Services > e-Commerce Services (0.60)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.80)

Automated Dynamic Algorithm Configuration

Adriaensen, Steven (University of Freiburg, Machine Learning Lab) | Biedenkapp, André (University of Freiburg, Machine Learning Lab) | Shala, Gresa (University of Freiburg, Machine Learning Lab) | Awad, Noor (University of Freiburg, Machine Learning Lab) | Eimer, Theresa (Leibniz University Hannover, Institute for Information Processing) | Lindauer, Marius (Leibniz University Hannover, Institute for Information Processing) | Hutter, Frank (University of Freiburg, Machine Learning Lab & Bosch Center for Artificial Intelligence)

Journal of Artificial Intelligence Research, Dec-30-2022

The performance of an algorithm often critically depends on its parameter configuration. While a variety of automated algorithm configuration methods have been proposed to relieve users from the tedious and error-prone task of manually tuning parameters, there is still a lot of untapped potential as the learned configuration is static, i.e., parameter settings remain fixed throughout the run. However, it has been shown that some algorithm parameters are best adjusted dynamically during execution. Thus far, this is most commonly achieved through hand-crafted heuristics. A promising recent alternative is to automatically learn such dynamic parameter adaptation policies from data. In this article, we give the first comprehensive account of this new field of automated dynamic algorithm configuration (DAC), present a series of recent advances, and provide a solid foundation for future research in this field. Specifically, we (i) situate DAC in the broader historical context of AI research; (ii) formalize DAC as a computational problem; (iii) identify the methods used in prior art to tackle this problem; and (iv) conduct empirical case studies for using DAC in evolutionary optimization, AI planning, and machine learning.

algorithm, dac, proceedings, (13 more...)

doi: 10.1613/jair.1.13922

AI Access Foundation

Country:

Europe > Germany > Baden-Württemberg > Freiburg (0.04)
Europe > Belgium (0.04)

Genre:

Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Experimental Study (0.67)

Industry: Leisure & Entertainment (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Planning & Scheduling (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
(7 more...)

Customizing Knowledge Graph Embedding to Improve Clinical Study Recommendation

Liu, Xiong, Khalil, Iya, Devarakonda, Murthy

arXiv.org Artificial Intelligence, Dec-28-2022

Inferring knowledge from clinical trials using knowledge graph embedding is an emerging area. However, customizing graph embeddings for different use cases remains a significant challenge. We propose custom2vec, an algorithmic framework to customize graph embeddings by incorporating user preferences in training the embeddings. It captures user preferences by adding custom nodes and links derived from manually vetted results of a separate information retrieval method. We propose a joint learning objective to preserve the original network structure while incorporating the user's custom annotations. We hypothesize that the custom training improves user-expected predictions, for example, in link prediction tasks. We demonstrate the effectiveness of custom2vec for clinical trials related to non-small cell lung cancer (NSCLC) with two customization scenarios: recommending immuno-oncology trials evaluating PD-1 inhibitors and exploring similar trials that compare new therapies with a standard of care. The results show that custom2vec training achieves better performance than the conventional training methods. Our approach is a novel way to customize knowledge graph embeddings and enable more accurate recommendations and predictions.

artificial intelligence, information retrieval, natural language, (2 more...)

arXiv: 2212.14102

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.89)

Industry: Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (0.87)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.80)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.53)

Towards mapping the contemporary art world with ArtLM: an art-specific NLP model

Chen, Qinkai, El-Mennaoui, Mohamed, Fosset, Antoine, Rebei, Amine, Cao, Haoyang, Bouscasse, Philine, O'Beirne, Christy Eóin, Shevchenko, Sasha, Rosenbaum, Mathieu

arXiv.org Artificial Intelligence, Dec-22-2022

With an increasing amount of data in the art world, discovering artists and artworks suitable to collectors' tastes becomes a challenge. It is no longer enough to use visual information, as contextual information about the artist has become just as important in contemporary art. In this work, we present a generic Natural Language Processing framework (called ArtLM) to discover the connections among contemporary artists based on their biographies. In this approach, we first continue to pre-train the existing general English language models with a large amount of unlabelled art-related data. We then fine-tune this new pre-trained model with our biography pair dataset manually annotated by a team of professionals in the art industry. With extensive experiments, we demonstrate that our ArtLM achieves 85.6% accuracy and 84.0% F1 score and outperforms other baseline models. We also provide a visualisation and a qualitative analysis of the artist network built from ArtLM's outputs.

artist, machine learning, pattern recognition, (19 more...)

arXiv: 2212.07127

Country:

North America > United States > New York (0.04)
Europe > France (0.04)
Europe > United Kingdom (0.04)
(5 more...)

Genre: Research Report > New Finding (0.47)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.47)
Information Technology > Artificial Intelligence > Machine Learning > Pattern Recognition (0.46)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)

Fine-Grained Distillation for Long Document Retrieval

Zhou, Yucheng, Shen, Tao, Geng, Xiubo, Tao, Chongyang, Long, Guodong, Xu, Can, Jiang, Daxin

arXiv.org Artificial Intelligence, Dec-20-2022

Long document retrieval aims to fetch query-relevant documents from a large-scale collection, where knowledge distillation has become the de facto approach to improving a retriever by mimicking a heterogeneous yet powerful cross-encoder. However, in contrast to passages or sentences, retrieval on long documents suffers from the scope hypothesis that a long document may cover multiple topics. This maximizes their structure heterogeneity and poses a granular-mismatch issue, leading to inferior distillation efficacy. In this work, we propose a new learning framework, fine-grained distillation (FGD), for long-document retrievers. While preserving the conventional dense retrieval paradigm, it first produces globally consistent representations across different fine granularities and then applies multi-granular aligned distillation only during training. In experiments, we evaluate our framework on two long-document retrieval benchmarks, on which it shows state-of-the-art performance.
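As background for the distillation setup the abstract builds on, the sketch below shows the generic score-distillation idea: a student retriever is trained so that its score distribution over a query's candidate documents matches that of a cross-encoder teacher, here measured with KL divergence. This is only the conventional baseline formulation under simplifying assumptions, not the multi-granular FGD method itself; the function names, the temperature parameter, and the toy scores are hypothetical.

```python
import math


def softmax(scores, temperature=1.0):
    """Numerically stable softmax over a list of relevance scores."""
    scaled = [s / temperature for s in scores]
    peak = max(scaled)
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def kl_distillation_loss(teacher_scores, student_scores, temperature=1.0):
    """KL(teacher || student) between score distributions over the same candidates."""
    p = softmax(teacher_scores, temperature)
    q = softmax(student_scores, temperature)
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Toy scores for one query and three candidate documents.
teacher = [4.0, 1.0, 0.5]   # cross-encoder relevance scores (assumed values)
student = [2.0, 1.5, 1.0]   # bi-encoder similarity scores (assumed values)
loss = kl_distillation_loss(teacher, student)
```

The loss is zero when the student reproduces the teacher's distribution exactly and grows as the two rankings diverge; in practice it would be minimized with a deep learning framework rather than computed in plain Python.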

information retrieval, machine learning, natural language, (18 more...)

arXiv: 2212.10423

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > Canada (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(15 more...)

Genre: Research Report (0.64)

Industry: Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)
