AITopics | Information Retrieval

Collaborating Authors

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer fill the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but still usually requires time-consuming serial searching of one database after the other, and then moving on to other methods of searching for internet sources. And what if the information being sought is a sound byte? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where widely different formats contain the information being sought, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

News Overviews Instructional Materials AI-Alerts Classics

Evo* 2022 -- Late-Breaking Abstracts Volume

Mora, A. M., Esparcia-Alcázar, A. I.

arXiv.org Artificial IntelligenceJul-31-2022

This volume contains the Late-Breaking Abstracts accepted at Evo* 2022 Conference, held in Madrid (Spain), from 20 to 22 of April. They were also presented as short talks as well as at the conference's poster session. The works present ongoing research and preliminary results investigating on the application of different approaches of Evolutionary Computation and other Nature-Inspired techniques to different problems, most of them real world ones. These are very promising contributions, since they outline some of the incoming advances and applications in the area of nature-inspired methods, mainly Evolutionary Algorithms.

algorithm, arxiv, proceedings, (14 more...)

arXiv.org Artificial Intelligence

2208.00555

Country:

Europe > Spain > Galicia > Madrid (0.24)
Europe > Portugal > Lisbon > Lisbon (0.04)
North America > United States > California > San Diego County > San Diego (0.04)
(11 more...)

Genre: Research Report > New Finding (0.92)

Industry:

Media > Music (1.00)
Banking & Finance (0.92)
Health & Medicine > Pharmaceuticals & Biotechnology (0.68)
(2 more...)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
(4 more...)

Add feedback

A Small Survey On Event Detection Using Twitter

Datta, Debanjan

arXiv.org Artificial IntelligenceJul-30-2022

This is evident from popular phenomena such as effects of fake news and online social movements. However the the data obtained from social media presents itself with large volume and velocity, accompanied by significant amount of irrelevant data pertaining to general discussions, personal messages and spam. Social media has been shown to be effective for detecting, forecasting and tracking real world events. The ability to detect real world events is crucial and has applications in disease surveillance, commerce, governance and other areas. Thus extraction of useful information and modelling the characteristics of social media to detect real world events is an important problem. 2 RESEARCH PROBLEM To outline the research problem we need to define events, which has multiple interpretations.

information retrieval, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2011.05801

Country:

North America > United States > Virginia > Arlington County > Arlington (0.04)
Asia (0.04)

Genre: Research Report (1.00)

Industry: Information Technology > Services (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Clustering (1.00)
(2 more...)

Add feedback

PHEMEPlus: Enriching Social Media Rumour Verification with External Evidence

Dougrez-Lewis, John, Kochkina, Elena, Arana-Catania, M., Liakata, Maria, He, Yulan

arXiv.org Artificial IntelligenceJul-28-2022

Work on social media rumour verification utilises signals from posts, their propagation and users involved. Other lines of work target identifying and fact-checking claims based on information from Wikipedia, or trustworthy news articles without considering social media context. However works combining the information from social media with external evidence from the wider web are lacking. To facilitate research in this direction, we release a novel dataset, PHEMEPlus, an extension of the PHEME benchmark, which contains social media conversations as well as relevant external evidence for each rumour. We demonstrate the effectiveness of incorporating such evidence in improving rumour verification models. Additionally, as part of the evidence collection, we evaluate various ways of query formulation to identify the most effective method.

dataset, proceedings, verification, (14 more...)

arXiv.org Artificial Intelligence

2207.1397

Country:

North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry:

Media > News (1.00)
Health & Medicine (0.99)
Information Technology (0.68)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.94)

Add feedback

AI Tools Streamline Content Marketing and SEO

#artificialintelligenceJul-27-2022, 17:39:52 GMT

The aim of content marketing is to attract, engage, and retain customers. It takes many forms, including videos, podcasts, graphics, articles, and whitepapers. Each of those could have a sub-task. This article focuses on attracting an audience -- driving top-of-the-funnel prospects -- with blog content. A blog post that ranks well on search engine results pages must include the words and phrases of searchers.

blog post, jasper, rankiq, (10 more...)

#artificialintelligence

Country: North America > United States > Indiana > Dubois County > Jasper (0.05)

Industry:

Leisure & Entertainment (0.75)
Media > Film (0.53)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.36)

Add feedback

UNIMIB at TREC 2021 Clinical Trials Track

Peikos, Georgios, Espitia, Oscar, Pasi, Gabriella

arXiv.org Artificial IntelligenceJul-27-2022

This contribution summarizes the participation of the UNIMIB team to the TREC 2021 Clinical Trials Track. We have investigated the effect of different query representations combined with several retrieval models on the retrieval performance. First, we have implemented a neural re-ranking approach to study the effectiveness of dense text representations. Additionally, we have investigated the effectiveness of a novel decision-theoretic model for relevance estimation. Finally, both of the above relevance models have been compared with standard retrieval approaches. In particular, we combined a keyword extraction method with a standard retrieval process based on the BM25 model and a decision-theoretic relevance model that exploits the characteristics of this particular search task. The obtained results show that the proposed keyword extraction method improves 84% of the queries over the TREC's median NDCG@10 measure when combined with either traditional or decision-theoretic relevance models. Moreover, regarding RPEC@10, the employed decision-theoretic model improves 85% of the queries over the reported TREC's median value.

criteria, query, representation, (14 more...)

arXiv.org Artificial Intelligence

2207.13514

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.28)
Europe > Italy > Tuscany > Pisa Province > Pisa (0.04)
Europe > Italy > Lombardy > Milan (0.04)
Europe > Belgium > Brussels-Capital Region > Brussels (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

BioADAPT-MRC: Adversarial Learning-based Domain Adaptation Improves Biomedical Machine Reading Comprehension Task

Mahbub, Maria, Srinivasan, Sudarshan, Begoli, Edmon, Peterson, Gregory D

arXiv.org Artificial IntelligenceJul-26-2022

Biomedical machine reading comprehension (biomedical-MRC) aims to comprehend complex biomedical narratives and assist healthcare professionals in retrieving information from them. The high performance of modern neural network-based MRC systems depends on high-quality, large-scale, human-annotated training datasets. In the biomedical domain, a crucial challenge in creating such datasets is the requirement for domain knowledge, inducing the scarcity of labeled data and the need for transfer learning from the labeled general-purpose (source) domain to the biomedical (target) domain. However, there is a discrepancy in marginal distributions between the general-purpose and biomedical domains due to the variances in topics. Therefore, direct-transferring of learned representations from a model trained on a general-purpose domain to the biomedical domain can hurt the model's performance. We present an adversarial learning-based domain adaptation framework for the biomedical machine reading comprehension task (BioADAPT-MRC), a neural network-based method to address the discrepancies in the marginal distributions between the general and biomedical domain datasets. BioADAPT-MRC relaxes the need for generating pseudo labels for training a well-performing biomedical-MRC model. We extensively evaluate the performance of BioADAPT-MRC by comparing it with the best existing methods on three widely used benchmark biomedical-MRC datasets -- BioASQ-7b, BioASQ-8b, and BioASQ-9b. Our results suggest that without using any synthetic or human-annotated data from the biomedical domain, BioADAPT-MRC can achieve state-of-the-art performance on these datasets. Availability: BioADAPT-MRC is freely available as an open-source project at \url{https://github.com/mmahbub/BioADAPT-MRC}.

bioadapt-mrc, dataset, representation, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1093/bioinformatics/btac508

2202.13174

Country:

North America > United States > Tennessee > Knox County > Knoxville (0.14)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
North America > United States > Texas > Travis County > Austin (0.04)
(6 more...)

Genre: Research Report > New Finding (0.86)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Education > Assessment & Standards > Student Performance (1.00)
Government > Military (0.93)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.93)

Add feedback

Buffer Pool Aware Query Scheduling via Deep Reinforcement Learning

Zhang, Chi, Marcus, Ryan, Kleiman, Anat, Papaemmanouil, Olga

arXiv.org Artificial IntelligenceJul-26-2022

One could imagine many simple heuristics, query scheduling with the explicit goal of reducing disk reads such as greedily selecting the next query with the highest and thus implicitly increasing query performance. We introduce expected buffer usage, to solve this problem. However, a SmartQueue, a learned scheduler that leverages overlapping hand-designed policy to handle the complexity of the entire data reads among incoming queries and learns a problem, including different buffer sizes, shifting query scheduling strategy that improves cache hits. SmartQueue workloads, heterogeneous data types (e.g., index files vs base relies on deep reinforcement learning to produce workloadspecific relations), and balancing short-term gains against long-term scheduling strategies that focus on long-term performance strategy is much more difficult to conceive.

buffer, query, smartqueue, (14 more...)

arXiv.org Artificial Intelligence

2007.10568

Country: North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (0.30)

Add feedback

What you Know about Keywords and their Importance in SEO

#artificialintelligenceJul-23-2022, 01:35:35 GMT

Modern internet growth has created the need for many skills, which are called digital skills. One of these skills is search engine optimization. So people can improve their skills by reading online content from our website. In this article, although our intended readers are beginners, professionals can also refresh their knowledge. If you know about keywords, then you must know about search engine optimization.

artificial intelligence, information retrieval, natural language, (17 more...)

#artificialintelligence

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.74)

Add feedback

Facing Changes: Continual Entity Alignment for Growing Knowledge Graphs

Wang, Yuxin, Cui, Yuanning, Liu, Wenqiang, Sun, Zequn, Jiang, Yiqiao, Han, Kexin, Hu, Wei

arXiv.org Artificial IntelligenceJul-23-2022

Entity alignment is a basic and vital technique in knowledge graph (KG) integration. Over the years, research on entity alignment has resided on the assumption that KGs are static, which neglects the nature of growth of real-world KGs. As KGs grow, previous alignment results face the need to be revisited while new entity alignment waits to be discovered. In this paper, we propose and dive into a realistic yet unexplored setting, referred to as continual entity alignment. To avoid retraining an entire model on the whole KGs whenever new entities and triples come, we present a continual alignment method for this task. It reconstructs an entity's representation based on entity adjacency, enabling it to generate embeddings for new entities quickly and inductively using their existing neighbors. It selects and replays partial pre-aligned entity pairs to train only parts of KGs while extracting trustworthy alignment for knowledge augmentation. As growing KGs inevitably contain non-matchable entities, different from previous works, the proposed method employs bidirectional nearest neighbor matching to find new entity alignment and update old alignment. Furthermore, we also construct new datasets by simulating the growth of multilingual DBpedia. Extensive experiments demonstrate that our continual alignment method is more effective than baselines based on retraining or inductive learning.

alignment, entity alignment, new entity, (13 more...)

arXiv.org Artificial Intelligence

2207.11436

Country:

Asia > China > Jiangsu Province > Nanjing (0.05)
Asia > China > Zhejiang Province > Hangzhou (0.04)
Europe > Sweden (0.04)
(3 more...)

Genre: Research Report (1.00)

Industry: Information Technology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Semantic Networks (0.64)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.46)

Add feedback

Meta AI introduces Sphere, a model designed to verify citations on Wikipedia - Actu IA

#artificialintelligenceJul-22-2022, 08:11:19 GMT

When we do a search on the Internet, the search engine very often suggests the site of the community encyclopedia Wikipedia. It contains about 6.5 million articles by volunteer contributors, but how can we know if these are reliable, even though the sources of the articles are cited? Meta relied on Meta AI's research and advances to develop SPHERE, an open source model capable of automatically analyzing hundreds of thousands of citations at a time to check whether they actually support the corresponding claims, it recently published it on the Github platform. Meta said it is not partnering with Wikimedia, the foundation that runs Wikipedia, on this project. Its goal is to create a platform to help Wikipedia editors systematically spot citation problems and quickly correct the citation or the corresponding article content.

meta ai introduce sphere, representation, wikipedia, (4 more...)

#artificialintelligence

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.53)

Add feedback