AITopics | Information Retrieval

Collaborating Authors

Information Retrieval

Our accustomed systems of retrieving particular bits of information no longer fill the needs of many people. Searching traditional indexes of print publications has been aided by computerized databases, but still usually requires time-consuming serial searching of one database after the other, and then moving on to other methods of searching for internet sources. And what if the information being sought is a sound byte? A video clip? Yesterday's e-mail exchange between respected scientists? Artificial intelligence may hold the key to information retrieval in an age where widely different formats contain the information being sought, and the universe of knowledge is simply too big and growing too rapidly for successful searching to proceed at a human's slow speed.

News Overviews Instructional Materials AI-Alerts Classics

Website Analyzer: How to Check your Website's SEO for Free

#artificialintelligenceFeb-26-2023, 10:00:24 GMT

When it comes to search engine optimization, you need to make sure that your website is optimized for the best possible results. Using Website Analyzer helps to find the improvement points. Here are 10 tips for improving your SEO rankings on Google and other search engines. When optimizing a website for search engine optimization (SEO) purposes, keyword research tools can be an invaluable resource. Keyword research tools enable you to identify the most popular search terms related to your content, allowing you to craft your page titles and meta descriptions accordingly.

search engine, website, website analyzer, (12 more...)

#artificialintelligence

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.97)

Add feedback

Topic-Selective Graph Network for Topic-Focused Summarization

Zesheng, Shi, Yucheng, Zhou

arXiv.org Artificial IntelligenceFeb-25-2023

Due to the success of the pre-trained language model (PLM), existing PLM-based summarization models show their powerful generative capability. However, these models are trained on general-purpose summarization datasets, leading to generated summaries failing to satisfy the needs of different readers. To generate summaries with topics, many efforts have been made on topic-focused summarization. However, these works generate a summary only guided by a prompt comprising topic words. Despite their success, these methods still ignore the disturbance of sentences with non-relevant topics and only conduct cross-interaction between tokens by attention module. To address this issue, we propose a topic-arc recognition objective and topic-selective graph network. First, the topic-arc recognition objective is used to model training, which endows the capability to discriminate topics for the model. Moreover, the topic-selective graph network can conduct topic-guided cross-interaction on sentences based on the results of topic-arc recognition. In the experiments, we conduct extensive evaluations on NEWTS and COVIDET datasets. Results show that our methods achieve state-of-the-art performance.

information retrieval, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2302.13106

Country:

Europe > Russia (0.05)
North America > United States > Illinois > Cook County > Chicago (0.05)
Europe > Poland (0.05)
(13 more...)

Genre: Research Report (0.71)

Industry:

Transportation > Air (0.68)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.68)
Government > Regional Government > North America Government > United States Government (0.68)
Government > Military > Air Force (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.94)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.66)

Add feedback

Development of a Thermodynamics of Human Cognition and Human Culture

Aerts, Diederik, Arguëlles, Jonito Aerts, Beltran, Lester, Sozzo, Sandro

arXiv.org Artificial IntelligenceFeb-24-2023

Inspired by foundational studies in classical and quantum physics, and by information retrieval studies in quantum information theory, we prove that the notions of 'energy' and 'entropy' can be consistently introduced in human language and, more generally, in human culture. More explicitly, if energy is attributed to words according to their frequency of appearance in a text, then the ensuing energy levels are distributed non-classically, namely, they obey Bose-Einstein, rather than Maxwell-Boltzmann, statistics, as a consequence of the genuinely 'quantum indistinguishability' of the words that appear in the text. Secondly, the 'quantum entanglement' due to the way meaning is carried by a text reduces the (von Neumann) entropy of the words that appear in the text, a behaviour which cannot be explained within classical (thermodynamic or information) entropy. We claim here that this 'quantum-type behaviour is valid in general in human language', namely, any text is conceptually more concrete than the words composing it, which entails that the entropy of the overall text decreases. In addition, we provide examples taken from cognition, where quantization of energy appears in categorical perception, and from culture, where entities collaborate, thus 'entangle', to decrease overall entropy. We use these findings to propose the development of a new 'non-classical thermodynamic theory' for human cognition, which also covers broad parts of human culture and its artefacts and bridges concepts with quantum physics entities.

artificial intelligence, information retrieval, natural language, (18 more...)

arXiv.org Artificial Intelligence

2212.12795

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Asia > Singapore (0.04)
North America > United States > New York (0.04)
(3 more...)

Genre: Research Report (0.64)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.34)

Add feedback

Implicit Temporal Reasoning for Evidence-Based Fact-Checking

Allein, Liesbeth, Saelens, Marlon, Cartuyvels, Ruben, Moens, Marie-Francine

arXiv.org Artificial IntelligenceFeb-24-2023

Leveraging contextual knowledge has become standard practice in automated claim verification, yet the impact of temporal reasoning has been largely overlooked. Our study demonstrates that time positively influences the claim verification process of evidence-based fact-checking. The temporal aspects and relations between claims and evidence are first established through grounding on shared timelines, which are constructed using publication dates and time expressions extracted from their text. Temporal information is then provided to RNN-based and Transformer-based classifiers before or after claim and evidence encoding. Our time-aware fact-checking models surpass base models by up to 9% Micro F1 (64.17%) and 15% Macro F1 (47.43%) on the MultiFC dataset. They also outperform prior methods that explicitly model temporal relations between evidence. Our findings show that the presence of temporal information and the manner in which timelines are constructed greatly influence how fact-checking models determine the relevance and supporting or refuting character of evidence documents.

information retrieval, machine learning, temporal reasoning, (17 more...)

arXiv.org Artificial Intelligence

2302.12569

Country:

North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
Oceania > Australia > Western Australia (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(4 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Voting & Elections (0.68)
Government > Regional Government > Oceania Government > Australia Government (0.46)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Temporal Reasoning (0.62)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.34)

Add feedback

Microsoft cofounder Bill Gates says the rise of AI poses a threat to Google's search engine profit

#artificialintelligenceFeb-23-2023, 09:02:04 GMT

Microsoft cofounder Bill Gates said AI is the "biggest thing in this decade" Dimitrios Kambouris/Getty Images Bill Gates said in a podcast Google's search engine profits could fall as Microsoft moves into AI. Gates said AI is the "biggest thing in this decade" and could reshuffle the tech space. Microsoft unveiled an AI-powered Bing in a challenge to Google's search engine…

google, microsoft cofounder bill gate, search engine profit, (3 more...)

#artificialintelligence

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)

Add feedback

Open-domain Visual Entity Recognition: Towards Recognizing Millions of Wikipedia Entities

Hu, Hexiang, Luan, Yi, Chen, Yang, Khandelwal, Urvashi, Joshi, Mandar, Lee, Kenton, Toutanova, Kristina, Chang, Ming-Wei

arXiv.org Artificial IntelligenceFeb-23-2023

Large-scale multi-modal pre-training models such as CLIP and PaLI exhibit strong generalization on various visual domains and tasks. However, existing image classification benchmarks often evaluate recognition on a specific domain (e.g., outdoor images) or a specific task (e.g., classifying plant species), which falls short of evaluating whether pre-trained foundational models are universal visual recognizers. To address this, we formally present the task of Open-domain Visual Entity recognitioN (OVEN), where a model need to link an image onto a Wikipedia entity with respect to a text query. We construct OVEN-Wiki by re-purposing 14 existing datasets with all labels grounded onto one single label space: Wikipedia entities. OVEN challenges models to select among six million possible Wikipedia entities, making it a general visual recognition benchmark with the largest number of labels. Our study on state-of-the-art pre-trained models reveals large headroom in generalizing to the massive-scale label space. We show that a PaLI-based auto-regressive visual recognition model performs surprisingly well, even on Wikipedia entities that have never been seen during fine-tuning. We also find existing pretrained models yield different strengths: while PaLI-based models obtain higher overall performance, CLIP-based models are better at recognizing tail entities.

information retrieval, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2302.11154

Country:

North America > United States > Arkansas (0.04)
Europe > Germany (0.04)

Genre: Research Report (0.82)

Industry:

Transportation > Air (0.67)
Transportation > Passenger (0.67)
Aerospace & Defense > Aircraft (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(3 more...)

Add feedback

Automated Extraction of Fine-Grained Standardized Product Information from Unstructured Multilingual Web Data

Flick, Alexander, Jäger, Sebastian, Trajanovska, Ivana, Biessmann, Felix

arXiv.org Artificial IntelligenceFeb-23-2023

Extracting structured information from unstructured data is one of the key challenges in modern information retrieval applications, including e-commerce. Here, we demonstrate how recent advances in machine learning, combined with a recently published multilingual data set with standardized fine-grained product category information, enable robust product attribute extraction in challenging transfer learning settings. Our models can reliably predict product attributes across online shops, languages, or both. Furthermore, we show that our models can be used to match product taxonomies between online retailers.

category, information retrieval, machine learning, (15 more...)

arXiv.org Artificial Intelligence

2302.12139

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > New York > New York County > New York City (0.05)
North America > United States > Michigan > Washtenaw County > Ann Arbor (0.04)
(2 more...)

Genre: Research Report (0.64)

Industry: Retail (0.54)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.37)
Information Technology > Artificial Intelligence > Natural Language > Information Extraction (0.30)

Add feedback

Keyword Decisions in Sponsored Search Advertising: A Literature Review and Research Agenda

Yang, Yanwu, Li, Huiran

arXiv.org Artificial IntelligenceFeb-23-2023

In sponsored search advertising (SSA), keywords serve as the basic unit of business model, linking three stakeholders: consumers, advertisers and search engines. This paper presents an overarching framework for keyword decisions that highlights the touchpoints in search advertising management, including four levels of keyword decisions, i.e., domain-specific keyword pool generation, keyword targeting, keyword assignment and grouping, and keyword adjustment. Using this framework, we review the state-of-the-art research literature on keyword decisions with respect to techniques, input features and evaluation metrics. Finally, we discuss evolving issues and identify potential gaps that exist in the literature and outline novel research perspectives for future exploration.

data mining, decision support system, machine learning, (26 more...)

arXiv.org Artificial Intelligence

doi: 10.1016/j.ipm.2022.103142

2302.12372

Country:

North America > United States > Michigan > Washtenaw County > Ann Arbor (0.14)
North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Massachusetts > Suffolk County > Boston (0.14)
(28 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.93)

Industry:

Marketing (1.00)
Information Technology > Services (1.00)
Education > Educational Setting > Online (0.46)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Web (1.00)
(10 more...)

Add feedback

Natural Language Processing in the Legal Domain

Katz, Daniel Martin, Hartung, Dirk, Gerlach, Lauritz, Jana, Abhik, Bommarito, Michael J. II

arXiv.org Artificial IntelligenceFeb-23-2023

In this paper, we summarize the current state of the field of NLP & Law with a specific focus on recent technical and substantive developments. To support our analysis, we construct and analyze a nearly complete corpus of more than six hundred NLP & Law related papers published over the past decade. Our analysis highlights several major trends. Namely, we document an increasing number of papers written, tasks undertaken, and languages covered over the course of the past decade. We observe an increase in the sophistication of the methods which researchers deployed in this applied context. Slowly but surely, Legal NLP is beginning to match not only the methodological sophistication of general NLP but also the professional standards of data availability and code reproducibility observed within the broader scientific community. We believe all of these trends bode well for the future of the field, but many questions in both the academic and commercial sphere still remain open.

arxiv preprint arxiv, large language model, machine learning, (17 more...)

arXiv.org Artificial Intelligence

2302.12039

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
North America > United States > Iowa (0.04)
(4 more...)

Genre:

Research Report (1.00)
Overview (1.00)

Industry: Law > Statutes (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.68)

Add feedback

Hierarchical Interdisciplinary Topic Detection Model for Research Proposal Classification

Xiao, Meng, Qiao, Ziyue, Fu, Yanjie, Dong, Hao, Du, Yi, Wang, Pengyang, Xiong, Hui, Zhou, Yuanchun

arXiv.org Artificial IntelligenceFeb-22-2023

The peer merit review of research proposals has been the major mechanism for deciding grant awards. However, research proposals have become increasingly interdisciplinary. It has been a longstanding challenge to assign interdisciplinary proposals to appropriate reviewers, so proposals are fairly evaluated. One of the critical steps in reviewer assignment is to generate accurate interdisciplinary topic labels for proposal-reviewer matching. Existing systems mainly collect topic labels manually generated by principal investigators. However, such human-reported labels can be non-accurate, incomplete, labor intensive, and time costly. What role can AI play in developing a fair and precise proposal reviewer assignment system? In this study, we collaborate with the National Science Foundation of China to address the task of automated interdisciplinary topic path detection. For this purpose, we develop a deep Hierarchical Interdisciplinary Research Proposal Classification Network (HIRPCN). Specifically, we first propose a hierarchical transformer to extract the textual semantic information of proposals. We then design an interdisciplinary graph and leverage GNNs for learning representations of each discipline in order to extract interdisciplinary knowledge. After extracting the semantic and interdisciplinary knowledge, we design a level-wise prediction component to fuse the two types of knowledge representations and detect interdisciplinary topic paths for each proposal. We conduct extensive experiments and expert evaluations on three real-world datasets to demonstrate the effectiveness of our proposed model.

data mining, information retrieval, machine learning, (21 more...)

arXiv.org Artificial Intelligence

doi: 10.1109/TKDE.2023.3248608

2209.13519

Country:

Asia > China > Beijing > Beijing (0.04)
Asia > Macao (0.04)
Asia > China > Guangdong Province > Guangzhou (0.04)
(6 more...)

Genre: Research Report (1.00)

Industry:

Information Technology (0.67)
Government (0.66)
Health & Medicine > Therapeutic Area > Neurology (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
(2 more...)

Add feedback