AITopics

doi: 10.1145/3576840.3578327

2212.07476

Country:

North America > United States > Texas > Travis County > Austin (0.05)
North America > United States > New York > New York County > New York City (0.05)
Europe > Germany > Saxony > Leipzig (0.04)
(28 more...)

Genre:

Overview (1.00)
Research Report (0.82)

Industry:

Leisure & Entertainment > Games > Computer Games (1.00)
Information Technology (1.00)
Media > News (0.93)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.66)

Tran, Hanh Thi Hong, Martinc, Matej, Caporusso, Jaya, Doucet, Antoine, Pollak, Senja

The Recent Advances in Automatic Term Extraction: A survey

arXiv.org Artificial Intelligence, Jan-17-2023

Automatic term extraction (ATE) is a Natural Language Processing (NLP) task that eases the effort of manually identifying terms from domain-specific corpora by providing a list of candidate terms. As units of knowledge in a specific field of expertise, extracted terms are not only beneficial for several terminographical tasks, but also support and improve several complex downstream tasks, e.g., information retrieval, machine translation, topic detection, and sentiment analysis. ATE systems, along with annotated datasets, have been studied and developed widely for decades, but recently we observed a surge in novel neural systems for the task at hand. Despite a large amount of new research on ATE, systematic survey studies covering novel neural approaches are lacking. We present a comprehensive survey of deep learning-based approaches to ATE, with a focus on Transformer-based neural models. The study also offers a comparison between these systems and previous ATE approaches, which were based on feature engineering and non-neural supervised learning algorithms.

information retrieval, machine learning, natural language, (18 more...)

2301.06767

Country:

South America > Brazil (0.14)
Europe > Slovenia (0.05)
Europe > France > Nouvelle-Aquitaine (0.04)
(9 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.66)

Industry: Health & Medicine > Pharmaceuticals & Biotechnology (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligence, Jan-15-2023, 21:25:52 GMT

9 "Best" SEO Tools (January 2023) - Channel969

SEO (Search Engine Optimization) requires a multifaceted strategy that includes researching the competition, analyzing which keywords can drive traffic, creating an external and internal link-building strategy, and optimizing page loading speed. Below we feature the best SEO tools to increase your odds of ranking high in Google. This powerful SEO platform offers a range of tools that replace the functionality of other products, including Google Trends, MOZ, Hootsuite and SimilarWeb. Traffic Analysis – Benchmark your website traffic against competitors to see where you stand. See their estimated total traffic, top traffic sources, bounce rate, time on page, and more to inform your next strategy.

artificial intelligence, information retrieval, natural language, (19 more...)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)

#artificialintelligence, Jan-12-2023, 17:10:22 GMT

Microsoft's ChatGPT investment could create 'game-changer' AI

Microsoft (MSFT) is going all in on ChatGPT, an artificial intelligence (AI) technology that could power a new search engine and disrupt the dominance of Google (GOOG). News site Semafor reported on Tuesday that Microsoft is investing $10bn (£8.2bn) in OpenAI, the artificial intelligence firm that launched the generative AI tool ChatGPT in November 2022. This will value the San Francisco-based firm at $29bn, and industry analysts say that Google should pay close attention to the deal. Microsoft spends billions of dollars every year trying to compete with Google's search engine dominance but, with comparatively low user interaction on Bing, it has failed for over a decade. Microsoft has so far failed to replicate the algorithm that powers Google search, but if it incorporates the generative power of ChatGPT into Bing, or a new search engine, this could be "a game changer", an industry commentator has suggested.

information retrieval, large language model, machine learning, (18 more...)

Country: North America > United States > California > San Francisco County > San Francisco (0.25)

Industry: Information Technology (0.56)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Fiergolla, Sven, Goergen, Kevin, Neises, Patrick, Wolf, Petra

Heuristic for Diverse Kemeny Rank Aggregation based on Quantum Annealing

arXiv.org Artificial Intelligence, Jan-12-2023

The Kemeny Rank Aggregation (KRA) problem is a well-studied problem in the field of Social Choice with a variety of applications in areas such as databases and search engines. Intuitively, given a set of votes over a set of candidates, the problem asks to find an aggregated ranking of candidates that minimizes the overall dissatisfaction concerning the votes. Recently, a diverse version of KRA was considered which asks for a sufficiently diverse set of sufficiently good solutions. The framework of diversity of solutions is a young and thriving topic in the field of artificial intelligence. The main idea is to provide the user with not just one, but with a set of different solutions, enabling her to pick a sufficiently good solution that satisfies additional subjective criteria that are hard or impossible to model. In this work, we use a quantum annealer to solve the KRA problem and to compute a representative set of solutions. Quantum annealing is a meta search heuristic that not only shows promising runtime behavior on currently existing prototypes but also samples the solution space in an inherently different way, making use of quantum effects. We describe how KRA instances can be solved by a quantum annealer and provide an implementation as well as experimental evaluations. As existing quantum annealers are still restricted in their number of qubits, we further implement two different data reduction rules that can split an instance into a set of smaller instances. In our evaluation, we compare classical heuristics that allow sampling multiple solutions, such as simulated annealing and local search, with quantum annealing performed on a physical quantum annealer. We compare runtime, quality of solution, and diversity of solutions, with and without applying preceding data reduction rules.
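
To make the problem statement above concrete, here is a minimal brute-force sketch in Python. It is not the quantum-annealing heuristic from the paper. It assumes that "dissatisfaction" is measured as the summed Kendall tau distance between a candidate ranking and each vote, which is the usual Kemeny formulation. The function names, the `slack` parameter, and the sample votes are hypothetical, and enumerating all permutations is only practical for a handful of candidates.

```python
from itertools import combinations, permutations


def kendall_tau(r1, r2):
    """Count the candidate pairs that the two rankings order differently."""
    pos1 = {c: i for i, c in enumerate(r1)}
    pos2 = {c: i for i, c in enumerate(r2)}
    return sum(
        1
        for a, b in combinations(r1, 2)
        if (pos1[a] - pos1[b]) * (pos2[a] - pos2[b]) < 0
    )


def kemeny_score(ranking, votes):
    """Overall dissatisfaction: summed distance from the ranking to every vote."""
    return sum(kendall_tau(ranking, vote) for vote in votes)


def kemeny_rankings(votes, slack=0):
    """Return the optimal score and all rankings within `slack` of it.

    `slack` is an illustrative knob (an assumption, not from the paper):
    slack=0 keeps only optimal rankings, larger values also keep
    "sufficiently good" ones.
    """
    candidates = sorted(votes[0])
    scored = [(kemeny_score(p, votes), p) for p in permutations(candidates)]
    best = min(score for score, _ in scored)
    return best, [p for score, p in scored if score <= best + slack]


# Hypothetical votes over three candidates.
votes = [("a", "b", "c"), ("a", "c", "b"), ("b", "a", "c")]
best, optimal = kemeny_rankings(votes)
_, near_optimal = kemeny_rankings(votes, slack=1)
```

With these sample votes, `optimal` contains the single ranking `("a", "b", "c")` at score 2, while `slack=1` also admits the two rankings that score 3. Choosing a diverse subset from such a pool is the part the diverse variant of KRA adds, and it is not attempted here.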

information retrieval, machine learning, natural language, (19 more...)

2301.05146

Country:

Europe > Norway > Western Norway > Vestland > Bergen (0.04)
Europe > Germany (0.04)
Asia > Afghanistan > Parwan Province > Charikar (0.04)

Genre: Research Report (0.82)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.86)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.54)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.35)

arXiv.org Artificial Intelligence, Jan-11-2023

KAER: A Knowledge Augmented Pre-Trained Language Model for Entity Resolution

Fang, Liri, Li, Lan, Liu, Yiren, Torvik, Vetle I., Ludäscher, Bertram

Entity resolution has been an essential and well-studied task in data cleaning research for decades. Existing work has discussed the feasibility of utilizing pre-trained language models to perform entity resolution and achieved promising results. However, few works have discussed injecting domain knowledge to improve the performance of pre-trained language models on entity resolution tasks. In this study, we propose Knowledge Augmented Entity Resolution (KAER), a novel framework for augmenting pre-trained language models with external knowledge for entity resolution. We discuss the results of utilizing different knowledge augmentation and prompting methods to improve entity resolution performance. Our model improves on Ditto, the existing state-of-the-art entity resolution method. In particular, 1) KAER performs more robustly and achieves better results on "dirty data"; 2) with more general knowledge injection, KAER outperforms the existing baseline models on the textual dataset and the dataset from the online product domain; and 3) KAER achieves competitive results on highly domain-specific datasets, such as citation datasets, requiring the injection of expert knowledge in future work.

information retrieval, machine learning, natural language, (17 more...)

2301.0477

Country: North America > United States > Illinois > Champaign County > Urbana (0.04)

Genre: Research Report > New Finding (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

#artificialintelligence, Jan-10-2023, 09:55:08 GMT

An introduction to NLP and its importance in today's technology landscape

Natural Language Processing (NLP) is a branch of artificial intelligence (AI) that deals with the interaction between computers and human language. The goal of NLP is to develop algorithms and models that can understand, interpret, and generate human language. NLP has a wide range of applications, from language translation to sentiment analysis, and is critical in today's technology landscape. One of the most notable areas where NLP has had a significant impact is in the field of search engines. Search engines use NLP algorithms to understand the intent behind a user's query and match it with relevant results. NLP also plays a crucial role in information retrieval, which is the process of finding relevant information from a large collection of documents.

artificial intelligence, information retrieval, natural language, (18 more...)

Industry: Health & Medicine (0.34)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.80)

#artificialintelligence, Jan-9-2023, 09:55:54 GMT

The Dangers Of ChatGPT To Search Engines - AI Summary

LLMs are great, but they won't replace search engines anytime soon. The biggest reason is that chat-based search interfaces lack the context and flexibility that users expect and need from a search engine. Chatbots and LLMs are great. Understanding user intent is fantastic. But search is here to stay.

deep learning, large language model, machine learning, (4 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.85)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.40)

Mysore, Sheshera, Jasim, Mahmood, Song, Haoru, Akbar, Sarah, Randall, Andre Kenneth Chase, Mahyar, Narges

How Data Scientists Review the Scholarly Literature

arXiv.org Artificial Intelligence, Jan-9-2023

Keeping up with the research literature plays an important role in the workflow of scientists, allowing them to understand a field, formulate the problems they focus on, and develop the solutions that they contribute, which in turn shape the nature of the discipline. In this paper, we examine the literature review practices of data scientists. Data science represents a field seeing an exponential rise in papers, and increasingly drawing on and being applied in numerous diverse disciplines. Recent efforts have seen the development of several tools intended to help data scientists cope with a deluge of research and coordinated efforts to develop AI tools intended to uncover the research frontier. Despite these trends indicative of the information overload faced by data scientists, no prior work has examined the specific practices and challenges faced by these scientists in an interdisciplinary field with evolving scholarly norms. In this paper, we close this gap through a set of semi-structured interviews and think-aloud protocols of industry and academic data scientists (N = 20). Our results, while corroborating other knowledge workers' practices, uncover several novel findings: individuals (1) are challenged in seeking and sensemaking of papers beyond their disciplinary bubbles, (2) struggle to understand papers in the face of missing details and mathematical content, (3) grapple with the deluge by leveraging the knowledge context in code, blogs, and talks, and (4) lean on their peers online and in-person. Furthermore, we outline future directions likely to help data scientists cope with the burgeoning research literature.

computing machinery, information retrieval, machine learning, (16 more...)

doi: 10.1145/3576840.3578309

2301.03774

Country:

North America > United States > Texas > Travis County > Austin (0.14)
North America > United States > New York > New York County > New York City (0.07)
Europe > United Kingdom > Scotland > City of Glasgow > Glasgow (0.04)
(30 more...)

Genre:

Research Report > New Finding (1.00)
Questionnaire & Opinion Survey (1.00)

Industry:

Health & Medicine (1.00)
Education (1.00)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Data Science (1.00)
Information Technology > Communications > Social Media (1.00)
(6 more...)

arXiv.org Artificial Intelligence, Jan-9-2023

Cross-Model Comparative Loss for Enhancing Neuronal Utility in Language Understanding

Zhu, Yunchang, Pang, Liang, Wu, Kangxi, Lan, Yanyan, Shen, Huawei, Cheng, Xueqi

Current natural language understanding (NLU) models have been continuously scaling up, both in terms of model size and input context, introducing more hidden and input neurons. While this generally improves performance on average, the extra neurons do not yield a consistent improvement for all instances. This is because some hidden neurons are redundant, and the noise mixed in input neurons tends to distract the model. Previous work mainly focuses on extrinsically reducing low-utility neurons by additional post- or pre-processing, such as network pruning and context selection, to avoid this problem. Beyond that, can we make the model reduce redundant parameters and suppress input noise by intrinsically enhancing the utility of each neuron? If a model can efficiently utilize neurons, no matter which neurons are ablated (disabled), the ablated submodel should perform no better than the original full model. Based on such a comparison principle between models, we propose a cross-model comparative loss for a broad range of tasks. Comparative loss is essentially a ranking loss on top of the task-specific losses of the full and ablated models, with the expectation that the task-specific loss of the full model is minimal. We demonstrate the universal effectiveness of comparative loss through extensive experiments on 14 datasets from 3 distinct NLU tasks based on 4 widely used pretrained language models, and find it particularly superior for models with few parameters or long input.
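
The comparison principle described above can be sketched in a few lines of Python. This is only an illustration, not the paper's exact formulation: it assumes the ranking term is a pairwise hinge penalty between the full model's task loss and each ablated model's task loss. The function name and the `margin` and `weight` parameters are hypothetical, and plain floats stand in for differentiable loss tensors.

```python
def comparative_loss(full_loss, ablated_losses, margin=0.0, weight=1.0):
    """Task loss of the full model plus a ranking penalty (assumed hinge form).

    The penalty is positive only when some ablated model reaches a lower
    task loss than the full model, which would violate the expectation
    that the full model's loss is minimal.
    """
    penalty = sum(
        max(0.0, full_loss - ablated + margin) for ablated in ablated_losses
    )
    return full_loss + weight * penalty


# Hypothetical task losses: one ablated model (0.4) beats the full model (0.5).
with_violation = comparative_loss(0.5, [0.7, 0.4])

# No ablated model beats the full model, so only the task loss remains.
no_violation = comparative_loss(0.5, [0.7, 0.9])
```

In the first call the ablated loss of 0.4 undercuts the full loss of 0.5, so the result rises to roughly 0.6. In the second call the penalty is zero and the result is just the full model's task loss.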

information retrieval, machine learning, natural language, (18 more...)

2301.03765

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > New York > New York County > New York City (0.04)
Asia > China > Beijing > Beijing (0.04)
(8 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)