AITopics

2405.11219

Country:

North America > United States > New Mexico > Santa Fe County > Santa Fe (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)
North America > Dominican Republic (0.04)
(7 more...)

Genre:

Research Report > New Finding (0.47)
Research Report > Experimental Study (0.34)

Industry:

Health & Medicine (1.00)
Banking & Finance > Insurance (1.00)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
(2 more...)

Laue, Sören, Prusina, Tomislav

Efficient Line Search Method Based on Regression and Uncertainty Quantification

arXiv.org Artificial IntelligenceMay-17-2024

Unconstrained optimization problems are typically solved using iterative methods, which often depend on line search techniques to determine optimal step lengths in each iteration. This paper introduces a novel line search approach. Traditional line search methods, aimed at determining optimal step lengths, often discard valuable data from the search process and focus on refining step length intervals. This paper proposes a more efficient method using Bayesian optimization, which utilizes all available data points, i.e., function values and gradients, to guide the search towards a potential global minimum. This new approach more effectively explores the search space, leading to better solution quality. It is also easy to implement and integrate into existing frameworks. Tested on the challenging CUTEst test set, it demonstrates superior performance compared to existing state-of-the-art methods, solving more problems to optimality with equivalent resource usage.

function value, line search method, optimization, (12 more...)

2405.10897

Country: Europe > Germany > Hamburg (0.04)

Genre: Research Report > Promising Solution (0.49)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.87)

arXiv.org Artificial IntelligenceMay-16-2024

Navigating Public Sentiment in the Circular Economy through Topic Modelling and Hyperparameter Optimisation

Song, Junhao, Yuan, Yingfang, Chang, Kaiwen, Xu, Bing, Xuan, Jin, Pang, Wei

To advance the circular economy (CE), it is crucial to gain insights into the evolution of public sentiments, cognitive pathways of the masses concerning circular products and digital technology, and recognise the primary concerns. To achieve this, we collected data related to the CE from diverse platforms including Twitter, Reddit, and The Guardian. This comprehensive data collection spanned across three distinct strata of the public: the general public, professionals, and official sources. Subsequently, we utilised three topic models on the collected data. Topic modelling represents a type of data-driven and machine learning approach for text mining, capable of automatically categorising a large number of documents into distinct semantic groups. Simultaneously, these groups are described by topics, and these topics can aid in understanding the semantic content of documents at a high level. However, the performance of topic modelling may vary depending on different hyperparameter values. Therefore, in this study, we proposed a framework for topic modelling with hyperparameter optimisation for CE and conducted a series of systematic experiments to ensure that topic models are set with appropriate hyperparameters and to gain insights into the correlations between the CE and public opinion based on well-established models. The results of this study indicate that concerns about sustainability and economic impact persist across all three datasets. Official sources demonstrate a higher level of engagement with the application and regulation of CE. To the best of our knowledge, this study is pioneering in investigating various levels of public opinions concerning CE through topic modelling with the exploration of hyperparameter optimisation.

circular economy, dataset, sustainability, (14 more...)

2405.10452

Country:

Oceania > Australia (0.04)
Europe > United Kingdom > Scotland > City of Edinburgh > Edinburgh (0.04)
Europe > Russia (0.04)
(13 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Water & Waste Management > Solid Waste Management (1.00)
Health & Medicine > Therapeutic Area > Immunology (1.00)
Government (1.00)
(6 more...)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Discourse & Dialogue (0.69)
(5 more...)

Palen-Michel, Chester, Liang, Lizzie, Wu, Zhe, Lignos, Constantine

QueryNER: Segmentation of E-commerce Queries

arXiv.org Artificial IntelligenceMay-15-2024

Prior work in sequence labeling for e-commerce has largely addressed aspect-value extraction which focuses on extracting portions of a product title or query for narrowly defined aspects. Our work instead focuses on the goal of dividing a query into meaningful chunks with broadly applicable types. We report baseline tagging results and conduct experiments comparing token and entity dropping for null and low recall query recovery. Challenging test sets are created using automatic transformations and show how simple data augmentation techniques can make the models more robust to noise. We make the QueryNER dataset publicly available.

dataset, query, span, (15 more...)

2405.09507

Country:

North America > Dominican Republic (0.04)
North America > United States > Wisconsin > Milwaukee County > Milwaukee (0.04)
North America > United States > Washington > King County > Seattle (0.04)
(12 more...)

Genre: Research Report (1.00)

Industry:

Transportation > Passenger (1.00)
Transportation > Ground > Road (1.00)
Leisure & Entertainment (0.93)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.69)

arXiv.org Artificial IntelligenceMay-15-2024

From Matching to Generation: A Survey on Generative Information Retrieval

Li, Xiaoxi, Jin, Jiajie, Zhou, Yujia, Zhang, Yuyao, Zhang, Peitian, Zhu, Yutao, Dou, Zhicheng

Information Retrieval (IR) systems are crucial tools for users to access information, widely applied in scenarios like search engines, question answering, and recommendation systems. Traditional IR methods, based on similarity matching to return ranked lists of documents, have been reliable means of information acquisition, dominating the IR field for years. With the advancement of pre-trained language models, generative information retrieval (GenIR) has emerged as a novel paradigm, gaining increasing attention in recent years. Currently, research in GenIR can be categorized into two aspects: generative document retrieval (GR) and reliable response generation. GR leverages the generative model's parameters for memorizing documents, enabling retrieval by directly generating relevant document identifiers without explicit indexing. Reliable response generation, on the other hand, employs language models to directly generate the information users seek, breaking the limitations of traditional IR in terms of document granularity and relevance matching, offering more flexibility, efficiency, and creativity, thus better meeting practical needs. This paper aims to systematically review the latest research progress in GenIR. We will summarize the advancements in GR regarding model training, document identifier, incremental learning, downstream tasks adaptation, multi-modal GR and generative recommendation, as well as progress in reliable response generation in aspects of internal knowledge memorization, external knowledge augmentation, generating response with citations and personal information assistant. We also review the evaluation, challenges and future prospects in GenIR systems. This review aims to offer a comprehensive reference for researchers in the GenIR field, encouraging further development in this area.

computational linguistic, language model, retrieval, (15 more...)

2404.14851

Country:

Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
North America > Canada > Ontario > Toronto (0.04)
(30 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.45)
Research Report > New Finding (0.45)

Industry:

Education (1.00)
Law (0.67)
Information Technology (0.66)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Hager, Philipp, Deffayet, Romain, Renders, Jean-Michel, Zoeter, Onno, de Rijke, Maarten

Unbiased Learning to Rank Meets Reality: Lessons from Baidu's Large-Scale Search Dataset

arXiv.org Artificial IntelligenceMay-15-2024

Unbiased learning-to-rank (ULTR) is a well-established framework for learning from user clicks, which are often biased by the ranker collecting the data. While theoretically justified and extensively tested in simulation, ULTR techniques lack empirical validation, especially on modern search engines. The Baidu-ULTR dataset released for the WSDM Cup 2023, collected from Baidu's search engine, offers a rare opportunity to assess the real-world performance of prominent ULTR techniques. Despite multiple submissions during the WSDM Cup 2023 and the subsequent NTCIR ULTRE-2 task, it remains unclear whether the observed improvements stem from applying ULTR or other learning techniques. In this work, we revisit and extend the available experiments on the Baidu-ULTR dataset. We find that standard unbiased learning-to-rank techniques robustly improve click predictions but struggle to consistently improve ranking performance, especially considering the stark differences obtained by choice of ranking loss and query-document features. Our experiments reveal that gains in click prediction do not necessarily translate to enhanced ranking performance on expert relevance annotations, implying that conclusions strongly depend on how success is measured in this benchmark.

dataset, learning, proceedings, (15 more...)

doi: 10.1145/3626772.3657892

2404.02543

Country:

North America > United States > District of Columbia > Washington (0.05)
Europe > Netherlands > North Holland > Amsterdam (0.05)
Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
(2 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (0.68)

Technology:

Information Technology > Information Management > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.68)

Carvallo, Andrés, Quiroga, Tamara, Aspillaga, Carlos, Mendoza, Marcelo

Unveiling Social Media Comments with a Novel Named Entity Recognition System for Identity Groups

While civilized users employ social media to stay informed and discuss daily occurrences, haters perceive these platforms as fertile ground for attacking groups and individuals. The prevailing approach to counter this phenomenon involves detecting such attacks by identifying toxic language. Effective platform measures aim to report haters and block their network access. In this context, employing hate speech detection methods aids in identifying these attacks amidst vast volumes of text, which are impossible for humans to analyze manually. In our study, we expand upon the usual hate speech detection methods, typically based on text classifiers, to develop a Named Entity Recognition (NER) System for Identity Groups. To achieve this, we created a dataset that allows extending a conventional NER to recognize identity groups. Consequently, our tool not only detects whether a sentence contains an attack but also tags the sentence tokens corresponding to the mentioned group. Results indicate that the model performs competitively in identifying groups with an average f1-score of 0.75, outperforming in identifying ethnicity attack spans with an f1-score of 0.80 compared to other identity groups. Moreover, the tool shows an outstanding generalization capability to minority classes concerning sexual orientation and gender, achieving an f1-score of 0.77 and 0.72, respectively. We tested the utility of our tool in a case study on social media, annotating and comparing comments from Facebook related to news mentioning identity groups. The case study reveals differences in the types of attacks recorded, effectively detecting named entities related to the categories of the analyzed news articles. Entities are accurately tagged within their categories, with a negligible error rate for inter-category tagging.

category, dataset, identity group, (13 more...)

2405.13011

Country:

South America > Chile (0.05)
North America > United States > Virginia (0.04)
North America > United States > Ohio (0.04)
(11 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Government > Regional Government (0.93)
Law Enforcement & Public Safety > Terrorism (0.93)
Health & Medicine > Therapeutic Area (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Optimizing E-commerce Search: Toward a Generalizable and Rank-Consistent Pre-Ranking Model

Xu, Enqiang, Qiu, Yiming, Bai, Junyang, Zhang, Ping, Miao, Dadong, Wang, Songlin, Tang, Guoyu, Liu, Lin, Li, Mingming

Beyond these optimizations, meeting the system To enhance user experience and conversion efficiency, the online performance requirements presents a significant challenge. Contrasting search system is employed with a cascading architecture, mainly with existing industry works, we propose a novel method: a including recall and ranking. The ranking stage as the downstream Generalizable and RAnk-ConsistEnt Pre-Ranking Model (GRACE), component directly influences the efficiency of item sorting. Several which achieves: 1) Ranking consistency by introducing multiple superior ranking models have been identified in industrial research, binary classification tasks that predict whether a product is within such as MMoE [4], PLE [12], ESMM [5], DeepFM [1], DIN [18], the top-k results as estimated by the ranking model, which facilitates MIMN [8], SDIM [16], and SIM [12], with a focus on feature engineering, the addition of learning objectives on common point-wise behavioral sequence modeling, and objective function ranking models; 2) Generalizability through contrastive learning optimization. However, as the scale of products within the search of representation for all products by pre-training on a subset of system grows, there is an increasing demand for managing the ranking product embeddings; 3) Ease of implementation in feature time complexity of the sorting module.

proceedings, ranking model, representation, (11 more...)

doi: 10.1145/3626772.3661343

2405.05606

Country:

Asia > China > Beijing > Beijing (0.06)
North America > United States > District of Columbia > Washington (0.05)
North America > United States > Texas > Travis County > Austin (0.04)
(2 more...)

Genre:

Research Report > New Finding (0.46)
Research Report > Promising Solution (0.34)

Industry: Information Technology > Services > e-Commerce Services (0.43)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.48)

Akhtar, Mubashara, Pang, Chenxi, Marzoca, Andreea, Altun, Yasemin, Eisenschlos, Julian Martin

TANQ: An open domain dataset of table answered questions

Language models, potentially augmented with tool usage such as retrieval are becoming the go-to means of answering questions. Understanding and answering questions in real-world settings often requires retrieving information from different sources, processing and aggregating data to extract insights, and presenting complex findings in form of structured artifacts such as novel tables, charts, or infographics. In this paper, we introduce TANQ, the first open domain question answering dataset where the answers require building tables from information across multiple sources. We release the full source attribution for every cell in the resulting table and benchmark state-of-the-art language models in open, oracle, and closed book setups. Our best-performing baseline, GPT4 reaches an overall F1 score of 29.1, lagging behind human performance by 19.7 points. We analyse baselines' performance across different dataset attributes such as different skills required for this task, including multi-hop reasoning, math operations, and unit conversions. We further discuss common failures in model-generated answers, suggesting that TANQ is a complex task with many challenges ahead.

answer table, dataset, relation, (16 more...)

2405.07765

Country:

North America > Canada > Ontario > Toronto (0.04)
North America > United States > Florida (0.04)
Oceania > Australia > New South Wales > Sydney (0.04)
(17 more...)

Genre: Research Report (0.82)

Industry:

Media > Film (1.00)
Leisure & Entertainment (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.90)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)
Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.56)
Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.54)

Franz, Maja, Winker, Tobias, Groppe, Sven, Mauerer, Wolfgang

Hype or Heuristic? Quantum Reinforcement Learning for Join Order Optimisation

Identifying optimal join orders (JOs) stands out as a key challenge in database research and engineering. Owing to the large search space, established classical methods rely on approximations and heuristics. Recent efforts have successfully explored reinforcement learning (RL) for JO. Likewise, quantum versions of RL have received considerable scientific attention. Yet, it is an open question if they can achieve sustainable, overall practical advantages with improved quantum processors. In this paper, we present a novel approach that uses quantum reinforcement learning (QRL) for JO based on a hybrid variational quantum ansatz. It is able to handle general bushy join trees instead of resorting to simpler left-deep variants as compared to approaches based on quantum(-inspired) optimisation, yet requires multiple orders of magnitudes fewer qubits, which is a scarce resource even for post-NISQ systems. Despite moderate circuit depth, the ansatz exceeds current NISQ capabilities, which requires an evaluation by numerical simulations. While QRL may not significantly outperform classical approaches in solving the JO problem with respect to result quality (albeit we see parity), we find a drastic reduction in required trainable parameters. This benefits practically relevant aspects ranging from shorter training times compared to classical RL, less involved classical optimisation passes, or better use of available training data, and fits data-stream and low-latency processing scenarios. Our comprehensive evaluation and careful discussion delivers a balanced perspective on possible practical quantum advantage, provides insights for future systemic approaches, and allows for quantitatively assessing trade-offs of quantum approaches for one of the most crucial problems of database management systems.

algorithm, join order, relation, (15 more...)

2405.0777

Country:

Europe > Germany > Bavaria > Regensburg (0.04)
Europe > Germany > Saxony-Anhalt > Magdeburg (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)

Genre:

Research Report > New Finding (0.67)
Research Report > Promising Solution (0.48)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Information Retrieval > Query Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.93)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.93)