AITopics | Pacific Ocean

Collaborating Authors

Pacific Ocean

Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval

Caffagni, Davide, Sarto, Sara, Cornia, Marcella, Baraldi, Lorenzo, Cucchiara, Rita

arXiv.org Artificial IntelligenceMar-3-2025

Cross-modal retrieval is gaining increasing efficacy and interest from the research community, thanks to large-scale training, novel architectural and learning designs, and its application in LLMs and multimodal LLMs. In this paper, we move a step forward and design an approach that allows for multimodal queries, composed of both an image and a text, and can search within collections of multimodal documents, where images and text are interleaved. Our model, ReT, employs multi-level representations extracted from different layers of both visual and textual backbones, both at the query and document side. To allow for multi-level and cross-modal understanding and feature extraction, ReT employs a novel Transformer-based recurrent cell that integrates both textual and visual features at different layers, and leverages sigmoidal gates inspired by the classical design of LSTMs. Extensive experiments on M2KR and M-BEIR benchmarks show that ReT achieves state-of-the-art performance across diverse settings. Our source code and trained models are publicly available at https://github.com/aimagelab/ReT.

dataset, representation, ret, (16 more...)

arXiv.org Artificial Intelligence

2503.0198

Country:

Asia > China (0.04)
North America > United States > Nebraska (0.04)
North America > United States > Illinois (0.04)
(8 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer

Goldman, Omer, Shaham, Uri, Malkin, Dan, Eiger, Sivan, Hassidim, Avinatan, Matias, Yossi, Maynez, Joshua, Gilady, Adi Mayrav, Riesa, Jason, Rijhwani, Shruti, Rimell, Laura, Szpektor, Idan, Tsarfaty, Reut, Eyal, Matan

arXiv.org Artificial IntelligenceMar-3-2025

To achieve equitable performance across languages, multilingual large language models (LLMs) must be able to abstract knowledge beyond the language in which it was acquired. However, the current literature lacks reliable ways to measure LLMs' capability of cross-lingual knowledge transfer. To that end, we present ECLeKTic, a multilingual closed-book QA (CBQA) dataset that Evaluates Cross-Lingual Knowledge Transfer in a simple, black-box manner. We detected information with uneven coverage across languages by controlling for presence and absence of Wikipedia articles in 12 languages. We generated knowledge-seeking questions in a source language, for which the answer appears in a relevant Wikipedia article and translated them to all other 11 languages, for which the respective Wikipedias lack equivalent articles. Assuming that Wikipedia reflects the prominent knowledge in the LLM's training data, to solve ECLeKTic's CBQA task the model is required to transfer knowledge between languages. Experimenting with 8 LLMs, we show that SOTA models struggle to effectively share knowledge across, languages even if they can predict the answer well for queries in the same language the knowledge was acquired in.

computational linguistic, knowledge, kt ic, (16 more...)

arXiv.org Artificial Intelligence

2502.21228

Country:

Asia > Singapore (0.04)
Pacific Ocean > South Pacific Ocean > Coral Sea (0.04)
Oceania > Australia > Queensland (0.04)
(7 more...)

Genre: Research Report (0.82)

Industry: Leisure & Entertainment > Sports > Snooker (1.00)

Technology:

Information Technology > Knowledge Management (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.95)

Add feedback

ReFocus: Reinforcing Mid-Frequency and Key-Frequency Modeling for Multivariate Time Series Forecasting

Yu, Guoqi, Li, Yaoming, Wang, Juncheng, Guo, Xiaoyu, Aviles-Rivero, Angelica I., Yang, Tong, Wang, Shujun

arXiv.org Artificial IntelligenceMar-3-2025

Recent advancements have progressively incorporated frequency-based techniques into deep learning models, leading to notable improvements in accuracy and efficiency for time series analysis tasks. However, the Mid-Frequency Spectrum Gap in the real-world time series, where the energy is concentrated at the low-frequency region while the middle-frequency band is negligible, hinders the ability of existing deep learning models to extract the crucial frequency information. Additionally, the shared Key-Frequency in multivariate time series, where different time series share indistinguishable frequency patterns, is rarely exploited by existing literature. This work introduces a novel module, Adaptive Mid-Frequency Energy Optimizer, based on convolution and residual learning, to emphasize the significance of mid-frequency bands. We also propose an Energy-based Key-Frequency Picking Block to capture shared Key-Frequency, which achieves superior inter-series modeling performance with fewer parameters. A novel Key-Frequency Enhanced Training strategy is employed to further enhance Key-Frequency modeling, where spectral information from other channels is randomly introduced into each channel. Our approach advanced multivariate time series forecasting on the challenging Traffic, ECL, and Solar benchmarks, reducing MSE by 4%, 6%, and 5% compared to the previous SOTA iTransformer. Code is available at this GitHub Repository: https://github.com/Levi-Ackman/ReFocus.

forecasting, series forecasting, time series forecasting, (13 more...)

arXiv.org Artificial Intelligence

2502.1689

Country:

Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
Asia > China > Hong Kong (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
(2 more...)

Genre: Research Report > New Finding (0.92)

Industry: Energy (0.95)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Unify and Anchor: A Context-Aware Transformer for Cross-Domain Time Series Forecasting

Hong, Xiaobin, Zhang, Jiawen, Li, Wenzhong, Lu, Sanglu, Li, Jia

arXiv.org Artificial IntelligenceMar-2-2025

The rise of foundation models has revolutionized natural language processing and computer vision, yet their best practices to time series forecasting remains underexplored. Existing time series foundation models often adopt methodologies from these fields without addressing the unique characteristics of time series data. In this paper, we identify two key challenges in cross-domain time series forecasting: the complexity of temporal patterns and semantic misalignment. To tackle these issues, we propose the ``Unify and Anchor" transfer paradigm, which disentangles frequency components for a unified perspective and incorporates external context as domain anchors for guided adaptation. Based on this framework, we introduce ContexTST, a Transformer-based model that employs a time series coordinator for structured representation and the Transformer blocks with a context-informed mixture-of-experts mechanism for effective cross-domain generalization. Extensive experiments demonstrate that ContexTST advances state-of-the-art forecasting performance while achieving strong zero-shot transferability across diverse domains.

arxiv preprint arxiv, dataset, forecasting, (11 more...)

arXiv.org Artificial Intelligence

2503.01157

Country:

Asia > China > Jiangsu Province > Nanjing (0.04)
Pacific Ocean > North Pacific Ocean > San Francisco Bay (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
(2 more...)

Genre: Research Report > New Finding (1.00)

Industry: Energy > Power Industry (0.46)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Evaluating Personalized Tool-Augmented LLMs from the Perspectives of Personalization and Proactivity

Hao, Yupu, Cao, Pengfei, Jin, Zhuoran, Liao, Huanxuan, Chen, Yubo, Liu, Kang, Zhao, Jun

arXiv.org Artificial IntelligenceMar-2-2025

Personalized tool utilization is essential for aligning large language models (LLMs) with user preference in interaction scenarios with various tools. However, most of the current benchmarks primarily focus on either personalization of text generation or direct tool-utilizing, without considering both. In this work, we introduce a novel benchmark ETAPP for evaluating personalized tool invocation, establishing a sandbox environment, and a comprehensive dataset of 800 testing cases covering diverse user profiles. To improve the accuracy of our evaluation, we propose a key-point-based LLM evaluation method, mitigating biases in the LLM-as-a-judge system by manually annotating key points for each test case and providing them to LLM as the reference. Additionally, we evaluate the excellent LLMs and provide an in-depth analysis. Furthermore, we investigate the impact of different tool-invoking strategies on LLMs' personalization performance and the effects of fine-tuning in our task. The effectiveness of our preference-setting and key-point-based evaluation method is also validated. Our findings offer insights into improving personalized LLM agents. Our Code is available at https://github.com/hypasd-art/ETAPP.

instruction, wang, zhang, (17 more...)

arXiv.org Artificial Intelligence

2503.00771

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
North America > United States > California > San Francisco County > San Francisco (0.04)
Asia > Singapore (0.04)
(6 more...)

Genre: Research Report > New Finding (0.87)

Industry:

Information Technology (1.00)
Health & Medicine > Consumer Health (1.00)
Media (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Synthetic data enables context-aware bioacoustic sound event detection

Hoffman, Benjamin, Robinson, David, Miron, Marius, Baglione, Vittorio, Canestrari, Daniela, Elias, Damian, Trapote, Eva, Pietquin, Olivier

arXiv.org Artificial IntelligenceFeb-28-2025

We propose a methodology for training foundation models that enhances their in-context learning capabilities within the domain of bioacoustic signal processing. We use synthetically generated training data, introducing a domain-randomization-based pipeline that constructs diverse acoustic scenes with temporally strong labels. We generate over 8.8 thousand hours of strongly-labeled audio and train a query-by-example, transformer-based model to perform few-shot bioacoustic sound event detection. Our second contribution is a public benchmark of 13 diverse few-shot bioacoustics tasks. Our model outperforms previously published methods by 49%, and we demonstrate that this is due to both model design and data scale. We make our trained model available via an API, to provide ecologists and ethologists with a training-free tool for bioacoustic sound event detection.

dataset, detection, pseudovox, (15 more...)

arXiv.org Artificial Intelligence

2503.00296

Country:

North America > United States > Pennsylvania (0.04)
Africa > Gabon (0.04)
North America > United States > Hawaii (0.04)
(11 more...)

Genre: Research Report (0.83)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.88)

Add feedback

The Illusion of Rights based AI Regulation

Mei, Yiyang, Sag, Matthew

arXiv.org Artificial IntelligenceFeb-26-2025

Whether and how to regulate AI is one of the defining questions of our times - a question that is being debated locally, nationally, and internationally. We argue that much of this debate is proceeding on a false premise. Specifically, our article challenges the prevailing academic consensus that the European Union's AI regulatory framework is fundamentally rights-driven and the correlative presumption that other rights-regarding nations should therefore follow Europe's lead in AI regulation. Rather than taking rights language in EU rules and regulations at face value, we show how EU AI regulation is the logical outgrowth of a particular cultural, political, and historical context. We show that although instruments like the General Data Protection Regulation (GDPR) and the AI Act invoke the language of fundamental rights, these rights are instrumentalized - used as rhetorical cover for governance tools that address systemic risks and maintain institutional stability. As such, we reject claims that the EU's regulatory framework and the substance of its rules should be adopted as universal imperatives and transplanted to other liberal democracies. To add weight to our argument from historical context, we conduct a comparative analysis of AI regulation in five contested domains: data privacy, cybersecurity, healthcare, labor, and misinformation. This EU-US comparison shows that the EU's regulatory architecture is not meaningfully rights-based. Our article's key intervention in AI policy debates is not to suggest that the current American regulatory model is necessarily preferable but that the presumed legitimacy of the EU's AI regulatory approach must be abandoned.

llusion, regulation, rights, (14 more...)

arXiv.org Artificial Intelligence

2503.05784

Country:

North America > United States > New York (0.04)
North America > United States > Illinois (0.04)
Europe > Russia (0.04)
(29 more...)

Genre:

Overview (0.92)
Research Report (0.82)

Industry:

Law > Statutes (1.00)
Information Technology > Security & Privacy (1.00)
Health & Medicine > Health Care Providers & Services > Reimbursement (1.00)
(3 more...)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)

Add feedback

REALM-Bench: A Real-World Planning Benchmark for LLMs and Multi-Agent Systems

Geng, Longling, Chang, Edward Y.

arXiv.org Artificial IntelligenceFeb-26-2025

This benchmark suite provides a comprehensive evaluation framework for assessing both individual LLMs and multi-agent systems in real-world planning scenarios. The suite encompasses eleven designed problems that progress from basic to highly complex, incorporating key aspects such as multi-agent coordination, inter-agent dependencies, and dynamic environmental disruptions. Each problem can be scaled along three dimensions: the number of parallel planning threads, the complexity of inter-dependencies, and the frequency of unexpected disruptions requiring real-time adaptation. The benchmark includes detailed specifications, evaluation metrics, and baseline implementations using contemporary frameworks like LangGraph, enabling rigorous testing of both single-agent and multi-agent planning capabilities. Through standardized evaluation criteria and scalable complexity, this benchmark aims to drive progress in developing more robust and adaptable AI planning systems for real-world applications.

agent, scenario, textbf, (14 more...)

arXiv.org Artificial Intelligence

2502.18836

Country:

Pacific Ocean > North Pacific Ocean > San Francisco Bay > Golden Gate (0.05)
North America > United States > Illinois > Cook County > Chicago (0.04)
Asia > Middle East > Republic of Türkiye (0.04)
(3 more...)

Genre:

Workflow (0.47)
Research Report (0.41)

Industry:

Transportation (1.00)
Information Technology (1.00)
Consumer Products & Services > Travel (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts

Wu, Mingyan, Liu, Zhenghao, Yan, Yukun, Li, Xinze, Yu, Shi, Zeng, Zheni, Gu, Yu, Yu, Ge

arXiv.org Artificial IntelligenceFeb-25-2025

Retrieval-Augmented Generation (RAG) enhances the performance of Large Language Models (LLMs) by incorporating external knowledge. However, LLMs still encounter challenges in effectively utilizing the knowledge from retrieved documents, often being misled by irrelevant or noisy information. To address this issue, we introduce RankCoT, a knowledge refinement method that incorporates reranking signals in generating CoT-based summarization for knowledge refinement based on given query and all retrieval documents. During training, RankCoT prompts the LLM to generate Chain-of-Thought (CoT) candidates based on the query and individual documents. It then fine-tunes the LLM to directly reproduce the best CoT from these candidate outputs based on all retrieved documents, which requires LLM to filter out irrelevant documents during generating CoT-style summarization. Additionally, RankCoT incorporates a self-reflection mechanism that further refines the CoT outputs, resulting in higher-quality training data. Our experiments demonstrate the effectiveness of RankCoT, showing its superior performance over other knowledge refinement models. Further analysis reveals that RankCoT can provide shorter but effective refinement results, enabling the generator to produce more accurate answers. All code and data are available at https://github.com/NEUIR/RankCoT.

large language model, machine learning, natural language, (15 more...)

arXiv.org Artificial Intelligence

2502.17888

Country:

Oceania > Australia > Western Australia (0.14)
Indian Ocean (0.05)
Pacific Ocean > South Pacific Ocean > Coral Sea (0.04)
(13 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)

Add feedback

From Perceptions to Decisions: Wildfire Evacuation Decision Prediction with Behavioral Theory-informed LLMs

Chen, Ruxiao, Wang, Chenguang, Sun, Yuran, Zhao, Xilei, Xu, Susu

arXiv.org Artificial IntelligenceFeb-24-2025

Evacuation decision prediction is critical for efficient and effective wildfire response by helping emergency management anticipate traffic congestion and bottlenecks, allocate resources, and minimize negative impacts. Traditional statistical methods for evacuation decision prediction fail to capture the complex and diverse behavioral logic of different individuals. In this work, for the first time, we introduce FLARE, short for facilitating LLM for advanced reasoning on wildfire evacuation decision prediction, a Large Language Model (LLM)-based framework that integrates behavioral theories and models to streamline the Chain-of-Thought (CoT) reasoning and subsequently integrate with memory-based Reinforcement Learning (RL) module to provide accurate evacuation decision prediction and understanding. Our proposed method addresses the limitations of using existing LLMs for evacuation behavioral predictions, such as limited survey data, mismatching with behavioral theory, conflicting individual preferences, implicit and complex mental states, and intractable mental state-behavior mapping. Experiments on three post-wildfire survey datasets show an average of 20.47% performance improvement over traditional theory-informed behavioral models, with strong cross-event generalizability. Our complete code is publicly available at https://github.com/SusuXu-s-Lab/FLARE

evacuate 0, evacuation decision, prediction, (11 more...)

arXiv.org Artificial Intelligence

2502.17701

Country:

Asia > Thailand > Bangkok > Bangkok (0.04)
North America > United States > Colorado > Boulder County (0.04)
North America > United States > California > Sonoma County (0.04)
(5 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.46)

Industry:

Law Enforcement & Public Safety > Fire & Emergency Services (0.46)
Information Technology (0.30)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback