AITopics

2504.00877

Country:

South America > Brazil > São Paulo (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
Europe > Poland > Lower Silesia Province > Wroclaw (0.04)
(6 more...)

Genre: Research Report > New Finding (1.00)

Industry: Government > Regional Government (0.67)

Technology: Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.68)

arXiv.org Artificial Intelligence Apr-2-2025

CrossFormer: Cross-Segment Semantic Fusion for Document Segmentation

Ni, Tongke, Fan, Yang, Zhou, Junru, Wu, Xiangping, Chen, Qingcai

Text semantic segmentation involves partitioning a document into multiple paragraphs with continuous semantics based on the subject matter, contextual information, and document structure. Traditional approaches have typically relied on preprocessing documents into segments to address input length constraints, resulting in the loss of critical semantic information across segments. To address this, we present CrossFormer, a transformer-based model featuring a novel cross-segment fusion module that dynamically models latent semantic dependencies across document segments, substantially elevating segmentation accuracy. Additionally, CrossFormer can replace rule-based chunk methods within the Retrieval-Augmented Generation (RAG) system, producing more semantically coherent chunks that enhance its efficacy. Comprehensive evaluations confirm CrossFormer's state-of-the-art performance on public text semantic segmentation datasets, alongside considerable gains on RAG benchmarks.

large language model, machine learning, natural language, (19 more...)

2503.23671

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
South America > Colombia > Bolivar Department > Cartagena (0.04)
North America > United States > New Mexico > Doña Ana County > Las Cruces (0.04)
(12 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.89)

Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving

Jin, Hyoungwook, Kim, Yoonsu, Jung, Dongyun, Kim, Seungju, Choi, Kiyoon, Son, Jinho, Kim, Juho

Mathematics learning entails mastery of both content knowledge and cognitive processing of knowing, applying, and reasoning with it. Automated math assessment has primarily focused on grading students' exhibition of content knowledge by finding textual evidence, such as specific numbers, formulas, and statements. Recent advancements in problem-solving, image recognition, and reasoning capabilities of large language models (LLMs) show promise for nuanced evaluation of students' cognitive skills. Diagnosing cognitive skills requires inferring students' thinking processes beyond textual evidence, which is an underexplored task in LLM-based automated assessment. In this work, we investigate how state-of-the-art LLMs diagnose students' cognitive skills in mathematics. We constructed MathCog, a novel benchmark dataset comprising 639 student responses to 110 expert-curated middle school math problems, each annotated with detailed teachers' diagnoses based on cognitive skill checklists. Using MathCog, we evaluated 16 closed and open LLMs of varying model sizes and vendors. Our evaluation reveals that even the state-of-the-art LLMs struggle with the task, with all F1 scores below 0.5, and tend to exhibit strong false confidence for incorrect cases ($r_s=.617$). We also found that model size positively correlates with the diagnosis performance ($r_s=.771$). Finally, we discuss the implications of these findings, the overconfidence issue, and directions for improving automated cognitive skill diagnosis.

large language model, machine learning, natural language, (17 more...)

2504.00843

Country:

South America > Argentina > Pampas > Buenos Aires F.D. > Buenos Aires (0.04)
North America > United States > New Jersey > Mercer County > Princeton (0.04)
Asia > South Korea > Seoul > Seoul (0.04)
(2 more...)

Genre: Research Report > New Finding (0.46)

Industry:

Education > Curriculum (0.93)
Education > Educational Setting > K-12 Education (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Cognitive Science (1.00)

Preconditioned Additive Gaussian Processes with Fourier Acceleration

Wagner, Theresa, Xu, Tianshi, Nestler, Franziska, Xi, Yuanzhe, Stoll, Martin

Gaussian processes (GPs) are crucial in machine learning for quantifying uncertainty in predictions. However, their associated covariance matrices, defined by kernel functions, are typically dense and large-scale, posing significant computational challenges. This paper introduces a matrix-free method that utilizes the Non-equispaced Fast Fourier Transform (NFFT) to achieve nearly linear complexity in the multiplication of kernel matrices and their derivatives with vectors for a predetermined accuracy level. To address high-dimensional problems, we propose an additive kernel approach. Each sub-kernel in this approach captures lower-order feature interactions, allowing for the efficient application of the NFFT method and potentially increasing accuracy across various real-world datasets. Additionally, we implement a preconditioning strategy that accelerates hyperparameter tuning, further improving the efficiency and effectiveness of GPs.

artificial intelligence, kernel, machine learning, (14 more...)

2504.0048

Country:

South America > Paraguay > Asunción > Asunción (0.04)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.04)
North America > United States > Georgia > Fulton County > Atlanta (0.04)
(3 more...)

Genre: Research Report > New Finding (0.46)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)

Bourrée, Jade Garcia, Lautraite, Hadrien, Gambs, Sébastien, Tredan, Gilles, Merrer, Erwan Le, Rottembourg, Benoît

P2NIA: Privacy-Preserving Non-Iterative Auditing

The emergence of AI legislation has increased the need to assess the ethical compliance of high-risk AI systems. Traditional auditing methods rely on platforms' application programming interfaces (APIs), where responses to queries are examined through the lens of fairness requirements. However, such approaches put a significant burden on platforms, as they are forced to maintain APIs while ensuring privacy, facing the possibility of data leaks. This lack of proper collaboration between the two parties, in turn, causes a significant challenge to the auditor, who is subject to estimation bias as they are unaware of the data distribution of the platform. To address these two issues, we present P2NIA, a novel auditing scheme that proposes a mutually beneficial collaboration for both the auditor and the platform. Extensive experiments demonstrate P2NIA's effectiveness in addressing both issues. In summary, our work introduces a privacy-preserving and non-iterative audit scheme that enhances fairness assessments using synthetic or local data, avoiding the challenges associated with traditional API-based audits.

artificial intelligence, data mining, machine learning, (19 more...)

2504.00874

Country:

Europe (0.14)
South America > Argentina (0.04)
North America > United States > New York (0.04)
(2 more...)

Genre: Research Report (0.82)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Data Science > Data Mining > Big Data (0.62)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.46)

Heidrich, Mario, Heidemann, Jeffrey, Buchkremer, Rüdiger, de Bobadilla, Gonzalo Wandosell Fernández

ffstruc2vec: Flat, Flexible and Scalable Learning of Node Representations from Structural Identities

These embeddings can be leveraged in various downstream tasks, including node classification, link prediction, clustering, exploratory data analysis, and network visualization. The method has found broad application across diverse domains, such as fraud detection in financial networks (van Belle et al. 2023), friendship recommendation and bot detection in social networks (Saxena et al. 2022; Dehghan et al. 2023), knowledge discovery in knowledge graphs (Egami et al. 2023; Liu et al. 2023), analysis of biological networks (Jiang et al. 2021; Pasquier et al. 2023), and fake review detection on online platforms (Zaki et al. 2024). A key challenge in Node Embedding is developing a scalable method for preserving the structural properties of nodes suitable for the required structural patterns of a downstream application task. The type of structural patterns in which a node is embedded within the graph can vary depending on the role or function of the node in a specific application task. For instance, fraudulent activities such as money laundering can be embedded in particular money flow patterns among illicit entities, resulting in characteristic structural patterns within the financial transaction network, such as suspicious cyclic transaction chains (Granados Vargas 2022). These structural patterns differ significantly from those observed in social networks, where specific roles such as bridge and core nodes define the network's connectivity and influence (Huang et al. 2014). As Node Embedding methods cannot preserve all types of structural patterns simultaneously, they must align with the requirements of a specific application task when defining types of structural identities.

data mining, machine learning, node, (19 more...)

2504.01122

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
(6 more...)

Genre:

Overview (0.46)
Research Report (0.40)

Industry:

Information Technology (0.88)
Law Enforcement & Public Safety > Fraud (0.86)

Technology:

Information Technology > Information Management (1.00)
Information Technology > Data Science > Data Mining (1.00)
Information Technology > Communications > Social Media (1.00)
(4 more...)

Self-Routing RAG: Binding Selective Retrieval with Knowledge Verbalization

Wu, Di, Gu, Jia-Chen, Chang, Kai-Wei, Peng, Nanyun

Selective retrieval improves retrieval-augmented generation (RAG) by reducing distractions from low-quality retrievals and improving efficiency. However, existing approaches under-utilize the inherent knowledge of large language models (LLMs), leading to suboptimal retrieval decisions and degraded generation performance. To bridge this gap, we propose Self-Routing RAG (SR-RAG), a novel framework that binds selective retrieval with knowledge verbalization. SR-RAG enables an LLM to dynamically decide between external retrieval and verbalizing its own parametric knowledge. To this end, we design a multi-task objective that jointly optimizes an LLM on knowledge source selection, knowledge verbalization, and response generation. We further introduce dynamic knowledge source inference via nearest neighbor search to improve the accuracy of knowledge source decision under domain shifts. Fine-tuning three LLMs with SR-RAG significantly improves both their response accuracy and inference latency. Compared to the strongest selective retrieval baseline, SR-RAG reduces retrievals by 29% while improving the performance by 5.1%.

large language model, machine learning, natural language, (19 more...)

2504.01018

Country:

North America > Haiti (0.46)
Europe > Austria > Vienna (0.14)
Europe > France (0.14)
(21 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.36)

Flamich, Gergely, Vilar, David, Peter, Jan-Thorsten, Freitag, Markus

You Cannot Feed Two Birds with One Score: the Accuracy-Naturalness Tradeoff in Translation

The goal of translation, be it by human or by machine, is, given some text in a source language, to produce text in a target language that simultaneously 1) preserves the meaning of the source text and 2) achieves natural expression in the target language. However, researchers in the machine translation community usually assess translations using a single score intended to capture semantic accuracy and the naturalness of the output simultaneously. In this paper, we build on recent advances in information theory to mathematically prove and empirically demonstrate that such single-score summaries do not and cannot give the complete picture of a system's true performance. Concretely, we prove that a tradeoff exists between accuracy and naturalness and demonstrate it by evaluating the submissions to the WMT24 shared task. Our findings help explain well-known empirical phenomena, such as the observation that optimizing translation systems for a specific accuracy metric (like BLEU) initially improves the system's naturalness, while "overfitting" the system to the metric can significantly degrade its naturalness. Thus, we advocate for a change in how translations are evaluated: rather than comparing systems using a single number, they should be compared on an accuracy-naturalness plane.

machine learning, natural language, translation, (17 more...)

2503.24013

Country:

Asia > Middle East > Jordan (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
Asia > Middle East > UAE > Abu Dhabi Emirate > Abu Dhabi (0.04)
(17 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Kulothungan, Vikram, Gupta, Deepti

Towards Adaptive AI Governance: Comparative Insights from the U.S., EU, and Asia

Artificial intelligence (AI) trends vary significantly across global regions, shaping the trajectory of innovation, regulation, and societal impact. This variation influences how different regions approach AI development, balancing technological progress with ethical and regulatory considerations. This study conducts a comparative analysis of AI trends in the United States (US), the European Union (EU), and Asia, focusing on three key dimensions: generative AI, ethical oversight, and industrial applications. The US prioritizes market-driven innovation with minimal regulatory constraints, the EU enforces a precautionary risk-based framework emphasizing ethical safeguards, and Asia employs state-guided AI strategies that balance rapid deployment with regulatory oversight. Although these approaches reflect different economic models and policy priorities, their divergence poses challenges to international collaboration, regulatory harmonization, and the development of global AI standards. To address these challenges, this paper synthesizes regional strengths to propose an adaptive AI governance framework that integrates risk-tiered oversight, innovation accelerators, and strategic alignment mechanisms. By bridging governance gaps, this study offers actionable insights for fostering responsible AI development while ensuring a balance between technological progress, ethical imperatives, and regulatory coherence. Artificial intelligence (AI) has emerged as a transformative force in the 21st century, reshaping industries, governance structures, and societal interactions at an unprecedented pace. From generative AI creating human-like text and images to autonomous systems revolutionizing healthcare, finance, and manufacturing, AI's influence is profound and far-reaching.

governance, machine learning, natural language, (18 more...)

2504.00652

Country:

Europe (0.50)
Asia > China (0.07)
Asia > South Korea (0.05)
(7 more...)

Genre:

Research Report (1.00)
Overview (0.68)

Industry:

Law (1.00)
Information Technology > Security & Privacy (1.00)
Government > Regional Government > Europe Government (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.55)

Role and Use of Race in AI/ML Models Related to Health

Were, Martin C., Li, Ang, Malin, Bradley A., Yin, Zhijun, Coco, Joseph R., Collins, Benjamin X., Clayton, Ellen Wright, Novak, Laurie L., Hendricks-Sturrup, Rachele, Oluyomi, Abiodun, Anders, Shilo, Yan, Chao

The role and use of race within health-related artificial intelligence and machine learning (AI/ML) models has sparked increasing attention and controversy. Despite the complexity and breadth of related issues, a robust and holistic framework to guide stakeholders in their examination and resolution remains lacking. This perspective provides a broad-based, systematic, and cross-cutting landscape analysis of race-related challenges, structured around the AI/ML lifecycle and framed through "points to consider" to support inquiry and decision-making.

INTRODUCTION

The role and use of the social construct of race within health-related artificial intelligence and machine learning (AI/ML) models has become a subject of increased attention and controversy. As noted in the National Academies' recent report "Ending Unequal Treatment", it is increasingly clear that race in all its complexity is a powerful predictor of unequal treatment and health care outcomes.

ai ml model, artificial intelligence, machine learning, (14 more...)

2504.00899

Country:

North America > United States > District of Columbia > Washington (0.14)
North America > United States > Tennessee > Davidson County > Nashville (0.05)
South America > Brazil (0.04)
(6 more...)

Genre: Research Report (0.50)

Industry:

Health & Medicine > Health Care Technology (1.00)
Health & Medicine > Health Care Providers & Services (1.00)
Health & Medicine > Therapeutic Area > Cardiology/Vascular Diseases (0.94)
(2 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)