AITopics

2503.0655

Country:

North America > United States > California > Los Angeles County > Los Angeles (0.14)
Asia > China (0.04)
Oceania > Australia (0.04)
North America > United States > Connecticut (0.04)

Genre:

Instructional Material (1.00)
Research Report (0.82)

Industry:

Media > News (1.00)
Law > Criminal Law (1.00)
Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Scarlatos, Alexander, Liu, Naiming, Lee, Jaewook, Baraniuk, Richard, Lan, Andrew

Training LLM-based Tutors to Improve Student Learning Outcomes in Dialogues

Recent advances in generative artificial intelligence (AI), including large language models (LLMs), have opened new possibilities in education, in particular for scaling up personalization. One form of personalization that generative AI powers is interactive learning via tutoring dialogues between AI-powered tutors and students. These interactions have the potential to tailor instruction to each student's needs and progress, while offering personalized feedback, all in real time, in a scalable way. Given the widespread success of human tutors for improving student outcomes [29], many recent works have developed LLM-based tutors, showing promise across various educational domains [15, 25, 30, 32, 33, 39, 42, 50]. Many LLM-based tutors are even deployed in practice, such as Khan Academy's Khanmigo [21] and Carnegie Learning's LiveHint [4]. Several preliminary studies have shown that interacting with LLMs can increase student learning [52], although some have shown that students can develop an over-reliance on LLMs, which negatively impacts their learning [23]. Many prior works have focused on improving LLMs' ability to follow effective tutoring principles, adapting them to the tutoring task, for which they are not pre-trained. One approach, explored in [46], analyzes the decision-making process underlying human tutor utterances, showing that integrating expert decisions enhances LLM-based tutoring. Another study, [28], examines tutor moves in interactions with an LLM-powered simulated student agent, demonstrating that move annotation data contributes to better tutoring performance.

dialogue, student, tutor utterance, (10 more...)

2503.06424

Country:

North America > United States > Massachusetts > Hampshire County > Amherst (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)
North America > Mexico > Mexico City > Mexico City (0.04)

Genre:

Research Report > New Finding (0.68)
Instructional Material > Course Syllabus & Notes (0.52)

Industry:

Education > Educational Technology > Educational Software > Computer Based Training (1.00)
Education > Educational Setting (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.68)

Heterogeneous bimodal attention fusion for speech emotion recognition

Luo, Jiachen, Phan, Huy, Wang, Lin, Reiss, Joshua

heterogeneous bimodal attention fusion, speech emotion recognition

Multi-modal emotion recognition in conversations is a challenging problem due to the complex and complementary interactions between different modalities. Audio and textual cues are particularly important for understanding emotions from a human perspective. Most existing studies focus on exploring interactions between audio and text modalities at the same representation level. However, a critical issue is often overlooked: the heterogeneous modality gap between low-level audio representations and high-level text representations. To address this problem, we propose a novel framework called Heterogeneous Bimodal Attention Fusion (HBAF) for multi-level multi-modal interaction in conversational emotion recognition. The proposed method comprises three key modules: the uni-modal representation module, the multi-modal fusion module, and the inter-modal contrastive learning module. The uni-modal representation module incorporates contextual content into low-level audio representations to bridge the heterogeneous multi-modal gap, enabling more effective fusion. The multi-modal fusion module uses dynamic bimodal attention and a dynamic gating mechanism to filter incorrect cross-modal relationships and fully exploit both intra-modal and inter-modal interactions. Finally, the inter-modal contrastive learning module captures complex absolute and relative interactions between audio and text modalities. Experiments on the MELD and IEMOCAP datasets demonstrate that the proposed HBAF method outperforms existing state-of-the-art baselines.
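To make the idea of a gating mechanism in audio-text fusion concrete, here is a minimal sketch of a generic scalar gate that blends two feature vectors. It is not the HBAF architecture from the abstract: the function names, the single scalar gate, and the hand-set weights are all assumptions chosen for illustration, and a real model would learn the gate parameters and typically use a deep learning framework.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def gated_fusion(audio, text, gate_weights, gate_bias=0.0):
    """Blend two same-length feature vectors with one scalar gate.

    Hypothetical illustration only (not the paper's module):
        g     = sigmoid(w . [audio; text] + b)
        fused = g * audio + (1 - g) * text
    """
    if len(audio) != len(text):
        raise ValueError("audio and text vectors must have the same length")
    concat = list(audio) + list(text)
    if len(gate_weights) != len(concat):
        raise ValueError("gate_weights must match the concatenated length")
    score = sum(w * x for w, x in zip(gate_weights, concat)) + gate_bias
    g = sigmoid(score)
    fused = [g * a + (1.0 - g) * t for a, t in zip(audio, text)]
    return fused, g


# Toy features; with all-zero gate parameters the gate is exactly 0.5,
# so the fused vector is the element-wise average of the two inputs.
audio_vec = [1.0, 3.0]
text_vec = [3.0, 1.0]
fused_vec, gate = gated_fusion(audio_vec, text_vec, [0.0] * 4)
```

A gate near 1 passes mostly audio features and a gate near 0 passes mostly text features, which is the basic way a gate can down-weight an unreliable modality.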

2503.06405

Genre:

Instructional Material > Course Syllabus & Notes (0.44)
Research Report (0.40)

Technology: Information Technology > Artificial Intelligence > Cognitive Science > Emotion (0.80)

Schoepf, Stefan, Hameed, Muhammad Zaid, Rawat, Ambrish, Fraser, Kieran, Zizzo, Giulio, Cornacchia, Giandomenico, Purcell, Mark

MAD-MAX: Modular And Diverse Malicious Attack MiXtures for Automated LLM Red Teaming

With LLM usage rapidly increasing, their vulnerability to jailbreaks that create harmful outputs is a major security risk. As new jailbreaking strategies emerge and models are changed by fine-tuning, continuous testing for security vulnerabilities is necessary. Existing Red Teaming methods fall short in cost efficiency, attack success rate, attack diversity, or extensibility as new attack types emerge. We address these challenges with Modular And Diverse Malicious Attack MiXtures (MAD-MAX) for Automated LLM Red Teaming. MAD-MAX uses automatic assignment of attack strategies into relevant attack clusters, chooses the most relevant clusters for a malicious goal, and then combines strategies from the selected clusters to achieve diverse novel attacks with high attack success rates. MAD-MAX further merges promising attacks together at each iteration of Red Teaming to boost performance and introduces a similarity filter to prune out similar attacks for increased cost efficiency. The MAD-MAX approach is designed to be easily extensible with newly discovered attack strategies and outperforms the prominent Red Teaming method Tree of Attacks with Pruning (TAP) significantly in terms of Attack Success Rate (ASR) and queries needed to achieve jailbreaks. MAD-MAX jailbreaks 97% of malicious goals in our benchmarks on GPT-4o and Gemini-Pro compared to TAP with 66%. MAD-MAX does so with only 10.9 average queries to the target LLM compared to TAP with 23.3. WARNING: This paper contains content that is offensive in nature.

jailbreak, language model, mad-max, (11 more...)

2503.06253

Country:

Europe > Austria > Vienna (0.14)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report (0.82)
Instructional Material (0.70)
Workflow (0.68)

Industry:

Information Technology > Security & Privacy (1.00)
Government (1.00)

Technology:

Information Technology > Security & Privacy (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

A Survey on Post-training of Large Language Models

Tie, Guiyao, Zhao, Zeli, Song, Dingjie, Wei, Fuyang, Zhou, Rong, Dai, Yurou, Yin, Wen, Yang, Zhejian, Yan, Jiangyue, Su, Yao, Dai, Zhenhan, Xie, Yifeng, Cao, Yihan, Sun, Lichao, Zhou, Pan, He, Lifang, Chen, Hechang, Zhang, Yu, Wen, Qingsong, Liu, Tianming, Gong, Neil Zhenqiang, Tang, Jiliang, Xiong, Caiming, Ji, Heng, Yu, Philip S., Gao, Jianfeng

group relative policy optimization, point response positive ai reference, point response positive human reference, (12 more...)

The emergence of Large Language Models (LLMs) has fundamentally transformed natural language processing, making them indispensable across domains ranging from conversational systems to scientific exploration. However, their pre-trained architectures often reveal limitations in specialized contexts, including restricted reasoning capacities, ethical uncertainties, and suboptimal domain-specific performance. These challenges necessitate advanced post-training language models (PoLMs) to address these shortcomings, such as OpenAI-o1/o3 and DeepSeek-R1 (collectively known as Large Reasoning Models, or LRMs). This paper presents the first comprehensive survey of PoLMs, systematically tracing their evolution across five core paradigms: Fine-tuning, which enhances task-specific accuracy; Alignment, which ensures alignment with human preferences; Reasoning, which advances multi-step inference despite challenges in reward design; Efficiency, which optimizes resource utilization amidst increasing complexity; and Integration and Adaptation, which extend capabilities across diverse modalities while addressing coherence issues. Charting progress from ChatGPT's foundational alignment strategies to DeepSeek-R1's innovative reasoning advancements, we illustrate how PoLMs leverage datasets to mitigate biases, deepen reasoning capabilities, and enhance domain adaptability. Our contributions include a pioneering synthesis of PoLM evolution, a structured taxonomy categorizing techniques and datasets, and a strategic agenda emphasizing the role of LRMs in improving reasoning proficiency and domain flexibility. As the first survey of its scope, this work consolidates recent PoLM advancements and establishes a rigorous intellectual framework for future research, fostering the development of LLMs that excel in precision, ethical robustness, and versatility across scientific and societal applications.

2503.06072

Country:

Asia > Myanmar > Tanintharyi Region > Dawei (0.04)
Asia > Middle East > Jordan (0.04)
North America > United States > Oregon > Multnomah County > Portland (0.04)

Genre:

Workflow (1.00)
Overview (1.00)
Instructional Material (1.00)
Research Report > Promising Solution (0.92)

Industry:

Law (0.93)
Education > Educational Setting (0.67)
Leisure & Entertainment > Sports (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.45)

arXiv.org Artificial Intelligence, Mar-7-2025

QG-SMS: Enhancing Test Item Analysis via Student Modeling and Simulation

Nguyen, Bang, Du, Tingting, Yu, Mengxia, Angrave, Lawrence, Jiang, Meng

While the Question Generation (QG) task has been increasingly adopted in educational assessments, its evaluation remains limited by approaches that lack a clear connection to the educational values of test items. In this work, we introduce test item analysis, a method frequently used by educators to assess test question quality, into QG evaluation. Specifically, we construct pairs of candidate questions that differ in quality across dimensions such as topic coverage, item difficulty, item discrimination, and distractor efficiency. We then examine whether existing QG evaluation approaches can effectively distinguish these differences. Our findings reveal significant shortcomings in these approaches with respect to accurately assessing test item quality in relation to student performance. To address this gap, we propose a novel QG evaluation framework, QG-SMS, which leverages Large Language Models for Student Modeling and Simulation to perform test item analysis. As demonstrated in our extensive experiments and human evaluation study, the additional perspectives introduced by the simulated student profiles lead to a more effective and robust assessment of test items.
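Two of the dimensions named above, item difficulty and item discrimination, have standard definitions in classical test item analysis. The sketch below computes them from a table of scored responses. It is not the QG-SMS framework: the function names, the 27% upper/lower group split, and the toy data are assumptions for illustration, and in QG-SMS the responses would come from simulated student profiles rather than a hand-written list.

```python
from statistics import mean


def item_difficulty(responses):
    """Proportion of students who answered the item correctly.

    `responses` holds 1 for a correct answer and 0 for an incorrect one.
    A higher value means an easier item.
    """
    return mean(responses)


def item_discrimination(responses, total_scores, fraction=0.27):
    """Upper-lower discrimination index for one item.

    Students are ranked by total test score; the index is the item's
    proportion correct in the top group minus that in the bottom group.
    The 27% group size is a common convention, assumed here.
    """
    n = len(responses)
    if n != len(total_scores):
        raise ValueError("responses and total_scores must align")
    k = max(1, round(n * fraction))
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower, upper = order[:k], order[-k:]
    p_upper = mean(responses[i] for i in upper)
    p_lower = mean(responses[i] for i in lower)
    return p_upper - p_lower


# Toy data for ten hypothetical students on a single item.
item_responses = [1, 1, 1, 0, 1, 0, 0, 1, 0, 0]
test_totals = [9, 8, 8, 7, 6, 5, 4, 3, 2, 1]

difficulty = item_difficulty(item_responses)
discrimination = item_discrimination(item_responses, test_totals)
```

An index close to 1 means the item separates high and low scorers well, while a value near 0 or below suggests the item does not track overall performance.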

computational linguistic, discrimination, student, (14 more...)

2503.05888

Country:

North America > Canada > Ontario > Toronto (0.14)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida > Miami-Dade County > Miami (0.04)

Genre:

Research Report > New Finding (0.34)
Instructional Material > Course Syllabus & Notes (0.31)

Industry:

Education > Educational Technology > Educational Software (0.61)
Education > Assessment & Standards > Student Performance (0.51)

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.73)

arXiv.org Artificial Intelligence, Mar-7-2025

Robust Multimodal Learning for Ophthalmic Disease Grading via Disentangled Representation

Wang, Xinkun, Wang, Yifang, Liang, Senwei, Tang, Feilong, Liu, Chengzhi, Hu, Ming, Hu, Chao, He, Junjun, Ge, Zongyuan, Razzak, Imran

This paper discusses how ophthalmologists often rely on multimodal data to improve diagnostic accuracy. However, complete multimodal data is rare in real-world applications due to a lack of medical equipment and concerns about data privacy. Traditional deep learning methods typically address these issues by learning representations in latent space. However, the paper highlights two key limitations of these approaches: (i) Task-irrelevant redundant information (e.g., numerous slices) in complex modalities leads to significant redundancy in latent space representations. (ii) Overlapping multimodal representations make it difficult to extract unique features for each modality. To overcome these challenges, the authors propose the Essence-Point and Disentangle Representation Learning (EDRL) strategy, which integrates a self-distillation mechanism into an end-to-end framework to enhance feature selection and disentanglement for more robust multimodal learning. Specifically, the Essence-Point Representation Learning module selects discriminative features that improve disease grading performance. The Disentangled Representation Learning module separates multimodal data into modality-common and modality-unique representations, reducing feature entanglement and enhancing both robustness and interpretability in ophthalmic disease diagnosis. Experiments on multimodal ophthalmology datasets show that the proposed EDRL strategy significantly outperforms current state-of-the-art methods.

information, modality, representation, (14 more...)

2503.05319

Country:

Asia > China > Shanghai > Shanghai (0.04)
Oceania > Australia (0.04)
Europe > United Kingdom (0.04)
Asia > Middle East > UAE (0.04)

Genre:

Instructional Material > Course Syllabus & Notes (0.44)
Research Report > Promising Solution (0.34)

Industry: Health & Medicine > Therapeutic Area > Ophthalmology/Optometry (1.00)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

arXiv.org Artificial Intelligence, Mar-7-2025

Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots

Ye, Linqi, Li, Rankun, Hu, Xiaowen, Li, Jiayi, Xing, Boyang, Peng, Yan, Liang, Bin

This paper introduces Unity RL Playground, an open-source reinforcement learning framework built on top of Unity ML-Agents. Unity RL Playground automates the process of training mobile robots to perform various locomotion tasks such as walking, running, and jumping in simulation, with the potential for seamless transfer to real hardware. Key features include one-click training for imported robot models, universal compatibility with diverse robot configurations, multi-mode motion learning capabilities, and extreme performance testing to aid in robot design optimization and morphological evolution. The attached video can be found at https://linqi-ye.github.io/video/iros25.mp4 and the code is coming soon.

module, robot, unity rl playground, (11 more...)

2503.05146

Country:

Asia > China > Shanghai > Shanghai (0.05)
Asia > China > Beijing > Beijing (0.04)

Genre:

Research Report (0.64)
Instructional Material (0.48)

Industry: Education (0.96)

Technology: Information Technology > Artificial Intelligence > Robots > Locomotion (1.00)

WIRED, Mar-6-2025, 13:34:39 GMT

The US Army Is Using 'CamoGPT' to Purge DEI From Training Materials

The United States Army is employing a prototype generative artificial intelligence tool to identify references to diversity, equity, inclusion, and accessibility (DEIA) for removal from training materials in line with a recent executive order from President Donald Trump. Officials at the Army's Training and Doctrine Command (TRADOC)--the major command responsible for training soldiers, developing leaders, and shaping the service's guidelines, strategies, and concepts--are currently using the AI tool, dubbed CamoGPT, to "review policies, programs, publications, and initiatives for DEIA and report findings," according to an internal memo reviewed by WIRED. The memo followed Trump's signing of a January 27 executive order entitled "Restoring America's Fighting Force," which directed Defense Secretary Pete Hegseth to eliminate all Pentagon policies seen as promoting what the commander-in-chief declared "un-American, divisive, discriminatory, radical, extremist, and irrational theories" regarding race and gender, a linguistic dragnet that extends as far as past social media posts from official US military accounts. Chris Robinson confirmed the use of CamoGPT to review DEIA materials. "[TRADOC] will fully execute and implement all directives outlined in the Executive Orders issued by the President. We ensure that these directives are carried out with the utmost professionalism, efficiency, and in alignment with national security objectives," Robinson says.

camogpt, machine learning, natural language, (15 more...)

WIRED

Country: North America > United States > District of Columbia > Washington (0.06)

Genre: Instructional Material (0.62)

Industry:

Government > Regional Government > North America Government > United States Government (1.00)
Government > Military > Army (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.37)

arXiv.org Artificial Intelligence, Mar-6-2025

Artificial Intelligence in Pronunciation Teaching: Use and Beliefs of Foreign Language Teachers

Georgiou, Georgios P.

Pronunciation instruction in foreign language classrooms has often been an overlooked area of focus. With the widespread adoption of Artificial Intelligence (AI) and its potential benefits, investigating how AI is utilized in pronunciation teaching and understanding the beliefs of teachers about this tool is essential for improving learning outcomes. This study aims to examine how AI use for pronunciation instruction varies across different demographic and professional factors among teachers, and how these factors, including AI use, influence the beliefs of teachers about AI. The study involved 117 English as a Foreign Language (EFL) in-service teachers working in Cyprus, who completed an online survey designed to assess their beliefs about the effectiveness of AI, its drawbacks, and their willingness to integrate AI into their teaching practices. The results revealed that teachers were significantly more likely to agree on the perceived effectiveness of AI and their willingness to adopt it, compared to their concerns about its use. Furthermore, teachers working in higher education and adult education, as well as those who had received more extensive training, reported using AI more frequently in their teaching. Teachers who utilized AI more often expressed stronger agreement with its effectiveness, while those who had received more training were less likely to express concerns about its integration. Given the limited training that many teachers currently receive, these findings demonstrate the need for tailored training sessions that address the specific needs and concerns of educators, ultimately fostering the adoption of AI in pronunciation instruction.

artificial intelligence, machine learning, natural language, (13 more...)

2503.04128

Country:

Europe > Middle East > Cyprus > Nicosia > Nicosia (0.04)
Europe > Slovakia (0.04)
Asia > China > Hong Kong (0.04)
Africa > Ghana (0.04)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
Instructional Material (1.00)

Industry:

Information Technology > Security & Privacy (1.00)
Education > Educational Setting > K-12 Education (1.00)
Education > Educational Setting > Higher Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (0.68)