AITopics

doi: 10.1016/j.imavis.2025.105495

2504.13186

Country:

Asia > Middle East (0.67)
Africa > Middle East > Algeria (0.28)
North America > United States > Texas (0.27)

Genre:

Research Report > Promising Solution (1.00)
Research Report > New Finding (1.00)
Overview (1.00)
Research Report > Experimental Study (0.92)

Industry:

Health & Medicine > Diagnostic Medicine (1.00)
Health & Medicine > Therapeutic Area > Oncology > Lung Cancer (0.47)
Health & Medicine > Therapeutic Area > Oncology > Breast Cancer (0.32)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Artificial IntelligenceApr-21-2025

Factors That Influence the Adoption of AI-enabled Conversational Agents (AICAs) as an Augmenting Therapeutic Tool by Frontline Healthcare Workers: From Technology Acceptance Model 3 (TAM3) Lens -- A Systematic Mapping Review

AlMakinah, Rawan

Artificial intelligent (AI) conversational agents hold a promising future in the field of mental health, especially in helping marginalized communities that lack access to mental health support services. It is tempting to have a 24/7 mental health companion that can be accessed anywhere using mobile phones to provide therapist-like advice. Yet, caution should be taken, and studies around their feasibility need to be surveyed. Before adopting such a rapidly changing technology, studies on its feasibility should be explored, summarized, and synthesized to gain a solid understanding of the status quo and to enable us to build a framework that can guide us throughout the development and deployment processes. Different perspectives must be considered when investigating the feasibility of AI conversational agents, including the mental healthcare professional perspective. The literature can provide insights into their perspectives in terms of opportunities, concerns, and implications. Mental health professionals, the subject-matter experts in this field, have their points of view that should be understood and considered. This systematic literature review will explore mental health practitioners' attitudes toward AI conversational agents and the factors that affect their adoption and recommendation of the technology to augment their services and treatments. The TAM3 Framework will be the lens through which this systematic literature review will be conducted.

ai technology, artificial intelligence, natural language, (15 more...)

2504.13183

Country: North America > United States (0.28)

Genre:

Overview (1.00)
Research Report > New Finding (0.47)

Industry:

Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (1.00)
Health & Medicine > Health Care Providers & Services (1.00)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Personal Assistant Systems (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Applied AI (1.00)

arXiv.org Machine LearningApr-20-2025

Knowledge Distillation and Dataset Distillation of Large Language Models: Emerging Trends, Challenges, and Future Directions

Fang, Luyang, Yu, Xiaowei, Cai, Jiazhang, Chen, Yongkai, Wu, Shushan, Liu, Zhengliang, Yang, Zhenyuan, Lu, Haoran, Gong, Xilin, Liu, Yufang, Ma, Terry, Ruan, Wei, Abbasi, Ali, Zhang, Jing, Wang, Tao, Latif, Ehsan, Liu, Wei, Zhang, Wei, Kolouri, Soheil, Zhai, Xiaoming, Zhu, Dajiang, Zhong, Wenxuan, Liu, Tianming, Ma, Ping

The exponential growth of Large Language Models (LLMs) continues to highlight the need for efficient strategies to meet ever-expanding computational and data demands. This survey provides a comprehensive analysis of two complementary paradigms: Knowledge Distillation (KD) and Dataset Distillation (DD), both aimed at compressing LLMs while preserving their advanced reasoning capabilities and linguistic diversity. We first examine key methodologies in KD, such as task-specific alignment, rationale-based training, and multi-teacher frameworks, alongside DD techniques that synthesize compact, high-impact datasets through optimization-based gradient matching, latent space regularization, and generative synthesis. Building on these foundations, we explore how integrating KD and DD can produce more effective and scalable compression strategies. Together, these approaches address persistent challenges in model scalability, architectural heterogeneity, and the preservation of emergent LLM abilities. We further highlight applications across domains such as healthcare and education, where distillation enables efficient deployment without sacrificing performance. Despite substantial progress, open challenges remain in preserving emergent reasoning and linguistic diversity, enabling efficient adaptation to continually evolving teacher models and datasets, and establishing comprehensive evaluation protocols. By synthesizing methodological innovations, theoretical foundations, and practical insights, our survey charts a path toward sustainable, resource-efficient LLMs through the tighter integration of KD and DD principles.

distillation, large language model, machine learning, (16 more...)

arXiv.org Machine Learning

2504.14772

Country:

North America > United States > Texas > Tarrant County > Arlington (0.04)
North America > United States > California > Santa Clara County > Palo Alto (0.04)
North America > United States > Arizona (0.04)
Africa > Togo (0.04)

Genre:

Overview (1.00)
Research Report > Promising Solution (0.92)
Research Report > Experimental Study (0.67)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Education (1.00)
Information Technology > Security & Privacy (0.92)
Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Hegde, Akshata, Nguyen, Tom, Cheng, Jianlin

Machine Learning Methods for Gene Regulatory Network Inference

Proper regulation of gene expression is essential to ensure that genes are activated only when necessary and that their activity is properly controlled [3]. The regulation of gene expression is achieved through understanding the intricate interactions between genes and other molecules. In this effort, Gene Regulatory Networks have emerged as a strong tool[2]. Gene regulatory networks (GRNs) are complex systems that determine the development, differentiation, and function of cells and organisms, as well as their response to environmental stimuli [4][5]. GRNs consist of genes, transcription factors (TFs), microRNAs, and other regulatory molecules that interact with each other to control gene expression [6]. The regulatory interactions between these molecules can form complex networks that exhibit emergent properties, such as robustness and adaptability [7]. In its simplest form, a GRN is a network of genes and their regulatory interactions, which govern the expression of these genes in response to various cellular cues. It is worth noting that in this definition, a transcription factor (TF) is considered a special kind of gene that may regulate the expression of other non-TF or TF genes. Each gene in the network acts as a node, and the regulatory interactions between genes are represented by directed edges connecting these nodes[8].

artificial intelligence, evolutionary algorithm, machine learning, (20 more...)

2504.1261

Country: North America > United States > New York (0.28)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.46)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Evolutionary Systems (0.67)

NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement: Methods and Results

Li, Xin, Yuan, Kun, Li, Bingchen, Guan, Fengbin, Shao, Yizhen, Yu, Zihao, Wang, Xijun, Lu, Yiting, Luo, Wei, Yao, Suhang, Sun, Ming, Zhou, Chao, Chen, Zhibo, Timofte, Radu, Zhang, Yabin, Zhang, Ao-Xiang, Zhi, Tianwu, Liu, Jianzhao, Li, Yang, Xu, Jingwen, Liao, Yiting, Zuo, Yushen, Wu, Mingyang, Li, Renjie, Zhong, Shengyun, Tu, Zhengzhong, Liu, Yufan, Chen, Xiangguang, Cao, Zuowei, Tang, Minhao, Liu, Shan, Zhang, Kexin, Xie, Jingfen, Wang, Yan, Chen, Kai, Zhao, Shijie, Zhang, Yunchen, Xu, Xiangkai, Gao, Hong, Shi, Ji, Bao, Yiming, Dong, Xiugang, Zhou, Xiangsheng, Tu, Yaofeng, Liang, Ying, Wang, Yiwen, Chai, Xinning, Zhang, Yuxuan, Cheng, Zhengxue, Qin, Yingsheng, Yang, Yucai, Xie, Rong, Song, Li, Sun, Wei, Fu, Kang, Cao, Linhan, Zhu, Dandan, Zhang, Kaiwei, Zhu, Yucheng, Zhang, Zicheng, Hu, Menghan, Min, Xiongkuo, Zhai, Guangtao, Jin, Zhi, Wu, Jiawei, Wang, Wei, Zhang, Wenjian, Lan, Yuhai, Yi, Gaoxiong, Na, Hengyuan, Luo, Wang, Wu, Di, Bai, MingYin, Du, Jiawang, Lu, Zilong, Jiang, Zhenyu, Zeng, Hui, Cui, Ziguan, Gan, Zongliang, Tang, Guijin, Xie, Xinglin, Song, Kehuan, Lu, Xiaoqiang, Jiao, Licheng, Liu, Fang, Liu, Xu, Chen, Puhua, Nguyen, Ha Thu, De Moor, Katrien, Amirshahi, Seyed Ali, Larabi, Mohamed-Chaker, Tang, Qi, He, Linfeng, Gao, Zhiyong, Gao, Zixuan, Zhang, Guohua, Huang, Zhiye, Deng, Yi, Jiang, Qingmiao, Chen, Lu, Yang, Yi, Liao, Xi, Nadir, Nourine Mohammed, Jiang, Yuxuan, Zhu, Qiang, Teng, Siyue, Zhang, Fan, Zhu, Shuyuan, Zeng, Bing, Bull, David, Liu, Meiqin, Yao, Chao, Zhao, Yao

This paper presents a review for the NTIRE 2025 Challenge on Short-form UGC Video Quality Assessment and Enhancement. The challenge comprises two tracks: (i) Efficient Video Quality Assessment (KVQ), and (ii) Diffusion-based Image Super-Resolution (KwaiSR). Track 1 aims to advance the development of lightweight and efficient video quality assessment (VQA) models, with an emphasis on eliminating reliance on model ensembles, redundant weights, and other computationally expensive components in the previous IQA/VQA competitions. Track 2 introduces a new short-form UGC dataset tailored for single image super-resolution, i.e., the KwaiSR dataset. It consists of 1,800 synthetically generated S-UGC image pairs and 1,900 real-world S-UGC images, which are split into training, validation, and test sets using a ratio of 8:1:1. The primary objective of the challenge is to drive research that benefits the user experience of short-form UGC platforms such as Kwai and TikTok. This challenge attracted 266 participants and received 18 valid final submissions with corresponding fact sheets, significantly contributing to the progress of short-form UGC VQA and image superresolution. The project is publicly available at https://github.com/lixinustc/KVQE- ChallengeCVPR-NTIRE2025.

artificial intelligence, machine learning, natural language, (13 more...)

2504.13131

Country: Asia > China (0.93)

Genre:

Overview (0.54)
Research Report (0.40)

Industry: Education (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Communications (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
(3 more...)

Chrysos, Grigorios G, Wu, Yongtao, Pascanu, Razvan, Torr, Philip, Cevher, Volkan

Hadamard product in deep learning: Introduction, Advances and Challenges

Abstract--While convolution and self-attention mechanisms have dominated architectural design in deep learning, this survey examines a fundamental yet understudied primitive: the Hadamard product . Despite its widespread implementation across various applications, the Hadamard product has not been systematically analyzed as a core architectural primitive. We present the first comprehensive taxonomy of its applications in deep learning, identifying four principal domains: higher-order correlation, multimodal data fusion, dynamic representation modulation, and efficient pairwise operations. The Hadamard product's ability to model nonlinear interactions with linear computational complexity makes it particularly valuable for resource-constrained deployments and edge computing scenarios. We demonstrate its natural applicability in multimodal fusion tasks, such as visual question answering, and its effectiveness in representation masking for applications including image inpainting and pruning. This systematic review not only consolidates existing knowledge about the Hadamard product's role in deep learning architectures but also establishes a foundation for future architectural innovations. Our analysis reveals the Hadamard product as a versatile primitive that offers compelling trade-offs between computational efficiency and representational power, positioning it as a crucial component in the deep learning toolkit.

artificial intelligence, hadamard product, machine learning, (13 more...)

2504.13112

Country:

North America > United States (0.46)
Europe > United Kingdom > England (0.27)

Genre: Overview (1.00)

Industry:

Health & Medicine > Diagnostic Medicine > Imaging (0.67)
Education (0.67)
Health & Medicine > Therapeutic Area > Neurology (0.67)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Curto, Georgina, Kiritchenko, Svetlana, Siddiqui, Muhammad Hammad Fahim, Nejadgholi, Isar, Fraser, Kathleen C.

Tackling Social Bias against the Poor: A Dataset and Taxonomy on Aporophobia

Eradicating poverty is the first goal in the United Nations Sustainable Development Goals. However, aporophobia -- the societal bias against people living in poverty -- constitutes a major obstacle to designing, approving and implementing poverty-mitigation policies. This work presents an initial step towards operationalizing the concept of aporophobia to identify and track harmful beliefs and discriminative actions against poor people on social media. In close collaboration with non-profits and governmental organizations, we conduct data collection and exploration. Then we manually annotate a corpus of English tweets from five world regions for the presence of (1) direct expressions of aporophobia, and (2) statements referring to or criticizing aporophobic views or actions of others, to comprehensively characterize the social media discourse related to bias and discrimination against the poor. Based on the annotated data, we devise a taxonomy of categories of aporophobic attitudes and actions expressed through speech on social media. Finally, we train several classifiers and identify the main challenges for automatic detection of aporophobia in social networks. This work paves the way towards identifying, tracking, and mitigating aporophobic views on social media at scale.

large language model, machine learning, natural language, (21 more...)

2504.13085

Country:

Asia (1.00)
Africa (1.00)
Europe > United Kingdom > England (0.46)
(2 more...)

Genre:

Overview (1.00)
Research Report > New Finding (0.92)

Industry:

Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Consumer Health (1.00)
(5 more...)

Technology:

Information Technology > Communications > Social Media (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.69)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.69)

A Comprehensive Survey of Reward Models: Taxonomy, Applications, Challenges, and Future

Zhong, Jialun, Shen, Wei, Li, Yanzeng, Gao, Songyang, Lu, Hua, Chen, Yicheng, Zhang, Yang, Zhou, Wei, Gu, Jinjie, Zou, Lei

Reward Model (RM) has demonstrated impressive potential for enhancing Large Language Models (LLM), as RM can serve as a proxy for human preferences, providing signals to guide LLMs' behavior in various tasks. In this paper, we provide a comprehensive overview of relevant research, exploring RMs from the perspectives of preference collection, reward modeling, and usage. Next, we introduce the applications of RMs and discuss the benchmarks for evaluation. Furthermore, we conduct an in-depth analysis of the challenges existing in the field and dive into the potential research directions. This paper is dedicated to providing beginners with a comprehensive introduction to RMs and facilitating future studies. The resources are publicly available at github\footnote{https://github.com/JLZhong23/awesome-reward-models}.

arxiv preprint, large language model, machine learning, (18 more...)

2504.12328

Country:

North America > United States (1.00)
Asia (1.00)
North America > Canada (0.68)
Europe > Austria > Vienna (0.15)

Genre:

Overview (1.00)
Research Report (0.87)

Industry: Leisure & Entertainment (0.45)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Towards Scientific Intelligence: A Survey of LLM-based Scientific Agents

Ren, Shuo, Jian, Pu, Ren, Zhenjiang, Leng, Chunlin, Xie, Can, Zhang, Jiajun

As scientific research becomes increasingly complex, innovative tools are needed to manage vast data, facilitate interdisciplinary collaboration, and accelerate discovery. Large language models (LLMs) are now evolving into LLM-based scientific agents that automate critical tasks, ranging from hypothesis generation and experiment design to data analysis and simulation. Unlike general-purpose LLMs, these specialized agents integrate domain-specific knowledge, advanced tool sets, and robust validation mechanisms, enabling them to handle complex data types, ensure reproducibility, and drive scientific breakthroughs. This survey provides a focused review of the architectures, design, benchmarks, applications, and ethical considerations surrounding LLM-based scientific agents. We highlight why they differ from general agents and the ways in which they advance research across various scientific fields. By examining their development and challenges, this survey offers a comprehensive roadmap for researchers and practitioners to harness these agents for more efficient, reliable, and ethically sound scientific discovery.

large language model, machine learning, natural language, (18 more...)

2503.24047

Country:

Europe > Austria (0.28)
Asia > China (0.28)
North America > United States (0.28)
Asia > Middle East > UAE (0.27)

Genre:

Research Report (1.00)
Overview (1.00)

Industry:

Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Therapeutic Area > Oncology (0.67)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

arXiv.org Machine LearningApr-17-2025

Adversarial Resilience against Clean-Label Attacks in Realizable and Noisy Settings

Heinzler, Carolin

We investigate the challenge of establishing stochastic-like guarantees when sequentially learning from a stream of i.i.d. data that includes an unknown quantity of clean-label adversarial samples. We permit the learner to abstain from making predictions when uncertain. The regret of the learner is measured in terms of misclassification and abstention error, where we allow the learner to abstain for free on adversarial injected samples. This approach is based on the work of Goel, Hanneke, Moran, and Shetty from arXiv:2306.13119. We explore the methods they present and manage to correct inaccuracies in their argumentation. However, this approach is limited to the realizable setting, where labels are assigned according to some function $f^*$ from the hypothesis space $\mathcal{F}$. Based on similar arguments, we explore methods to make adaptations for the agnostic setting where labels are random. Introducing the notion of a clean-label adversary in the agnostic context, we are the first to give a theoretical analysis of a disagreement-based learner for thresholds, subject to a clean-label adversary with noise.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Machine Learning

2504.13966

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > United States > California > San Francisco County > San Francisco (0.14)
Oceania > Australia > Australian Capital Territory > Canberra (0.04)
(7 more...)

Genre:

Overview (1.00)
Instructional Material > Course Syllabus & Notes (0.46)

Industry:

Education (0.46)
Information Technology > Security & Privacy (0.45)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.92)
(2 more...)