AITopics

doi: 10.1007/978-3-031-83793-7_28

2503.17424

Country: Asia > India (1.00)

Genre:

Instructional Material (0.88)
Research Report > New Finding (0.67)

Industry:

Banking & Finance (1.00)
Information Technology > Software (0.94)
Education > Educational Setting > Higher Education (0.68)
Government > Regional Government (0.68)

Technology:

Information Technology > Data Science > Data Mining (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)

Neural Information Processing SystemsMar-20-2025, 08:16:12 GMT

DOPPLER: Differentially Private Optimizers with Low-pass Filter for Privacy Noise Reduction Xinwei Zhang University of Southern California Zhiqi Bu

Privacy is a growing concern in modern deep-learning systems and applications. Differentially private (DP) training prevents the leakage of sensitive information in the collected training data from the trained machine learning models. DP op-timizers, including DP stochastic gradient descent (DPSGD) and its variants, privatize the training procedure by gradient clipping and DP noise injection. However, in practice, DP models trained using DPSGD and its variants often suffer from significant model performance degradation. Such degradation prevents the application of DP optimization in many key tasks, such as foundation model pre-training.

artificial intelligence, deep learning, machine learning, (19 more...)

Country: North America > United States > California (0.86)

Genre: Research Report > Experimental Study (0.93)

Industry:

Information Technology > Security & Privacy (1.00)
Education > Educational Setting > Higher Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)

Neural Information Processing SystemsMar-20-2025, 02:50:09 GMT

Learning Deep Input-Output Stable Dynamics Graduate School of Medicine Graduate School of Medicine Kyoto University

Learning stable dynamics from observed time-series data is an essential problem in robotics, physical modeling, and systems biology. Many of these dynamics are represented as an inputs-output system to communicate with the external environment. In this study, we focus on input-output stable systems, exhibiting robustness against unexpected stimuli and noise. We propose a method to learn nonlinear systems guaranteeing the input-output stability. Our proposed method utilizes the differentiable projection onto the space satisfying the Hamilton-Jacobi inequality to realize the input-output stability. The problem of finding this projection can be formulated as a quadratic constraint quadratic programming problem, and we derive the particular solution analytically. Also, we apply our method to a toy bistable model and the task of training a benchmark generated from a glucoseinsulin simulator. The results show that the nonlinear system with neural networks by our method achieves the input-output stability, unlike naive neural networks.

artificial intelligence, machine learning, optimization problem, (16 more...)

Country: Asia > Japan > Honshū > Kansai > Kyoto Prefecture > Kyoto (0.41)

Genre: Research Report > New Finding (1.00)

Industry:

Health & Medicine (1.00)
Education > Educational Setting > Higher Education (0.76)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.72)
Information Technology > Artificial Intelligence > Representation & Reasoning > Optimization (0.66)

Neural Information Processing SystemsMar-19-2025, 16:50:16 GMT

6a571fe98a2ba453e84923b447d79cff-Paper.pdf

artificial intelligence, data mining, machine learning, (19 more...)

Country: North America > United States (0.93)

Industry:

Education > Educational Setting > Higher Education (1.00)
Health & Medicine (0.68)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Data Science > Data Mining (0.68)

arXiv.org Artificial IntelligenceMar-19-2025

Advancing Problem-Based Learning in Biomedical Engineering in the Era of Generative AI

Nnamdi, Micky C., Tamo, J. Ben, Shi, Wenqi, Wang, May D.

Problem-Based Learning (PBL) has significantly impacted biomedical engineering (BME) education since its introduction in the early 2000s, effectively enhancing critical thinking and real-world knowledge application among students. With biomedical engineering rapidly converging with artificial intelligence (AI), integrating effective AI education into established curricula has become challenging yet increasingly necessary. Recent advancements, including AI's recognition by the 2024 Nobel Prize, have highlighted the importance of training students comprehensively in biomedical AI. However, effective biomedical AI education faces substantial obstacles, such as diverse student backgrounds, limited personalized mentoring, constrained computational resources, and difficulties in safely scaling hands-on practical experiments due to privacy and ethical concerns associated with biomedical data. To overcome these issues, we conducted a three-year (2021-2023) case study implementing an advanced PBL framework tailored specifically for biomedical AI education, involving 92 undergraduate and 156 graduate students from the joint Biomedical Engineering program of Georgia Institute of Technology and Emory University. Our approach emphasizes collaborative, interdisciplinary problem-solving through authentic biomedical AI challenges. The implementation led to measurable improvements in learning outcomes, evidenced by high research productivity (16 student-authored publications), consistently positive peer evaluations, and successful development of innovative computational methods addressing real biomedical challenges. Additionally, we examined the role of generative AI both as a teaching subject and an educational support tool within the PBL framework. Our study presents a practical and scalable roadmap for biomedical engineering departments aiming to integrate robust AI education into their curricula.

artificial intelligence, machine learning, natural language, (15 more...)

2503.16558

Country:

North America > United States (1.00)
North America > Canada > Ontario > Toronto (0.14)

Genre:

Instructional Material (1.00)
Research Report > New Finding (0.46)
Personal > Honors (0.34)

Industry:

Health & Medicine > Therapeutic Area (1.00)
Health & Medicine > Pharmaceuticals & Biotechnology (1.00)
Health & Medicine > Health Care Technology (1.00)
(3 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.86)

arXiv.org Artificial IntelligenceMar-19-2025

ChatGPT and U(X): A Rapid Review on Measuring the User Experience

Seaborn, Katie

ChatGPT, powered by a large language model (LLM), has revolutionized everyday human-computer interaction (HCI) since its 2022 release. While now used by millions around the world, a coherent pathway for evaluating the user experience (UX) ChatGPT offers remains missing. In this rapid review (N = 58), I explored how ChatGPT UX has been approached quantitatively so far. I focused on the independent variables (IVs) manipulated, the dependent variables (DVs) measured, and the methods used for measurement. Findings reveal trends, gaps, and emerging consensus in UX assessments. This work offers a first step towards synthesizing existing approaches to measuring ChatGPT UX, urgent trajectories to advance standardization and breadth, and two preliminary frameworks aimed at guiding future research and tool development. I seek to elevate the field of ChatGPT UX by empowering researchers and practitioners in optimizing user interactions with ChatGPT and similar LLM-based systems.

chatgpt, human factor, retrieved, (16 more...)

2503.15808

Country:

North America > United States (0.28)
Asia (0.28)

Industry:

Education > Educational Setting > Higher Education (0.46)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology > Mental Health (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Akhtarshenas, Azim, Dini, Afshin, Ayoobi, Navid

ChatGPT or A Silent Everywhere Helper: A Survey of Large Language Models

arXiv.org Artificial IntelligenceMar-19-2025

Large Language Models (LLMs) have revo lutionized natural language processing Natural Language Processing (NLP), with Chat Generative Pre-trained Transformer (ChatGPT) standing out as a notable exampledue to its advanced capabilities and widespread applications. This survey provides a comprehensive analysis of ChatGPT, exploring its architecture, training processes, and functionalities. We examine its integration into various domains across industries such as customer service, education, healthcare, and entertainment. A comparative analysis with other LLMs highlights ChatGPT's unique features and performance metrics. Regarding benchmarks, the paper examines ChatGPT's comparative performance against other LLMs and discusses potential risks such as misinformation, bias, and data privacy concerns. Additionally, we offer a number of figures and tables that outline the backdrop of the discussion, the main ideas of the article, the numerous LLM models, a thorough list of datasets used for pre-training, fine-tuning, and evaluation, as well as particular LLM applications with pertinent references. Finally, we identify future research directions and technological advancements, underscoring the evolving landscape of LLMs and their profound impact on artificial intelligence Artificial Intelligence (AI) and society.

arxiv preprint, comprehension, information processing system, (14 more...)

2503.17403

Country:

Europe (1.00)
Asia (1.00)
North America > United States > Louisiana (0.13)

Industry:

Media (1.00)
Law (1.00)
Information Technology > Security & Privacy (1.00)
(6 more...)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.46)

Neural Information Processing SystemsMar-18-2025, 21:41:40 GMT

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective - Supplementary Material - Shenzhen International Graduate School, Tsinghua University

A.1 Basic setting Setting 1 Without loss of generality, we suppose that the long-tailed distribution satisfies some kind exponential distribution with parameter [8]. Proof A.1 Follow the Basic Setting 1, when mixing factor Beta(,), consider a -Aug sample generated by ex Therefore, the head gets more regulation than the tail. One the one hand, the classification performance will be promoted. On the other hand, however, the performance gap between the head and tail still exists. Hence we can generalize Eq.A.9 to: Z y According to Eq.A.11, it's easy to find a derivative zero point in range [1,C]. UniMix Factor (green) alleviates such situation and the full pipeline ( = 1) constructs a more uniform distribution of -Aug (red), which contributes to a well-calibrated model. As illustrated in Fig.A1, when the class number C and imbalance factor get larger, the limitations of mixup in LT scenarios gradually appear. It has limited contribution for the tail class' feature learning and regulation, which is the reason for its poor calibration.

artificial intelligence, calibration, machine learning, (13 more...)

Country: Asia > China > Guangdong Province > Shenzhen (0.40)

Industry:

Health & Medicine (0.68)
Education > Educational Setting > Higher Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Neural Information Processing SystemsMar-18-2025, 18:51:22 GMT

Conformal Prediction using Conditional Histograms Matteo Sesia Department of Data Sciences and Operations University of Southern California, USA

This paper develops a conformal method to compute prediction intervals for nonparametric regression that can automatically adapt to skewed data. Leveraging black-box machine learning algorithms to estimate the conditional distribution of the outcome using histograms, it translates their output into the shortest prediction intervals with approximate conditional coverage. The resulting prediction intervals provably have marginal coverage in finite samples, while asymptotically achieving conditional coverage and optimal length if the black-box model is consistent. Numerical experiments with simulated and real data demonstrate improved performance compared to state-of-the-art alternatives, including conformalized quantile regression and other distributional conformal prediction approaches.

artificial intelligence, machine learning, prediction interval, (18 more...)

Country: North America > United States > California (0.86)

Industry:

Health & Medicine (0.68)
Education > Educational Setting > Higher Education (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (0.47)

Neural Information Processing SystemsMar-18-2025, 04:31:37 GMT

Compositional Generalization via Neural-Symbolic Stack Machines Chen Liang, Adams Wei Yu UC Berkeley

Despite achieving tremendous success, existing deep learning models have exposed limitations in compositional generalization, the capability to learn compositional rules and apply them to unseen cases in a systematic manner. To tackle this issue, we propose the Neural-Symbolic Stack Machine (NeSS). It contains a neural network to generate traces, which are then executed by a symbolic stack machine enhanced with sequence manipulation operations. NeSS combines the expressive power of neural sequence models with the recursion supported by the symbolic stack machine. Without training supervision on execution traces, NeSS achieves 100% generalization performance in four domains: the SCAN benchmark of language-driven navigation tasks, the task of few-shot learning of compositional instructions, the compositional machine translation benchmark, and context-free grammar parsing tasks.

artificial intelligence, machine learning, natural language, (20 more...)

Country: North America (0.28)

Genre: Overview (0.47)

Industry: Education > Educational Setting > Higher Education (0.40)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Grammars & Parsing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)