Who Writes What: Unveiling the Impact of Author Roles on AI-generated Text Detection

Li, Jiatao, Wan, Xiaojun

arXiv.org Artificial Intelligence

The rise of Large Language Models (LLMs) necessitates accurate AI-generated text detection. However, current approaches largely overlook the influence of author characteristics. We investigate how sociolinguistic attributes (gender, CEFR proficiency, academic field, and language environment) impact state-of-the-art AI text detectors. Using the ICNALE corpus of human-authored texts and parallel AI-generated texts from diverse LLMs, we conduct a rigorous evaluation employing multi-factor ANOVA and weighted least squares (WLS). Our results reveal significant biases: CEFR proficiency and language environment consistently affected detector accuracy, while gender and academic field showed detector-dependent effects. These findings highlight the crucial need for socially aware AI text detection to avoid unfairly penalizing specific demographic groups. We offer novel empirical evidence, a robust statistical framework, and actionable insights for developing more equitable and reliable detection systems in real-world, out-of-domain contexts. This work paves the way for future research on bias mitigation, inclusive evaluation benchmarks, and socially responsible LLM detectors.
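As a minimal illustration of the kind of test involved (not the authors' code — the study uses multi-factor ANOVA with WLS, and all numbers below are hypothetical), a one-way ANOVA F-statistic over detector accuracies grouped by CEFR level can be computed in pure Python:

```python
from itertools import chain

def one_way_anova_f(groups):
    """F-statistic for a one-way ANOVA over lists of observations."""
    all_obs = list(chain.from_iterable(groups))
    n, k = len(all_obs), len(groups)
    grand_mean = sum(all_obs) / n
    # Between-group sum of squares: how far group means sit from the grand mean
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    # Within-group sum of squares: spread of observations around their own group mean
    ss_within = sum((x - sum(g) / len(g)) ** 2 for g in groups for x in g)
    df_between, df_within = k - 1, n - k
    return (ss_between / df_between) / (ss_within / df_within)

# Hypothetical detector accuracies grouped by author CEFR level
acc_by_cefr = [
    [0.71, 0.68, 0.74, 0.70],   # A2
    [0.80, 0.83, 0.79, 0.81],   # B1
    [0.90, 0.88, 0.91, 0.89],   # B2
]
f_stat = one_way_anova_f(acc_by_cefr)  # large F => accuracy differs by group
```

A large F relative to the F-distribution's critical value is exactly the signal of a proficiency-dependent bias the abstract describes.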


Causal Inference Tools for a Better Evaluation of Machine Learning

Soumm, Michaël

arXiv.org Artificial Intelligence

We present a comprehensive framework for applying rigorous statistical techniques from econometrics to analyze and improve machine learning systems. We introduce key statistical methods such as Ordinary Least Squares (OLS) regression, Analysis of Variance (ANOVA), and logistic regression, explaining their theoretical foundations and practical applications in machine learning evaluation. The document serves as a guide for researchers and practitioners, detailing how these techniques can provide deeper insights into model behavior, performance, and fairness. We cover the mathematical principles behind each method, discuss their assumptions and limitations, and provide step-by-step instructions for their implementation. The paper also addresses how to interpret results, emphasizing the importance of statistical significance and effect size. Through illustrative examples, we demonstrate how these tools can reveal subtle patterns and interactions in machine learning models that are not apparent from traditional evaluation metrics. By connecting the fields of econometrics and machine learning, this work aims to equip readers with powerful analytical tools for more rigorous and comprehensive evaluation of AI systems. The framework presented here contributes to developing more robust, interpretable, and fair machine learning technologies.
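As a toy illustration of the simplest tool in that kit, OLS with a single regressor has a closed-form solution; the data below are hypothetical (accuracy versus log-scale training-set size):

```python
def ols_fit(x, y):
    """Ordinary least squares for y = a + b*x via the closed-form solution."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    # Slope: covariance of x and y divided by variance of x
    b = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / \
        sum((xi - mx) ** 2 for xi in x)
    a = my - b * mx  # intercept passes through the means
    return a, b

# Hypothetical: model accuracy vs. log10 of training-set size
x = [2, 3, 4, 5]              # log10(num examples)
y = [0.60, 0.70, 0.80, 0.90]
a, b = ols_fit(x, y)          # b estimates accuracy gain per decade of data
```

The econometric framing the paper advocates is precisely this: the fitted coefficient is an effect-size estimate, not just a curve-fit, and its significance can then be tested.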


Future You: A Conversation with an AI-Generated Future Self Reduces Anxiety, Negative Emotions, and Increases Future Self-Continuity

Pataranutaporn, Pat, Winson, Kavin, Yin, Peggy, Lapapirojn, Auttasak, Ouppaphan, Pichayoot, Lertsutthiwong, Monchai, Maes, Pattie, Hershfield, Hal

arXiv.org Artificial Intelligence

We introduce "Future You," an interactive, brief, single-session, digital chat intervention designed to improve future self-continuity--the degree of connection an individual feels with a temporally distant future self--a characteristic that is positively related to mental health and wellbeing. Our system allows users to chat with a relatable yet AI-powered virtual version of their future selves that is tuned to their future goals and personal qualities. To make the conversation realistic, the system generates a "synthetic memory"--a unique backstory for each user--that creates a throughline between the user's present age (between 18 and 30) and their life at age 60. The "Future You" character also adopts the persona of an age-progressed image of the user's present self. After a brief interaction with the "Future You" character, users reported decreased anxiety and increased future self-continuity. This is the first study successfully demonstrating the use of personalized AI-generated characters to improve users' future self-continuity and wellbeing.


Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures

Kölle, Michael, Maurer, Jonas, Altmann, Philipp, Sünkel, Leo, Stein, Jonas, Linnhoff-Popien, Claudia

arXiv.org Artificial Intelligence

Quantum computing offers the potential for superior computational capabilities, particularly for data-intensive tasks. However, the current state of quantum hardware puts heavy restrictions on input size. To address this, hybrid transfer learning solutions have been developed, merging pre-trained classical models, capable of handling extensive inputs, with variational quantum circuits. Yet, it remains unclear how much each component -- classical and quantum -- contributes to the model's results. We propose a novel hybrid architecture: instead of utilizing a pre-trained network for compression, we employ an autoencoder to derive a compressed version of the input data. This compressed data is then channeled through the encoder part of the autoencoder to the quantum component. We assess our model's classification capabilities against two state-of-the-art hybrid transfer learning architectures, two purely classical architectures and one quantum architecture. Their accuracy is compared across four datasets: Banknote Authentication, Breast Cancer Wisconsin, MNIST digits, and AudioMNIST. Our research suggests that classical components significantly influence classification in hybrid transfer learning, a contribution often mistakenly ascribed to the quantum element. The performance of our model aligns with that of a variational quantum circuit using amplitude embedding, positioning it as a feasible alternative.


How Do Human Users Teach a Continual Learning Robot in Repeated Interactions?

Ayub, Ali, Mehta, Jainish, De Francesco, Zachary, Holthaus, Patrick, Dautenhahn, Kerstin, Nehaniv, Chrystopher L.

arXiv.org Artificial Intelligence

Continual learning (CL) has emerged as an important avenue of research in recent years, at the intersection of Machine Learning (ML) and Human-Robot Interaction (HRI), to allow robots to continually learn in their environments over long-term interactions with humans. Most research in continual learning, however, has been robot-centered, developing continual learning algorithms that can quickly learn new information on static datasets. In this paper, we take a human-centered approach to continual learning, to understand how humans teach continual learning robots over the long term and whether there are variations in their teaching styles. We conducted an in-person study with 40 participants who interacted with a continual learning robot in 200 sessions. In this between-participant study, we used two different CL models deployed on a Fetch mobile manipulator robot. An extensive qualitative and quantitative analysis of the data collected in the study shows that there is significant variation among the teaching styles of individual users, indicating the need for personalized adaptation to their distinct teaching styles. The results also show that although there is a difference in the teaching styles between expert and non-expert users, the style does not have an effect on the performance of the continual learning robot. Finally, our analysis shows that the constrained experimental setups that have been widely used to test most continual learning techniques are not adequate, as real users interact with and teach continual learning robots in a variety of ways. Our code is available at https://github.com/aliayub7/cl_hri.


A Closer Look at Parameter-Efficient Tuning in Diffusion Models

Xiang, Chendong, Bao, Fan, Li, Chongxuan, Su, Hang, Zhu, Jun

arXiv.org Artificial Intelligence

Large-scale diffusion models like Stable Diffusion are powerful and find various real-world applications, but customizing such models by full fine-tuning is inefficient in both memory and time. Motivated by recent progress in natural language processing, we investigate parameter-efficient tuning in large diffusion models by inserting small learnable modules (termed adapters). In particular, we decompose the design space of adapters into orthogonal factors -- the input position, the output position, and the function form -- and perform Analysis of Variance (ANOVA), a classical statistical approach for analyzing the correlation between discrete variables (design options) and continuous variables (evaluation metrics). Our analysis suggests that the input position of adapters is the critical factor influencing the performance of downstream tasks. We then carefully study the choice of the input position, and find that placing the input position after the cross-attention block leads to the best performance, validated by additional visualization analyses. Finally, we provide a recipe for parameter-efficient tuning in diffusion models, which is comparable if not superior to the fully fine-tuned baseline (e.g., DreamBooth) with only 0.75% extra parameters, across various customized tasks.
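The adapter idea can be sketched in plain Python (an illustrative stand-in with made-up dimensions and random weights, not the paper's implementation — real adapters operate on framework tensors inside the diffusion model, and the paper's finding concerns where the adapter's input is taken from):

```python
import math
import random

def make_adapter(dim, bottleneck, seed=0):
    """A minimal bottleneck adapter: down-project, ReLU, up-project, residual add.
    In practice only these two small matrices are trained; the base model is frozen."""
    rng = random.Random(seed)
    scale = 1.0 / math.sqrt(dim)
    down = [[rng.uniform(-scale, scale) for _ in range(dim)] for _ in range(bottleneck)]
    up = [[rng.uniform(-scale, scale) for _ in range(bottleneck)] for _ in range(dim)]

    def matvec(m, v):
        return [sum(w * x for w, x in zip(row, v)) for row in m]

    def adapter(h):
        z = [max(0.0, u) for u in matvec(down, h)]            # down-projection + ReLU
        return [hi + ui for hi, ui in zip(h, matvec(up, z))]  # up-projection + residual
    return adapter

adapter = make_adapter(dim=8, bottleneck=2)
h = [1.0] * 8          # a hidden state, e.g. the output of a cross-attention block
out = adapter(h)       # same shape as the input, so it slots between existing layers
```

Because the output shape matches the input, the module can be inserted at any of the candidate positions in the design space the abstract describes — which is what makes the position itself a tunable factor for ANOVA.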


Statistics (III) ANOVA in Data Science & Machine Learning

#artificialintelligence

For the last part of the Statistics series, we will cover ANOVA, post-hoc pairwise comparisons, two-way ANOVA, and R-squared. Previously, our study focused on one or two groups of subjects. How can we handle multiple groups with multiple factors? For example, both dose level and gender may impact the effectiveness of a vaccine. How can we determine whether particular combinations of factors produce statistically significant effects?
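Of the topics listed, R-squared is the quickest to demonstrate; a minimal pure-Python version with made-up numbers:

```python
def r_squared(y_true, y_pred):
    """Proportion of the variance in y_true explained by the predictions."""
    mean_y = sum(y_true) / len(y_true)
    ss_res = sum((t - p) ** 2 for t, p in zip(y_true, y_pred))  # residual sum of squares
    ss_tot = sum((t - mean_y) ** 2 for t in y_true)             # total sum of squares
    return 1 - ss_res / ss_tot

y_true = [3.0, 5.0, 7.0, 9.0]
y_pred = [2.8, 5.1, 7.2, 8.9]   # hypothetical model outputs
r2 = r_squared(y_true, y_pred)  # close to 1.0 here, since predictions track y_true
```

An R-squared near 1 means the model accounts for nearly all the spread in the data; near 0, it does no better than predicting the mean.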


Statistical Tests

#artificialintelligence

Statistics is pretty smooth going until we come across inferential statistics, where a great deal happens at once. The name fits: we use it to make inferences. And with it come the various statistical tests we conduct whenever we formulate a statistical hypothesis. Testing can seem like a boring step in a data science project, but it matters for exactly what the name suggests: it tells you how well a sample lets you understand the whole population. Before jumping directly into the tests, let's cover some of the introductory ideas behind them.


Statistical Tests in Machine Learning

#artificialintelligence

When it comes to statistics in machine learning, a common approach to accepting or rejecting a null hypothesis is to check the p-value and report a result without any real idea of what goes on in the background. Without getting into fancy jargon or mathematical technicalities, this article attempts to sum up the intuition behind statistics using real-life examples, especially for people from a non-statistics background. Why do we need hypothesis testing? Suppose Dunkin' were suddenly at risk of shutting down because Krispy Kreme claims the weight of Dunkin's donuts is less than what Dunkin' advertises. How do we choose sides?
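The donut dispute is a classic two-sample comparison. As a sketch with hypothetical weights (all numbers invented for illustration), Welch's t-statistic — which does not assume equal variances — can be computed by hand; the p-value would then come from the t-distribution:

```python
import math

def welch_t(a, b):
    """Welch's t-statistic for two independent samples with unequal variances."""
    na, nb = len(a), len(b)
    ma, mb = sum(a) / na, sum(b) / nb
    va = sum((x - ma) ** 2 for x in a) / (na - 1)  # sample variance of a
    vb = sum((x - mb) ** 2 for x in b) / (nb - 1)  # sample variance of b
    return (ma - mb) / math.sqrt(va / na + vb / nb)

# Hypothetical donut weights in grams from each shop
dunkin = [52.0, 51.5, 52.3, 51.8, 52.1]
krispy = [50.9, 51.2, 50.7, 51.0, 51.3]
t = welch_t(dunkin, krispy)  # a large |t| is evidence the mean weights differ
```

Comparing |t| against the critical value for the chosen significance level is exactly the step where the p-value enters — and understanding this computation is the intuition the p-value otherwise hides.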


What causes the test error? Going beyond bias-variance via ANOVA

#artificialintelligence

Modern machine learning methods are often overparametrized, allowing adaptation to the data at a fine level. This can seem puzzling; in the worst case, such models need not generalize at all. The puzzle has inspired a great deal of work on when overparametrization reduces test error, a phenomenon called "double descent". Recent work has aimed to understand in greater depth why overparametrization helps generalization. This has led to the discovery that variance is a unimodal function of the level of parametrization, and to decomposing the variance into the parts arising from label noise, initialization, and randomness in the training data, in order to understand the sources of the error.
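The decomposition described above rests on the law of total variance: Var = E[Var | group] + Var(E[· | group]). A toy sketch (with invented numbers) treats each training-set draw as a group and each column as an initialization seed:

```python
def variance(xs):
    """Population variance of a list of numbers."""
    m = sum(xs) / len(xs)
    return sum((x - m) ** 2 for x in xs) / len(xs)

# preds[d][i]: a test prediction for training-set draw d and init seed i (hypothetical)
preds = [
    [0.62, 0.60, 0.64],
    [0.70, 0.73, 0.67],
    [0.55, 0.57, 0.53],
]

# Variance from initialization: average within-row variance
var_init = sum(variance(row) for row in preds) / len(preds)
# Variance from data sampling: variance of the per-row means
row_means = [sum(row) / len(row) for row in preds]
var_data = variance(row_means)
# Total variance over all (data, seed) pairs
total = variance([p for row in preds for p in row])
# Law of total variance: var_init + var_data equals total (up to float error)
```

With equal-sized groups the two components sum exactly to the total, which is what lets the test error's variance be attributed to its separate sources.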