Higher Education
LLMs' Reshaping of People, Processes, Products, and Society in Software Development: A Comprehensive Exploration with Early Adopters
Tabarsi, Benyamin, Reichert, Heidi, Limke, Ally, Kuttal, Sandeep, Barnes, Tiffany
Large language models (LLMs) like OpenAI ChatGPT, Google Gemini, and GitHub Copilot are rapidly gaining traction in the software industry, but their full impact on software engineering remains insufficiently explored. Despite their growing adoption, there is a notable lack of formal, qualitative assessments of how LLMs are applied in real-world software development contexts. To fill this gap, we conducted semi-structured interviews with sixteen early-adopter professional developers to explore their use of LLMs throughout various stages of the software development life cycle. Our investigation examines four dimensions: people - how LLMs affect individual developers and teams; process - how LLMs alter software engineering workflows; product - LLM impact on software quality and innovation; and society - the broader socioeconomic and ethical implications of LLM adoption. Thematic analysis of our data reveals that while LLMs have not fundamentally revolutionized the development process, they have substantially enhanced routine coding tasks, including code generation, refactoring, and debugging. Developers reported the most effective outcomes when providing LLMs with clear, well-defined problem statements, indicating that LLMs excel with decomposed problems and specific requirements. Furthermore, these early adopters found that LLMs offer significant value for personal and professional development, aiding in learning new languages and concepts. Being highly skilled in software engineering and knowledgeable about how LLMs work, they also identified early and persistent challenges, such as inaccuracies in generated content and the need for careful manual review before integrating LLM outputs into production environments. Our study provides a nuanced understanding of how LLMs are shaping the landscape of software development, highlighting their benefits, limitations, and ongoing implications.
WIP: Assessing the Effectiveness of ChatGPT in Preparatory Testing Activities
Haldar, Susmita, Pierce, Mary, Capretz, Luiz Fernando
This innovative practice WIP paper describes a research study that explores the integration of ChatGPT into the software testing curriculum and evaluates its effectiveness compared to human-generated testing artifacts. In a Capstone Project course, students were tasked with using ChatGPT prompts to generate preparatory testing artifacts that they had previously created manually. Their understanding and the effectiveness of the AI-generated artifacts were assessed through targeted questions. The results, drawn from this in-class assignment at a North American community college, indicate that while ChatGPT can automate many testing preparation tasks, it cannot fully replace human expertise. However, the students, already familiar with Information Technology at the postgraduate level, found the integration of ChatGPT into their workflow to be straightforward. The study suggests that AI can be gradually introduced into software testing education to keep pace with technological advancements.
OIPR: Evaluation for Time-series Anomaly Detection Inspired by Operator Interest
Jing, Yuhan, Wang, Jingyu, Zhang, Lei, Sun, Haifeng, He, Bo, Zhuang, Zirui, Wang, Chengsen, Qi, Qi, Liao, Jianxin
With the growing adoption of time-series anomaly detection (TAD) technology, numerous studies have employed deep learning-based detectors for analyzing time-series data in the fields of Internet services, industrial systems, and sensors. The selection and optimization of anomaly detectors strongly rely on the availability of an effective performance evaluation method for TAD. Since anomalies in time-series data often manifest as a sequence of points, conventional metrics that solely consider the detection of individual points are inadequate. Existing evaluation methods for TAD typically employ point-based or event-based metrics to capture the temporal context. However, point-based metrics tend to overestimate detectors that excel only in detecting long anomalies, while event-based metrics are susceptible to being misled by fragmented detection results. To address these limitations, we propose OIPR, a novel set of TAD evaluation metrics. It models the process of operators receiving detector alarms and handling faults, using the area under the operator interest curve to evaluate the performance of TAD algorithms. Furthermore, we build a special-scenario dataset to compare the characteristics of different evaluation methods. Through experiments conducted on the special-scenario dataset and five real-world datasets, we demonstrate the remarkable performance of OIPR in extreme and complex scenarios. It achieves a balance between the point and event perspectives, overcoming their primary limitations and offering applicability to broader situations.
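To make the tension between the point and event perspectives concrete, the toy sketch below (our illustration, not the OIPR metric itself) computes point-wise and event-wise recall for two detectors on a labelled series with one long and one short anomaly: a detector that covers only the long event scores highly under the point view, while a detector that merely touches each event scores highly under the event view.

```python
# Minimal sketch (not the OIPR metric): contrasts point-wise and event-wise
# recall on a toy labelled series, illustrating why point-based scores favour
# detectors that only catch long anomalies.

def point_recall(labels, preds):
    """Fraction of anomalous points that are flagged."""
    hits = sum(1 for y, p in zip(labels, preds) if y == 1 and p == 1)
    total = sum(labels)
    return hits / total if total else 0.0

def events(labels):
    """Return (start, end) index pairs of contiguous anomalous segments."""
    segs, start = [], None
    for i, y in enumerate(labels):
        if y == 1 and start is None:
            start = i
        elif y == 0 and start is not None:
            segs.append((start, i - 1))
            start = None
    if start is not None:
        segs.append((start, len(labels) - 1))
    return segs

def event_recall(labels, preds):
    """Fraction of anomalous segments with at least one flagged point."""
    segs = events(labels)
    hit = sum(1 for s, e in segs if any(preds[s:e + 1]))
    return hit / len(segs) if segs else 0.0

# One long anomaly (10 points) and one short anomaly (1 point).
labels  = [0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 1, 0, 0]
# Detector A catches only the long event; detector B touches each event once.
preds_a = [0, 0, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 0, 0, 0, 0, 0, 0]
preds_b = [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 1, 0, 0]

print(point_recall(labels, preds_a), event_recall(labels, preds_a))  # ~0.91, 0.5
print(point_recall(labels, preds_b), event_recall(labels, preds_b))  # ~0.18, 1.0
```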
A Multi-Labeled Dataset for Indonesian Discourse: Examining Toxicity, Polarization, and Demographics Information
Susanto, Lucky, Wijanarko, Musa, Pratama, Prasetia, Tang, Zilu, Akyas, Fariz, Hong, Traci, Idris, Ika, Aji, Alham, Wijaya, Derry
Polarization is defined as divisive opinions held by two or more groups on substantive issues. As the world's third-largest democracy, Indonesia faces growing concerns about the interplay between political polarization and online toxicity, which is often directed at vulnerable minority groups. Despite the importance of this issue, previous NLP research has not fully explored the relationship between toxicity and polarization. To bridge this gap, we present a novel multi-label Indonesian dataset that incorporates toxicity, polarization, and annotator demographic information. Benchmarking this dataset using BERT-base models and large language models (LLMs) shows that polarization information enhances toxicity classification, and vice versa. Furthermore, providing demographic information significantly improves the performance of polarization classification.
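One simple way the reported finding could be exploited, letting polarization information condition a toxicity classifier, is sketched below. This is our illustration, not the paper's setup: the IndoBERT checkpoint name and the tag-prefix conditioning scheme are assumptions.

```python
# Minimal sketch (assumed setup): condition a toxicity classifier on a known
# polarization label by prefixing the text with a tag before encoding it with
# an Indonesian BERT-style model. The checkpoint name is an assumption.
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("indobenchmark/indobert-base-p1")
encoder = AutoModel.from_pretrained("indobenchmark/indobert-base-p1")

class ToxicityWithPolarization(torch.nn.Module):
    """Toxicity classifier that sees the polarization label as extra input."""
    def __init__(self, encoder, hidden=768):
        super().__init__()
        self.encoder = encoder
        self.head = torch.nn.Linear(hidden, 2)  # toxic / non-toxic

    def forward(self, texts, polarized_flags):
        # Inject polarization as a textual prefix, one simple conditioning scheme.
        tagged = [f"[{'POLARIZED' if p else 'NON-POLARIZED'}] {t}"
                  for t, p in zip(texts, polarized_flags)]
        batch = tokenizer(tagged, padding=True, truncation=True, return_tensors="pt")
        hidden = self.encoder(**batch).last_hidden_state[:, 0]  # [CLS] embedding
        return self.head(hidden)

model = ToxicityWithPolarization(encoder)
logits = model(["contoh komentar"], polarized_flags=[True])
print(logits.shape)  # torch.Size([1, 2])
```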
Generative Artificial Intelligence for Academic Research: Evidence from Guidance Issued for Researchers by Higher Education Institutions in the United States
Ganguly, Amrita, Johri, Aditya, Ali, Areej, McDonald, Nora
Many Higher Education Institutions (HEIs) have released institutional guidance for researchers on the use of generative artificial intelligence (GenAI). To better understand the guidance that is being provided, we report findings from a thematic analysis of guidelines from thirty HEIs in the United States that are classified as R1, or "very high research activity." We found that the guidance provided to researchers: 1) asks them to refer to external sources of information, such as funding agencies and publishers, to keep updated, and to use institutional resources for training and education; 2) asks them to understand and learn about specific GenAI attributes that shape research, such as predictive modeling, knowledge cutoff dates, data provenance, and model limitations, and about ethical concerns such as authorship, attribution, privacy, and intellectual property; and 3) includes instructions on how to acknowledge sources and disclose the use of GenAI and how to communicate effectively about GenAI use, and alerts researchers to long-term implications such as overreliance on GenAI, legal consequences, and risks to their institutions. Overall, the guidance places the onus of compliance on individual researchers, making them accountable for any lapses and thereby increasing their responsibility.
Beyond Demographics: Fine-tuning Large Language Models to Predict Individuals' Subjective Text Perceptions
Orlikowski, Matthias, Pei, Jiaxin, Röttger, Paul, Cimiano, Philipp, Jurgens, David, Hovy, Dirk
People naturally vary in their annotations for subjective questions and some of this variation is thought to be due to the person's sociodemographic characteristics. LLMs have also been used to label data, but recent work has shown that models perform poorly when prompted with sociodemographic attributes, suggesting limited inherent sociodemographic knowledge. Here, we ask whether LLMs can be trained to be accurate sociodemographic models of annotator variation. Using a curated dataset of five tasks with standardized sociodemographics, we show that models do improve in sociodemographic prompting when trained but that this performance gain is largely due to models learning annotator-specific behaviour rather than sociodemographic patterns. Across all tasks, our results suggest that models learn little meaningful connection between sociodemographics and annotation, raising doubts about the current use of LLMs for simulating sociodemographic variation and behaviour.
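Sociodemographic prompting, as evaluated here, amounts to asking the model to annotate as if it were a person with a given profile. The sketch below is a generic illustration rather than the paper's exact protocol: the attribute names, the rating scale, and the call_llm placeholder are our assumptions.

```python
# Minimal sketch of sociodemographic prompting: the prompt asks the model to
# answer as an annotator with a given profile. call_llm is a hypothetical
# placeholder for whatever chat API is used; attribute names are illustrative.

def sociodemographic_prompt(text, profile):
    attrs = ", ".join(f"{k}: {v}" for k, v in profile.items())
    return (
        f"You are annotating data as a person with this profile: {attrs}.\n"
        "Rate how offensive the following text is on a scale from 1 (not at "
        "all) to 5 (extremely). Answer with a single number.\n\n"
        f"Text: {text}"
    )

profile = {"age": "25-34", "gender": "woman", "education": "college degree"}
prompt = sociodemographic_prompt("Example post to be rated.", profile)
print(prompt)
# label = call_llm(prompt)  # hypothetical API call
```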
Transformers with Joint Tokens and Local-Global Attention for Efficient Human Pose Estimation
Kinfu, Kaleab A., Vidal, René
Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) have led to significant progress in 2D body pose estimation. However, achieving a good balance between accuracy, efficiency, and robustness remains a challenge. For instance, CNNs are computationally efficient but struggle with long-range dependencies, while ViTs excel in capturing such dependencies but suffer from quadratic computational complexity. This paper proposes two ViT-based models for accurate, efficient, and robust 2D pose estimation. The first one, EViTPose, operates in a computationally efficient manner without sacrificing accuracy by utilizing learnable joint tokens to select and process a subset of the most important body patches, enabling us to control the trade-off between accuracy and efficiency by changing the number of patches to be processed. The second one, UniTransPose, while not allowing for the same level of direct control over the trade-off, efficiently handles multiple scales by combining (1) an efficient multi-scale transformer encoder that uses both local and global attention with (2) an efficient sub-pixel CNN decoder for better speed and accuracy. Moreover, by incorporating all joints from different benchmarks into a unified skeletal representation, we train robust methods that learn from multiple datasets simultaneously and perform well across a range of scenarios -- including pose variations, lighting conditions, and occlusions. Experiments on six benchmarks demonstrate that the proposed methods significantly outperform state-of-the-art methods while improving computational efficiency. EViTPose exhibits a significant decrease in computational complexity (30% to 44% fewer GFLOPs) with a minimal drop in accuracy (0% to 3.5%), and UniTransPose achieves accuracy improvements ranging from 0.9% to 43.8% across these benchmarks.
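The joint-token-guided patch selection can be pictured at a high level: learnable joint tokens score the image patches, and only the top-k highest-scoring patches are processed further, with k controlling the accuracy/compute trade-off. The sketch below illustrates this general idea under assumed tensor shapes; it is not the authors' EViTPose implementation.

```python
# Minimal sketch of joint-token-guided patch selection (general idea only,
# not the EViTPose implementation): learnable joint tokens attend over patch
# embeddings and only the top-k most "interesting" patches are kept.
import torch

num_patches, dim, num_joints, k = 196, 256, 17, 64
patches = torch.randn(1, num_patches, dim)              # patch embeddings
joint_tokens = torch.nn.Parameter(torch.randn(1, num_joints, dim))

# Attention scores of each joint token over all patches.
scores = torch.softmax(joint_tokens @ patches.transpose(1, 2) / dim ** 0.5, dim=-1)
# Importance of a patch = strongest interest any joint takes in it.
importance = scores.max(dim=1).values                    # shape (1, num_patches)
topk = importance.topk(k, dim=-1).indices                # indices of kept patches
selected = patches.gather(1, topk.unsqueeze(-1).expand(-1, -1, dim))
print(selected.shape)  # torch.Size([1, 64, 256])
```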
Learner and Instructor Needs in AI-Supported Programming Learning Tools: Design Implications for Features and Adaptive Control
Wu, Zihan, Tang, Yicheng, Ericson, Barbara
AI-supported tools can help learners overcome challenges in programming education by providing adaptive assistance. However, existing research often focuses on individual tools rather than deriving broader design recommendations. A key challenge in designing these systems is balancing learner control with system-driven guidance. To explore user preferences for AI-supported programming learning tools, we conducted a participatory design study with 15 undergraduate novice programmers and 10 instructors to gather insights on their desired help features and control preferences, as well as a follow-up survey with 172 introductory programming students. Our qualitative findings show that learners prefer help that is encouraging, incorporates visual aids, and includes peer-related insights, whereas instructors prioritize scaffolding that reflects learners' progress and reinforces best practices. Both groups favor shared control, though learners generally prefer more autonomy, while instructors lean toward greater system guidance to prevent cognitive overload. Additionally, our interviews revealed individual differences in control preferences. Based on our findings, we propose design guidelines for AI-supported programming tools, particularly regarding user-centered help features and adaptive control mechanisms. Our work contributes to the human-centered design of AI-supported learning environments by informing the development of systems that effectively balance autonomy and guidance, enhancing AI-supported educational tools for programming and beyond.
Experiences with Content Development and Assessment Design in the Era of GenAI
Sharma, Aakanksha, Shailendra, Samar, Kadel, Rajan
Generative Artificial Intelligence (GenAI) has the potential to transform higher education by generating human-like content. Advances in GenAI have revolutionised several aspects of education, especially subject and assessment design. In this era, it is crucial to design assessments that challenge students and cannot be solved using GenAI tools, which makes it necessary to keep educational content up to date with rapidly evolving technology. Assessment plays a significant role in ensuring student learning, as it encourages students to engage actively, leading to the achievement of learning outcomes. This paper examines how effectively GenAI can design a subject, including lectures, labs, and assessments, using prompts and custom-based training, and aims to give educators direction on leveraging GenAI to create subject content. Additionally, we share our experiential learning to help educators develop content, highlighting the importance of prompts and fine-tuning in ensuring output quality. It has also been observed that expert evaluation is essential for assessing the quality of GenAI-generated materials throughout the content generation process.
MedSimAI: Simulation and Formative Feedback Generation to Enhance Deliberate Practice in Medical Education
Hicke, Yann, Geathers, Jadon, Rajashekar, Niroop, Chan, Colleen, Jack, Anyanate Gwendolyne, Sewell, Justin, Preston, Mackenzi, Cornes, Susannah, Shung, Dennis, Kizilcec, Rene
Medical education faces challenges in scalability, accessibility, and consistency, particularly in clinical skills training for physician-patient communication. Traditional simulation-based learning, while effective, is resource-intensive, difficult to schedule, and often highly variable in feedback quality. Through a collaboration between AI, learning science, and medical education experts, we co-developed MedSimAI, an AI-powered simulation platform that enables deliberate practice, self-regulated learning (SRL), and automated assessment through interactive patient encounters. Leveraging large language models (LLMs), MedSimAI generates realistic clinical interactions and provides immediate, structured feedback using established medical evaluation frameworks such as the Master Interview Rating Scale (MIRS). In a pilot study with 104 first-year medical students, we examined engagement, conversation patterns, and user perceptions. Students found MedSimAI beneficial for repeated, realistic patient-history practice. Conversation analysis revealed that certain higher-order skills were often overlooked, though students generally performed systematic histories and empathic listening. By integrating unlimited practice opportunities, real-time AI assessment, and SRL principles, MedSimAI addresses key limitations of traditional simulation-based training, making high-quality clinical education more accessible and scalable.
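The structured-feedback step can be pictured as prompting an LLM with a rubric and the encounter transcript. The sketch below is a hypothetical illustration, not the MedSimAI implementation: the rubric items are paraphrased MIRS-style criteria, and call_llm stands in for whatever chat-completion API is actually used.

```python
# Hypothetical sketch of rubric-based feedback generation (not the MedSimAI
# implementation). Rubric items are paraphrased, MIRS-style criteria; call_llm
# is a placeholder for an actual chat-completion API.

RUBRIC = [
    "Elicits the patient's chief concern with open-ended questions",
    "Explores the history of the present illness systematically",
    "Responds to the patient's emotions with empathic statements",
    "Summarizes and checks understanding before closing",
]

def feedback_prompt(transcript: str) -> str:
    items = "\n".join(f"- {item}" for item in RUBRIC)
    return (
        "You are a clinical communication coach. Rate the student on each "
        "rubric item from 1 (poor) to 5 (excellent) and give one concrete "
        "suggestion per item. Reply as a JSON list of "
        '{"item", "score", "suggestion"} objects.\n\n'
        f"Rubric:\n{items}\n\nTranscript:\n{transcript}"
    )

transcript = "Student: What brings you in today? Patient: Chest pain since..."
prompt = feedback_prompt(transcript)
print(prompt[:200])
# feedback = call_llm(prompt)  # hypothetical API call returning JSON text
```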