AITopics | Instructional Material

Collaborating Authors

Instructional Material

Insights from Social Shaping Theory: The Appropriation of Large Language Models in an Undergraduate Programming Course

Padiyath, Aadarsh, Hou, Xinying, Pang, Amy, Vargas, Diego Viramontes, Gu, Xingjian, Nelson-Fromm, Tamara, Wu, Zihan, Guzdial, Mark, Ericson, Barbara

arXiv.org Artificial IntelligenceJun-10-2024

The capability of large language models (LLMs) to generate, debug, and explain code has sparked the interest of researchers and educators in undergraduate programming, with many anticipating their transformative potential in programming education. However, decisions about why and how to use LLMs in programming education may involve more than just the assessment of an LLM's technical capabilities. Using the social shaping of technology theory as a guiding framework, our study explores how students' social perceptions influence their own LLM usage. We then examine the correlation of self-reported LLM usage with students' self-efficacy and midterm performances in an undergraduate programming course. Triangulating data from an anonymous end-of-course student survey (n = 158), a mid-course self-efficacy survey (n=158), student interviews (n = 10), self-reported LLM usage on homework, and midterm performances, we discovered that students' use of LLMs was associated with their expectations for their future careers and their perceptions of peer usage. Additionally, early self-reported LLM usage in our context correlated with lower self-efficacy and lower midterm scores, while students' perceived over-reliance on LLMs, rather than their usage itself, correlated with decreased self-efficacy later in the course.

llm, perception, student, (12 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3632620.3671098

2406.06451

Country:

North America > United States > New York > New York County > New York City (0.14)
North America > Canada > Ontario > Toronto (0.14)
Oceania > Australia > Victoria > Melbourne (0.05)
(7 more...)

Genre:

Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
Questionnaire & Opinion Survey (1.00)
(2 more...)

Industry:

Education > Educational Setting > Online (1.00)
Education > Curriculum (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.96)

Add feedback

Investigating Pre-Training Objectives for Generalization in Vision-Based Reinforcement Learning

Kim, Donghu, Lee, Hojoon, Lee, Kyungmin, Hwang, Dongyoon, Choo, Jaegul

arXiv.org Artificial IntelligenceJun-10-2024

Recently, various pre-training methods have been introduced in vision-based Reinforcement Learning (RL). However, their generalization ability remains unclear due to evaluations being limited to in-distribution environments and non-unified experimental setups. To address this, we introduce the Atari Pre-training Benchmark (Atari-PB), which pre-trains a ResNet-50 model on 10 million transitions from 50 Atari games and evaluates it across diverse environment distributions. Our experiments show that pre-training objectives focused on learning task-agnostic features (e.g., identifying objects and understanding temporal dynamics) enhance generalization across different environments. In contrast, objectives focused on learning task-specific knowledge (e.g., identifying agents and fitting reward functions) improve performance in environments similar to the pre-training dataset but not in varied ones. We publicize our codes, datasets, and model checkpoints at https://github.com/dojeon-ai/Atari-PB.

dataset, generalization, pre-training objective, (10 more...)

arXiv.org Artificial Intelligence

2406.06037

Genre:

Research Report > New Finding (1.00)
Instructional Material (0.85)

Industry: Leisure & Entertainment > Games > Computer Games (0.86)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.67)

Add feedback

Numerical solution of a PDE arising from prediction with expert advice

Calder, Jeff, Drenska, Nadejda, Mosaphir, Drisana

arXiv.org Artificial IntelligenceJun-9-2024

This work investigates the online machine learning problem of prediction with expert advice in an adversarial setting through numerical analysis of, and experiments with, a related partial differential equation. The problem is a repeated two-person game involving decision-making at each step informed by $n$ experts in an adversarial environment. The continuum limit of this game over a large number of steps is a degenerate elliptic equation whose solution encodes the optimal strategies for both players. We develop numerical methods for approximating the solution of this equation in relatively high dimensions ($n\leq 10$) by exploiting symmetries in the equation and the solution to drastically reduce the size of the computational domain. Based on our numerical results we make a number of conjectures about the optimality of various adversarial strategies, in particular about the non-optimality of the COMB strategy.

equation, prediction, viscosity solution, (14 more...)

arXiv.org Artificial Intelligence

2406.05754

Country:

North America > United States > Minnesota (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Louisiana > East Baton Rouge Parish > Baton Rouge (0.04)
(2 more...)

Genre:

Research Report (0.50)
Instructional Material > Online (0.34)

Industry:

Education (0.66)
Leisure & Entertainment > Games (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (1.00)
Information Technology > Mathematics of Computing (0.86)

Add feedback

Aligning Human Knowledge with Visual Concepts Towards Explainable Medical Image Classification

Gao, Yunhe, Gu, Difei, Zhou, Mu, Metaxas, Dimitris

arXiv.org Artificial IntelligenceJun-8-2024

Although explainability is essential in the clinical diagnosis, most deep learning models still function as black boxes without elucidating their decision-making process. In this study, we investigate the explainable model development that can mimic the decision-making process of human experts by fusing the domain knowledge of explicit diagnostic criteria. We introduce a simple yet effective framework, Explicd, towards Explainable language-informed criteria-based diagnosis. Explicd initiates its process by querying domain knowledge from either large language models (LLMs) or human experts to establish diagnostic criteria across various concept axes (e.g., color, shape, texture, or specific patterns of diseases). By leveraging a pretrained vision-language model, Explicd injects these criteria into the embedding space as knowledge anchors, thereby facilitating the learning of corresponding visual concepts within medical images. The final diagnostic outcome is determined based on the similarity scores between the encoded visual concepts and the textual criteria embeddings. Through extensive evaluation of five medical image classification benchmarks, Explicd has demonstrated its inherent explainability and extends to improve classification performance compared to traditional black-box models.

criteria, criteria axis, diagnostic criteria, (14 more...)

arXiv.org Artificial Intelligence

2406.05596

Genre:

Research Report (1.00)
Instructional Material > Course Syllabus & Notes (0.72)
Instructional Material > Online (0.62)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (1.00)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
(2 more...)

Add feedback

A Knowledge-Component-Based Methodology for Evaluating AI Assistants

Qi, Laryn, Zamfirescu-Pereira, J. D., Kim, Taehan, Hartmann, Björn, DeNero, John, Norouzi, Narges

arXiv.org Artificial IntelligenceJun-8-2024

We evaluate an automatic hint generator for CS1 programming assignments powered by GPT-4, a large language model. This system provides natural language guidance about how students can improve their incorrect solutions to short programming exercises. A hint can be requested each time a student fails a test case. Our evaluation addresses three Research Questions: RQ1: Do the hints help students improve their code? RQ2: How effectively do the hints capture problems in student code? RQ3: Are the issues that students resolve the same as the issues addressed in the hints? To address these research questions quantitatively, we identified a set of fine-grained knowledge components and determined which ones apply to each exercise, incorrect solution, and generated hint. Comparing data from two large CS1 offerings, we found that access to the hints helps students to address problems with their code more quickly, that hints are able to consistently capture the most pressing errors in students' code, and that hints that address a few issues at once rather than a single bug are more likely to lead to direct student progress.

knowledge-component-based methodology, student, submission, (12 more...)

arXiv.org Artificial Intelligence

2406.05603

Country:

North America > United States > New York > New York County > New York City (0.05)
North America > United States > Oregon > Multnomah County > Portland (0.05)
North America > United States > Rhode Island > Providence County > Providence (0.04)
(5 more...)

Genre:

Research Report > New Finding (0.54)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Education (1.00)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.95)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.89)

Add feedback

Refining Minimax Regret for Unsupervised Environment Design

Beukman, Michael, Coward, Samuel, Matthews, Michael, Fellows, Mattie, Jiang, Minqi, Dennis, Michael, Foerster, Jakob

arXiv.org Artificial IntelligenceJun-8-2024

In unsupervised environment design, reinforcement learning agents are trained on environment configurations (levels) generated by an adversary that maximises some objective. Regret is a commonly used objective that theoretically results in a minimax regret (MMR) policy with desirable robustness guarantees; in particular, the agent's maximum regret is bounded. However, once the agent reaches this regret bound on all levels, the adversary will only sample levels where regret cannot be further reduced. Although there are possible performance improvements to be made outside of these regret-maximising levels, learning stagnates. In this work, we introduce Bayesian level-perfect MMR (BLP), a refinement of the minimax regret objective that overcomes this limitation. We formally show that solving for this objective results in a subset of MMR policies, and that BLP policies act consistently with a Perfect Bayesian policy over all levels. We further introduce an algorithm, ReMiDi, that results in a BLP policy at convergence. We empirically demonstrate that training on levels from a minimax regret adversary causes learning to prematurely stagnate, but that ReMiDi continues learning.

adversary, agent, minimax regret, (14 more...)

arXiv.org Artificial Intelligence

2402.12284

Country:

Europe > United Kingdom > England > Oxfordshire > Oxford (0.14)
Europe > Austria > Vienna (0.14)
North America > United States > Louisiana > Orleans Parish > New Orleans (0.04)
(6 more...)

Genre:

Research Report (0.50)
Instructional Material (0.46)

Industry: Education > Educational Setting > Continuing Education (0.68)

Technology:

Information Technology > Game Theory (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
(2 more...)

Add feedback

Online Continual Learning of Video Diffusion Models From a Single Video Stream

Yoo, Jason, Green, Dylan, Pleiss, Geoff, Wood, Frank

arXiv.org Artificial IntelligenceJun-7-2024

Diffusion models have shown exceptional capabilities in generating realistic videos. Yet, their training has been predominantly confined to offline environments where models can repeatedly train on i.i.d. data to convergence. This work explores the feasibility of training diffusion models from a semantically continuous video stream, where correlated video frames sequentially arrive one at a time. To investigate this, we introduce two novel continual video generative modeling benchmarks, Lifelong Bouncing Balls and Windows 95 Maze Screensaver, each containing over a million video frames generated from navigating stationary environments. Surprisingly, our experiments show that diffusion models can be effectively trained online using experience replay, achieving performance comparable to models trained with i.i.d. samples given the same number of gradient steps.

artificial intelligence, machine learning, windows 95, (12 more...)

arXiv.org Artificial Intelligence

2406.04814

Country: North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Genre:

Instructional Material > Online (0.42)
Research Report > New Finding (0.30)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Comprehensive AI Assessment Framework: Enhancing Educational Evaluation with Ethical AI Integration

Kılınç, Selçuk

arXiv.org Artificial IntelligenceJun-7-2024

The integration of generative artificial intelligence (GenAI) tools into education has been a game-changer for teaching and assessment practices, bringing new opportunities, but also novel challenges which need to be dealt with. This paper presents the Comprehensive AI Assessment Framework (CAIAF), an evolved version of the AI Assessment Scale (AIAS) by Perkins, Furze, Roe, and MacVaugh, targeted toward the ethical integration of AI into educational assessments. This is where the CAIAF differs, as it incorporates stringent ethical guidelines, with clear distinctions based on educational levels, and advanced AI capabilities of real-time interactions and personalized assistance. The framework developed herein has a very intuitive use, mainly through the use of a color gradient that enhances the user-friendliness of the framework. Methodologically, the framework has been developed through the huge support of a thorough literature review and practical insight into the topic, becoming a dynamic tool to be used in different educational settings. The framework will ensure better learning outcomes, uphold academic integrity, and promote responsible use of AI, hence the need for this framework in modern educational practice.

genai tool, integration, student, (11 more...)

arXiv.org Artificial Intelligence

2407.16887

Country:

Asia > Indonesia > Sumatra > Bengkulu > Bengkulu (0.04)
Asia > India > NCT > Delhi (0.04)
South America > Uruguay > Maldonado > Maldonado (0.04)
(2 more...)

Genre:

Instructional Material (1.00)
Research Report > Experimental Study (0.88)

Industry:

Education > Educational Setting > K-12 Education (1.00)
Education > Educational Setting > Higher Education (1.00)
Information Technology > Security & Privacy (0.93)
Education > Assessment & Standards (0.88)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Issues > Social & Ethical Issues (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.49)

Add feedback

Lean Workbook: A large-scale Lean problem set formalized from natural language math problems

Ying, Huaiyuan, Wu, Zijian, Geng, Yihan, Wang, Jiayu, Lin, Dahua, Chen, Kai

arXiv.org Artificial IntelligenceJun-7-2024

Large language models have demonstrated impressive capabilities across various natural language processing tasks, especially in solving mathematical problems. However, large language models are not good at math theorem proving using formal languages like Lean. A significant challenge in this area is the scarcity of training data available in these formal languages. To address this issue, we propose a novel pipeline that iteratively generates and filters synthetic data to translate natural language mathematical problems into Lean 4 statements, and vice versa. Our results indicate that the synthetic data pipeline can provide useful training data and improve the performance of LLMs in translating and understanding complex mathematical problems and proofs. Our final dataset contains about 57K formal-informal question pairs along with searched proof from the math contest forum and 21 new IMO questions.

dataset, language model, natural language problem, (13 more...)

arXiv.org Artificial Intelligence

2406.03847

Country: Asia > China > Shanghai > Shanghai (0.04)

Genre:

Instructional Material > Course Syllabus & Notes (0.50)
Research Report > New Finding (0.34)

Industry: Education (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Position: Embracing Negative Results in Machine Learning

Karl, Florian, Kemeter, Lukas Malte, Dax, Gabriel, Sierak, Paulina

arXiv.org Artificial IntelligenceJun-6-2024

Publications proposing novel machine learning methods are often primarily rated by exhibited predictive performance on selected problems. In this position paper we argue that predictive performance alone is not a good indicator for the worth of a publication. Using it as such even fosters problems like inefficiencies of the machine learning research community as a whole and setting wrong incentives for researchers. We therefore put out a call for the publication of "negative" results, which can help alleviate some of these problems and improve the scientific output of the machine learning research community. To substantiate our position, we present the advantages of publishing negative results and provide concrete measures for the community to move towards a paradigm where their publication is normalized.

embracing negative result, negative result, publication, (13 more...)

arXiv.org Artificial Intelligence

2406.0398

Country:

Europe > Austria > Vienna (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
North America > United States > California > Santa Clara County > Stanford (0.04)
(2 more...)

Genre:

Research Report > Promising Solution (0.68)
Research Report > New Finding (0.68)
Instructional Material > Course Syllabus & Notes (0.46)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback