AITopics | Instructional Material

Collaborating Authors

Instructional Material

Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos

Neural Information Processing SystemsOct-11-2024, 07:55:50 GMT

We introduce the task of spatially localizing narrated interactions in videos. Key to our approach is the ability to learn to spatially localize interactions with self-supervision on a large corpus of videos with accompanying transcribed narrations. To achieve this goal, we propose a multilayer cross-modal attention network that enables effective optimization of a contrastive loss during training. We introduce a divided strategy that alternates between computing inter- and intra-modal attention across the visual and natural language modalities, which allows effective training via directly contrasting the two modalities' representations. We demonstrate the effectiveness of our approach by self-training on the HowTo100M instructional video dataset and evaluating on a newly collected dataset of localized described interactions in the YouCook2 dataset.

instructional video, interaction, self-supervised spatial grounding, (3 more...)

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.65)

Industry:

Education > Educational Technology > Media (0.65)
Education > Educational Technology > Audio & Video (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.43)

Add feedback

Gradient based sample selection for online continual learning

Neural Information Processing SystemsOct-11-2024, 03:48:57 GMT

A continual learning agent learns online with a non-stationary and never-ending stream of data. The key to such learning process is to overcome the catastrophic forgetting of previously seen data, which is a well known problem of neural networks. To prevent forgetting, a replay buffer is usually employed to store the previous data for the purpose of rehearsal. Previous work often depend on task boundary and i.i.d. In this work, we formulate sample selection as a constraint reduction problem based on the constrained optimization view of continual learning.

continual learning, online continual learning, sample selection, (4 more...)

Neural Information Processing Systems

Genre: Instructional Material > Online (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.81)

Add feedback

Accelerated Training via Incrementally Growing Neural Networks using Variance Transfer and Learning Rate Adaptation

Neural Information Processing SystemsOct-11-2024, 03:47:54 GMT

We develop an approach to efficiently grow neural networks, within which parameterization and optimization strategies are designed by considering their effects on the training dynamics. Unlike existing growing methods, which follow simple replication heuristics or utilize auxiliary gradient-based local optimization, we craft a parameterization scheme which dynamically stabilizes weight, activation, and gradient scaling as the architecture evolves, and maintains the inference functionality of the network. To address the optimization difficulty resulting from imbalanced training effort distributed to subnetworks fading in at different growth phases, we propose a learning rate adaption mechanism that rebalances the gradient contribution of these separate subcomponents. Experiments show that our method achieves comparable or better accuracy than training large fixed-size models, while saving a substantial portion of the original training computation budget. We demonstrate that these gains translate into real wall-clock training speedups.

accelerated training, transfer and learning rate adaptation, variance transfer, (1 more...)

Neural Information Processing Systems

Genre: Instructional Material (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.66)

Add feedback

Calibrating CNNs for Lifelong Learning

Neural Information Processing SystemsOct-11-2024, 03:47:16 GMT

We present an approach for lifelong/continual learning of convolutional neural networks (CNN) that does not suffer from the problem of catastrophic forgetting when moving from one task to the other. We show that the activation maps generated by the CNN trained on the old task can be calibrated using very few calibration parameters, to become relevant to the new task. Based on this, we calibrate the activation maps produced by each network layer using spatial and channel-wise calibration modules and train only these calibration parameters for each new task in order to perform lifelong learning. Our calibration modules introduce significantly less computation and parameters as compared to the approaches that dynamically expand the network. Our approach is immune to catastrophic forgetting since we store the task-adaptive calibration parameters, which contain all the task-specific knowledge and is exclusive to each task.

calibrating cnn, calibration parameter, lifelong learning, (7 more...)

Neural Information Processing Systems

Genre: Instructional Material (0.64)

Industry: Education > Educational Setting > Continuing Education (0.64)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.61)

Add feedback

Online-Within-Online Meta-Learning

Neural Information Processing SystemsOct-11-2024, 02:57:44 GMT

We study the problem of learning a series of tasks in a fully online Meta-Learning setting. The goal is to exploit similarities among the tasks to incrementally adapt an inner online algorithm in order to incur a low averaged cumulative error over the tasks. We focus on a family of inner algorithms based on a parametrized variant of online Mirror Descent. The inner algorithm is incrementally adapted by an online Mirror Descent meta-algorithm using the corresponding within-task minimum regularized empirical risk as the meta-loss. In order to keep the process fully online, we approximate the meta-subgradients by the online inner algorithm.

inner algorithm, online mirror descent, online-within-online meta-learning, (1 more...)

Neural Information Processing Systems

Genre: Instructional Material > Online (0.65)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

COBE: Contextualized Object Embeddings from Narrated Instructional Video

Neural Information Processing SystemsOct-11-2024, 02:36:09 GMT

Many objects in the real world undergo dramatic variations in visual appearance. For example, a tomato may be red or green, sliced or chopped, fresh or fried, liquid or solid. Training a single detector to accurately recognize tomatoes in all these different states is challenging. On the other hand, contextual cues (e.g., the presence of a knife, a cutting board, a strainer or a pan) are often strongly indicative of how the object appears in the scene. Recognizing such contextual cues is useful not only to improve the accuracy of object detection or to determine the state of the object, but also to understand its functional properties and to infer ongoing or upcoming human-object interactions.

cobe, contextualized object embedding, narrated instructional video, (4 more...)

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.44)

Industry:

Education > Educational Technology > Media (0.44)
Education > Educational Technology > Audio & Video (0.44)

Technology: Information Technology > Artificial Intelligence (0.79)

Add feedback

Unsupervised Curricula for Visual Meta-Reinforcement Learning

Neural Information Processing SystemsOct-11-2024, 00:55:11 GMT

In principle, meta-reinforcement learning algorithms leverage experience across many tasks to learn fast and effective reinforcement learning (RL) strategies. However, current meta-RL approaches rely on manually-defined distributions of training tasks, and hand-crafting these task distributions can be challenging and time-consuming. Can useful'' pre-training tasks be discovered in an unsupervised manner? We develop an unsupervised algorithm for inducing an adaptive meta-training task distribution, i.e. an automatic curriculum, by modeling unsupervised interaction in a visual environment. The task distribution is scaffolded by a parametric density model of the meta-learner's trajectory distribution.

task distribution, unsupervised curriculum, visual meta-reinforcement learning

Neural Information Processing Systems

Genre: Instructional Material > Course Syllabus & Notes (0.40)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

A Systematic Assessment of OpenAI o1-Preview for Higher Order Thinking in Education

Latif, Ehsan, Zhou, Yifan, Guo, Shuchen, Gao, Yizhu, Shi, Lehong, Nayaaba, Matthew, Lee, Gyeonggeon, Zhang, Liang, Bewersdorff, Arne, Fang, Luyang, Yang, Xiantong, Zhao, Huaqin, Jiang, Hanqi, Lu, Haoran, Li, Jiaxi, Yu, Jichao, You, Weihang, Liu, Zhengliang, Liu, Vincent Shung, Wang, Hui, Wu, Zihao, Lu, Jin, Dou, Fei, Ma, Ping, Liu, Ninghao, Liu, Tianming, Zhai, Xiaoming

arXiv.org Artificial IntelligenceOct-11-2024

As artificial intelligence (AI) continues to advance, it demonstrates capabilities comparable to human intelligence, with significant potential to transform education and workforce development. This study evaluates OpenAI o1-preview's ability to perform higher-order cognitive tasks across 14 dimensions, including critical thinking, systems thinking, computational thinking, design thinking, metacognition, data literacy, creative thinking, abstract reasoning, quantitative reasoning, logical reasoning, analogical reasoning, and scientific reasoning. We used validated instruments like the Ennis-Weir Critical Thinking Essay Test and the Biological Systems Thinking Test to compare the o1-preview's performance with human performance systematically. Our findings reveal that o1-preview outperforms humans in most categories, achieving 150% better results in systems thinking, computational thinking, data literacy, creative thinking, scientific reasoning, and abstract reasoning. However, compared to humans, it underperforms by around 25% in logical reasoning, critical thinking, and quantitative reasoning. In analogical reasoning, both o1-preview and humans achieved perfect scores. Despite these strengths, the o1-preview shows limitations in abstract reasoning, where human psychology students outperform it, highlighting the continued importance of human oversight in tasks requiring high-level abstraction. These results have significant educational implications, suggesting a shift toward developing human skills that complement AI, such as creativity, abstract reasoning, and critical thinking. This study emphasizes the transformative potential of AI in education and calls for a recalibration of educational goals, teaching methods, and curricula to align with an AI-driven world.

large language model, machine learning, natural language, (24 more...)

arXiv.org Artificial Intelligence

2410.21287

Country:

Europe (1.00)
Asia > China (1.00)
North America > Canada > Alberta (0.45)
North America > United States > Arizona (0.27)

Genre:

Workflow (1.00)
Research Report > New Finding (1.00)
Research Report > Experimental Study (1.00)
(4 more...)

Industry:

Leisure & Entertainment (1.00)
Health & Medicine > Therapeutic Area > Psychiatry/Psychology (1.00)
Health & Medicine > Therapeutic Area > Neurology (1.00)
(22 more...)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Expert Systems (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
(3 more...)

Add feedback

One Step at a Time: Combining LLMs and Static Analysis to Generate Next-Step Hints for Programming Tasks

Birillo, Anastasiia, Artser, Elizaveta, Potriasaeva, Anna, Vlasov, Ilya, Dzialets, Katsiaryna, Golubev, Yaroslav, Gerasimov, Igor, Keuning, Hieke, Bryksin, Timofey

arXiv.org Artificial IntelligenceOct-11-2024

Students often struggle with solving programming problems when learning to code, especially when they have to do it online, with one of the most common disadvantages of working online being the lack of personalized help. This help can be provided as next-step hint generation, i.e., showing a student what specific small step they need to do next to get to the correct solution. There are many ways to generate such hints, with large language models (LLMs) being among the most actively studied right now. While LLMs constitute a promising technology for providing personalized help, combining them with other techniques, such as static analysis, can significantly improve the output quality. In this work, we utilize this idea and propose a novel system to provide both textual and code hints for programming tasks. The pipeline of the proposed approach uses a chain-of-thought prompting technique and consists of three distinct steps: (1) generating subgoals - a list of actions to proceed with the task from the current student's solution, (2) generating the code to achieve the next subgoal, and (3) generating the text to describe this needed action. During the second step, we apply static analysis to the generated code to control its size and quality. The tool is implemented as a modification to the open-source JetBrains Academy plugin, supporting students in their in-IDE courses. To evaluate our approach, we propose a list of criteria for all steps in our pipeline and conduct two rounds of expert validation. Finally, we evaluate the next-step hints in a classroom with 14 students from two universities. Our results show that both forms of the hints - textual and code - were helpful for the students, and the proposed system helped them to proceed with the coding tasks.

large language model, machine learning, natural language, (17 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3699538.3699556

2410.09268

Country:

North America > United States > District of Columbia > Washington (0.05)
Europe > Serbia > Central Serbia > Belgrade (0.04)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
(4 more...)

Genre:

Workflow (1.00)
Research Report > New Finding (0.68)
Instructional Material > Course Syllabus & Notes (0.68)

Industry:

Education > Educational Setting > Online (1.00)
Education > Educational Technology > Educational Software > Computer Based Training (0.68)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.47)

Add feedback

Utilizing ChatGPT in a Data Structures and Algorithms Course: A Teaching Assistant's Perspective

Jamie, Pooriya, Hajihashemi, Reyhaneh, Alipour, Sharareh

arXiv.org Artificial IntelligenceOct-11-2024

ChatGPT, exploring how structured prompts and active TA support can enhance educational outcomes. Our work also integrates Integrating large language models (LLMs) like ChatGPT is revolutionizing two versions of ChatGPT--ChatGPT-4o and ChatGPT o1--each contributing the field of computer science education. These models uniquely: ChatGPT-4o supports routine educational tasks, offer new possibilities for enriching student learning and supporting while ChatGPT o1 enhances complex problem-solving through improved teaching assistants (TAs) in providing prompt feedback and reasoning, thereby addressing the limitations of using LLMs supplementary learning resources. This research delves into the use independently. of ChatGPT in a data structures and algorithms (DSA) course, particularly when combined with TA supervision. The findings demonstrate that incorporating ChatGPT with structured prompts and RQ1: How does ChatGPT impact students' learning and active TA guidance enhances students' understanding of intricate exam readiness when supervised by TAs? algorithmic concepts, boosts engagement, and elevates academic performance. However, challenges exist in addressing academic RQ2: What challenges arise in utilizing ChatGPT to answer integrity and the limitations of LLMs in tackling complex problems.

large language model, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2410.08899

Country:

North America > United States > New York > New York County > New York City (0.06)
Asia > Middle East > Iran > Tehran Province > Tehran (0.05)
North America > Canada > Ontario > Toronto (0.04)
(4 more...)

Genre:

Research Report > New Finding (1.00)
Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education > Educational Setting > Higher Education (0.86)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback