Mitigating Forgetting in Online Continual Learning with Neuron Calibration
This appendix is organized as follows. Section A presents the detailed dataset statistics and a summary of model properties; the details of each dataset are given in Table 4. Under the online continual setting, the tasks are observed in a fixed order and the data from each task is observed as a (one-pass) stream of samples. The batch size is 10 for all datasets. We do not randomize the order of tasks or optimize the task order.
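This protocol can be summarized as a short sketch (assumed helper names and PyTorch's DataLoader; not the authors' code): tasks are visited in their fixed order and each task's data is consumed exactly once in mini-batches of 10.

```python
from torch.utils.data import DataLoader

BATCH_SIZE = 10  # batch size used for all datasets

def train_online(model, tasks, update_fn):
    """tasks: datasets in their fixed order; update_fn: one optimizer step on a mini-batch."""
    for task_id, dataset in enumerate(tasks):  # fixed, unoptimized task order
        stream = DataLoader(dataset, batch_size=BATCH_SIZE, shuffle=False)
        for x, y in stream:                    # single pass over the task's data stream
            update_fn(model, x, y, task_id)
```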
Curriculum Learning by Dynamic Instance Hardness
Tianyi Zhou, Jeff A. Bilmes
A good teacher can adjust a curriculum based on students' learning history. By analogy, in this paper, we study the dynamics of a deep neural network's (DNN) performance on individual samples during its learning process. The observed properties allow us to develop an adaptive curriculum that leads to faster learning of more accurate models. We introduce dynamic instance hardness (DIH), the exponential moving average of a sample's instantaneous hardness (e.g., a loss, or a change in output) over the training history. A low DIH indicates that a model retains knowledge about a sample over time.
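A minimal sketch of the DIH bookkeeping implied by this definition (the decay rate gamma and the per-sample array are assumptions; the authors' implementation may differ):

```python
import numpy as np

class DIHTracker:
    """Track dynamic instance hardness as an exponential moving average of per-sample loss."""

    def __init__(self, num_samples, gamma=0.9):
        self.gamma = gamma
        self.dih = np.zeros(num_samples)

    def update(self, indices, losses):
        # Instantaneous hardness of each sample at this step (e.g., its loss).
        losses = np.asarray(losses, dtype=float)
        self.dih[indices] = self.gamma * self.dih[indices] + (1.0 - self.gamma) * losses
        return self.dih[indices]  # low DIH: the model has retained these samples over time
```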
Neev Parikh, Omer Gottesman, George Konidaris (Brown University)
A fundamental assumption of reinforcement learning in Markov decision processes (MDPs) is that the relevant decision process is, in fact, Markov. However, when MDPs have rich observations, agents typically learn by way of an abstract state representation, and such representations are not guaranteed to preserve the Markov property. We introduce a novel set of conditions and prove that they are sufficient for learning a Markov abstract state representation. We then describe a practical training procedure that combines inverse model estimation and temporal contrastive learning to learn an abstraction that approximately satisfies these conditions. Our novel training objective is compatible with both online and offline training: it does not require a reward signal, but agents can capitalize on reward information when available. We empirically evaluate our approach on a visual gridworld domain and a set of continuous control benchmarks. Our approach learns representations that capture the underlying structure of the domain and lead to improved sample efficiency over state-of-the-art deep reinforcement learning with visual features, often matching or exceeding the performance achieved with hand-designed compact state information.
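As a rough illustration of how an inverse-model term and a temporal contrastive term could be combined on a learned abstraction (phi, inverse_head, contrast_head, and the weight alpha are illustrative assumptions, not the paper's exact objective):

```python
import torch
import torch.nn.functional as F

def abstraction_loss(phi, inverse_head, contrast_head, obs, next_obs, actions, alpha=1.0):
    z, z_next = phi(obs), phi(next_obs)

    # Inverse model: predict the action connecting consecutive abstract states.
    action_logits = inverse_head(torch.cat([z, z_next], dim=-1))
    inverse_loss = F.cross_entropy(action_logits, actions)

    # Temporal contrastive term: real (z, z_next) pairs vs. shuffled negatives.
    neg = z_next[torch.randperm(z_next.size(0))]
    pos_logit = contrast_head(torch.cat([z, z_next], dim=-1))
    neg_logit = contrast_head(torch.cat([z, neg], dim=-1))
    logits = torch.cat([pos_logit, neg_logit], dim=0).squeeze(-1)
    labels = torch.cat([torch.ones(z.size(0)), torch.zeros(z.size(0))]).to(logits.device)
    contrastive_loss = F.binary_cross_entropy_with_logits(logits, labels)

    return inverse_loss + alpha * contrastive_loss
```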
Equivariant Networks for Crystal Structures
Supervised learning with deep models has tremendous potential for applications in materials science. Recently, graph neural networks have been used in this context, drawing direct inspiration from models for molecules. However, materials typically have much more structure than molecules, a feature that these models do not leverage. In this work, we introduce a class of models that are equivariant with respect to crystalline symmetry groups. We do this by defining a generalization of the message passing operations that can be used with more general permutation groups, or that can alternatively be seen as defining an expressive convolution operation on the crystal graph. Empirically, these models achieve results competitive with the state of the art on property prediction tasks.
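For illustration only, one generic way to obtain equivariance to a finite symmetry group is to symmetrize a message passing step over the group's node permutations; the paper's generalized message passing is more expressive than this sketch, and all names below are assumptions.

```python
import torch

def symmetrized_message_passing(node_feats, adjacency, group_perms, message_fn):
    """Average the aggregation over a finite group of node permutations.

    group_perms: index permutations (LongTensors), each a graph automorphism
    induced by a crystal symmetry, so `adjacency` is left unchanged.
    message_fn: a row-wise map on node features (e.g., torch.nn.Linear).
    """
    out = 0.0
    for perm in group_perms:
        inv = torch.argsort(perm)                       # inverse permutation
        transformed = node_feats[perm]                  # act on the nodes
        messages = adjacency @ message_fn(transformed)  # aggregate neighbor messages
        out = out + messages[inv]                       # map back before averaging
    return out / len(group_perms)
```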
Curriculum learning for multilevel budgeted combinatorial problems
Learning heuristics for combinatorial optimization problems through graph neural networks has recently shown promising results on some classic NP-hard problems. These are single-level optimization problems with only one player. Multilevel combinatorial optimization problems are their generalization, encompassing situations with multiple players taking decisions sequentially. By framing them in a multi-agent reinforcement learning setting, we devise a value-based method to learn to solve multilevel budgeted combinatorial problems involving two players in a zero-sum game over a graph. Our framework is based on a simple curriculum: if an agent knows how to estimate the value of instances with budgets up to B, then solving instances with budget B + 1 can be done in polynomial time, regardless of the direction of the optimization, by checking the value of every possible afterstate. Thus, in a bottom-up approach, we generate datasets of heuristically solved instances with increasingly larger budgets to train our agent.
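A minimal sketch of the budget-(B + 1) step described above (instance.afterstates() and value_fn are assumed interfaces, not the authors' code):

```python
def value_budget_plus_one(instance, value_fn, maximizing_player):
    """Value of a budget-(B + 1) instance from an estimator already trained for budgets <= B."""
    afterstate_values = [
        value_fn(afterstate)                      # each afterstate has budget <= B
        for afterstate in instance.afterstates()  # every way the current player can act
    ]
    # Take the best afterstate for whichever player moves at this level.
    return max(afterstate_values) if maximizing_player else min(afterstate_values)
```

In the bottom-up curriculum, instances labeled this way form the training set for the next budget level.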
Online Imitation Learning for Manipulation via Decaying Relative Correction through Teleoperation
Pan, Cheng, Cheng, Hung Hon, Hughes, Josie
Teleoperated robotic manipulators enable the collection of demonstration data, which can be used to train control policies through imitation learning. However, such methods can require significant amounts of training data to develop robust policies or to adapt them to new and unseen tasks. While expert feedback can significantly enhance policy performance, providing continuous feedback can be cognitively demanding and time-consuming for experts. To address this challenge, we propose a cable-driven teleoperation system that can provide spatial corrections with 6 degrees of freedom to the trajectories generated by a policy model. Specifically, we propose a correction method termed Decaying Relative Correction (DRC), which is based on the spatial offset vector provided by the expert and persists only temporarily, thereby reducing the number of intervention steps required of an expert. Our results demonstrate that DRC reduces the required expert intervention rate by 30% compared to a standard absolute corrective method. Furthermore, we show that integrating DRC within an online imitation learning framework rapidly increases the success rate of manipulation tasks such as raspberry harvesting and cloth wiping.
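A simplified sketch of a decaying relative correction in this spirit (the exponential decay schedule and the additive pose composition are assumptions; orientation offsets would in practice be composed as rotations):

```python
import numpy as np

class DecayingRelativeCorrection:
    """Apply an expert's 6-DoF offset to the policy's commanded pose and let it fade."""

    def __init__(self, decay=0.9):
        self.decay = decay
        self.offset = np.zeros(6)  # [dx, dy, dz, droll, dpitch, dyaw]

    def set_correction(self, expert_offset):
        # Relative offset supplied by the expert through the teleoperation device.
        self.offset = np.asarray(expert_offset, dtype=float)

    def step(self, policy_pose):
        corrected = np.asarray(policy_pose, dtype=float) + self.offset
        self.offset *= self.decay  # the correction exists only temporarily
        return corrected
```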
VEGGIE: Instructional Editing and Reasoning of Video Concepts with Grounded Generation
Yu, Shoubin, Liu, Difan, Ma, Ziqiao, Hong, Yicong, Zhou, Yang, Tan, Hao, Chai, Joyce, Bansal, Mohit
Recent video diffusion models have enhanced video editing, but it remains challenging to handle instructional editing and diverse tasks (e.g., adding, removing, changing) within a unified framework. In this paper, we introduce VEGGIE, a Video Editor with Grounded Generation from Instructions, a simple end-to-end framework that unifies video concept editing, grounding, and reasoning based on diverse user instructions. Specifically, given a video and text query, VEGGIE first utilizes an MLLM to interpret user intentions in instructions and ground them to the video contexts, generating frame-specific grounded task queries for pixel-space responses. A diffusion model then renders these plans and generates edited videos that align with user intent. To support diverse tasks and complex instructions, we employ a curriculum learning strategy: first aligning the MLLM and video diffusion model with large-scale instructional image editing data, followed by end-to-end fine-tuning on high-quality multitask video data. Additionally, we introduce a novel data synthesis pipeline to generate paired instructional video editing data for model training. It transforms static image data into diverse, high-quality video editing samples by leveraging Image-to-Video models to inject dynamics. VEGGIE shows strong performance in instructional video editing with different editing skills, outperforming the best instructional baseline as a versatile model, while other models struggle with multi-tasking. VEGGIE also excels in video object grounding and reasoning segmentation, where other baselines fail. We further reveal how the multiple tasks help each other and highlight promising applications like zero-shot multimodal instructional and in-context video editing.
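A schematic sketch of the two-stage flow described above (mllm.ground and diffusion_model.generate are hypothetical interfaces used only for illustration):

```python
def edit_video(video_frames, instruction, mllm, diffusion_model):
    """Ground the instruction in the video, then render the edit."""
    # 1) The MLLM interprets the user intent and grounds it to the video context,
    #    producing frame-specific grounded task queries.
    grounded_queries = mllm.ground(instruction, video_frames)
    # 2) The diffusion model renders these plans into edited frames aligned with the intent.
    return diffusion_model.generate(video_frames, grounded_queries)
```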