AITopics | iterative learning

Bayesian Optimization for Iterative Learning

Neural Information Processing SystemsDec-24-2025, 03:29:20 GMT

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate information from earlier training steps. In this paper, we present a Bayesian optimization(BO) approach which exploits the iterative structure of learning algorithms for efficient hyperparameter tuning. We propose to learn an evaluation function compressing learning progress at any stage of the training process into a single numeric score according to both training success and stability. Our BO framework is then trade-off the benefit of assessing a hyperparameter setting over additional training steps against their computation cost. We further increase model efficiency by selectively including scores from different training steps for any evaluated hyperparameter set. We demonstrate the efficiency of our algorithm by tuning hyperparameters for the training of deep reinforcement learning agents and convolutional neural networks. Our algorithm outperforms all existing baselines in identifying optimal hyperparameters in minimal time.

bayesian optimization, hyperparameter, name change, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

Learning to Throw-Flip

Liu, Yang, Da Costa, Bruno, Billard, Aude

arXiv.org Artificial IntelligenceOct-14-2025

Dynamic manipulation, such as robot tossing or throwing objects, has recently gained attention as a novel paradigm to speed up logistic operations. However, the focus has predominantly been on the object's landing location, irrespective of its final orientation. In this work, we present a method enabling a robot to accurately "throw-flip" objects to a desired landing pose (position and orientation). Conventionally, objects thrown by revolute robots suffer from parasitic rotation, resulting in highly restricted and uncontrollable landing poses. Our approach is based on two key design choices: first, leveraging the impulse-momentum principle, we design a family of throwing motions that effectively decouple the parasitic rotation, significantly expanding the feasible set of landing poses. Second, we combine a physics-based model of free flight with regression-based learning methods to account for unmodeled effects. Real robot experiments demonstrate that our framework can learn to throw-flip objects to a pose target within ($\pm$5 cm, $\pm$45 degrees) threshold in dozens of trials. Thanks to data assimilation, incorporating projectile dynamics reduces sample complexity by an average of 40% when throw-flipping to unseen poses compared to end-to-end learning methods. Additionally, we show that past knowledge on in-hand object spinning can be effectively reused, accelerating learning by 70% when throwing a new object with a Center of Mass (CoM) shift. A video summarizing the proposed method and the hardware experiments is available at https://youtu.be/txYc9b1oflU.

artificial intelligence, landing pose, machine learning, (13 more...)

arXiv.org Artificial Intelligence

2510.10357

Genre: Research Report (0.50)

Technology:

Information Technology > Artificial Intelligence > Robots (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Degradation-A ware Unfolding Half-Shuffle Transformer for Spectral Compressive Imaging

Neural Information Processing SystemsAug-19-2025, 20:26:49 GMT

In coded aperture snapshot spectral compressive imaging (CASSI) systems, hyper-spectral image (HSI) reconstruction methods are employed to recover the spatial-spectral signal from a compressed measurement.

artificial intelligence, machine learning, reconstruction, (15 more...)

Neural Information Processing Systems

Country:

Asia > China > Guangdong Province > Shenzhen (0.04)
Europe > Switzerland > Zürich > Zürich (0.04)
Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)

Genre: Research Report (0.46)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Sensing and Signal Processing > Image Processing (1.00)
Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

Review for NeurIPS paper: Bayesian Optimization for Iterative Learning

Neural Information Processing SystemsJan-25-2025, 08:39:19 GMT

The paper proposes an idea for tuning hyper-parameters in deep (reinforcement) learning using Bayesian optimization. The key idea is to exploit the iterative structure of the problem and use a variable-augmentation trick to learn a score function that compresses the learning progress at any stage. The strengths of the paper are: - well written - good relation to prior work - good experimental study However, the paper also has weaknesses, which are mostly related to theoretical aspects and chosen heuristics (see some details below). If we are only interested in the predictive mean for the cost-GP, why do we use a GP in the first place, and not parametric function, which scales much better? That's the one part that caused us the most toothache.

bayesian optimization, condition number, iterative learning, (8 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.57)

Add feedback

TiniScript: A Simplified Language for Educational Robotics

Ramos, Gabriel Gonzalo Guzman, Ramos, Pedro Jesus Guzman

arXiv.org Artificial IntelligenceNov-9-2024

The constructionism theory, formulated by Seymour Papert, has been a transformative approach in education, particularly within STEM (Science, Technology, Engineering, and Mathematics) fields. This theory emphasizes learning through creation, where students engage actively by building knowledge structures through hands-on tasks and meaningful projects. One of the early milestones influenced by constructionism was the development of the Logo programming language. Logo's simple, block-based structure enabled students to grasp fundamental programming concepts visually by manipulating blocks, establishing a foundation for educational tools that remain essential in early computer science education. Over time, educational robotics kits, like those from LEGO Education (RCX, NXT, and EV3), have set standards for integrating physical construction with software programming. These kits demonstrate the potential of robotics in educational settings by engaging students in both mechanical assembly and logical problem-solving, thereby fostering an understanding of hardware and software as interconnected aspects of robotics. Building on this foundation, programming environments in educational robotics have largely adopted block-based interfaces. These environments simplify coding for beginners, allowing students to create programs by connecting blocks representing specific actions. Once completed, the program is uploaded to a microcontroller, enabling the robot to execute the instructions.

artificial intelligence, robot, tiniscript, (17 more...)

arXiv.org Artificial Intelligence

2411.06303

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.05)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre:

Research Report > Promising Solution (0.34)
Overview > Innovation (0.34)

Industry: Education > Curriculum > Subject-Specific Education (0.67)

Technology: Information Technology > Artificial Intelligence > Robots (1.00)

Add feedback

Bayesian Optimization for Iterative Learning

Neural Information Processing SystemsOct-10-2024, 10:55:18 GMT

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence. Traditional tuning algorithms only consider the final performance of hyperparameters acquired after many expensive iterations and ignore intermediate information from earlier training steps. In this paper, we present a Bayesian optimization(BO) approach which exploits the iterative structure of learning algorithms for efficient hyperparameter tuning. We propose to learn an evaluation function compressing learning progress at any stage of the training process into a single numeric score according to both training success and stability.

bayesian optimization, hyperparameter, iterative learning, (5 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.92)

Add feedback

Iterative Learning for Reliable Crowdsourcing Systems

Neural Information Processing SystemsApr-6-2023, 13:06:54 GMT

Crowdsourcing systems, in which tasks are electronically distributed to numerous information piece-workers'', have emerged as an effective paradigm for human-powered solving of large scale problems in domains such as image classification, data entry, optical character recognition, recommendation, and proofreading. Because these low-paid workers can be unreliable, nearly all crowdsourcers must devise schemes to increase confidence in their answers, typically by assigning each task multiple times and combining the answers in some way such as majority voting. In this paper, we consider a general model of such rowdsourcing tasks, and pose the problem of minimizing the total price (i.e., number of task assignments) that must be paid to achieve a target overall reliability. We give new algorithms for deciding which tasks to assign to which workers and for inferring correct answers from the workers' answers. We show that our algorithm significantly outperforms majority voting and, in fact, are asymptotically optimal through comparison to an oracle that knows the reliability of every worker.

iterative learning, majority voting, reliable crowdsourcing system, (1 more...)

Neural Information Processing Systems

Technology:

Information Technology > Communications > Social Media > Crowdsourcing (0.65)
Information Technology > Artificial Intelligence > Vision > Optical Character Recognition (0.64)

Add feedback

Learning Half-Spaces and other Concept Classes in the Limit with Iterative Learners

Khazraei, Ardalan, Kötzing, Timo, Seidel, Karen

arXiv.org Machine LearningOct-7-2020

In order to model an efficient learning paradigm, iterative learning algorithms access data one by one, updating the current hypothesis without regress to past data. Past research on iterative learning analyzed for example many important additional requirements and their impact on iterative learners. In this paper, our results are twofold. First, we analyze the relative learning power of various settings of iterative learning, including learning from text and from informant, as well as various further restrictions, for example we show that strongly non-U-shaped learning is restrictive for iterative learning from informant. Second, we investigate the learnability of the concept class of half-spaces and provide a constructive iterative algorithm to learn the set of half-spaces from informant.

artificial intelligence, informant, machine learning, (18 more...)

arXiv.org Machine Learning

2010.03227

Country:

North America > United States > Massachusetts > Middlesex County > Cambridge (0.14)
North America > United States > New York > New York County > New York City (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
(2 more...)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Visual Pivoting for (Unsupervised) Entity Alignment

Liu, Fangyu, Chen, Muhao, Roth, Dan, Collier, Nigel

arXiv.org Artificial IntelligenceSep-28-2020

This work studies the use of visual semantic representations to align entities in heterogeneous knowledge graphs (KGs). Images are natural components of many existing KGs. By combining visual knowledge with other auxiliary information, we show that the proposed new approach, EVA, creates a holistic entity representation that provides strong signals for cross-graph entity alignment. Besides, previous entity alignment methods require human labelled seed alignment, restricting availability. EVA provides a completely unsupervised solution by leveraging the visual similarity of entities to create an initial seed dictionary (visual pivots). Experiments on benchmark data sets DBP15k and DWY15k show that EVA offers state-of-the-art performance on both monolingual and cross-lingual entity alignment tasks. Furthermore, we discover that images are particularly useful to align long-tail KG entities, which inherently lack the structural contexts necessary for capturing the correspondences.

artificial intelligence, machine learning, natural language, (16 more...)

arXiv.org Artificial Intelligence

2009.13603

Country:

North America > United States > California (0.14)
Europe > Germany > Bavaria > Upper Bavaria > Munich (0.04)
Europe > Netherlands > North Holland > Amsterdam (0.04)
(3 more...)

Genre: Research Report > Experimental Study (0.46)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Iterative Learning for Reliable Crowdsourcing Systems

Karger, David R., Oh, Sewoong, Shah, Devavrat

Neural Information Processing SystemsFeb-14-2020, 23:26:50 GMT

Crowdsourcing systems, in which tasks are electronically distributed to numerous information piece-workers'', have emerged as an effective paradigm for human-powered solving of large scale problems in domains such as image classification, data entry, optical character recognition, recommendation, and proofreading. Because these low-paid workers can be unreliable, nearly all crowdsourcers must devise schemes to increase confidence in their answers, typically by assigning each task multiple times and combining the answers in some way such as majority voting. In this paper, we consider a general model of such rowdsourcing tasks, and pose the problem of minimizing the total price (i.e., number of task assignments) that must be paid to achieve a target overall reliability. We give new algorithms for deciding which tasks to assign to which workers and for inferring correct answers from the workers' answers. We show that our algorithm significantly outperforms majority voting and, in fact, are asymptotically optimal through comparison to an oracle that knows the reliability of every worker.

iterative learning, majority voting, reliable crowdsourcing system, (1 more...)

Neural Information Processing Systems

Technology: