AITopics | training loop

2508.06251

Country:

Europe (0.28)
North America > United States (0.14)

Genre: Research Report > New Finding (0.86)

Industry: Information Technology > Security & Privacy (1.00)

Technology:

Information Technology > Data Science (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.68)

Bailey, Alana A., Guy, Robert D.

Optimizing Metachronal Paddling with Reinforcement Learning at Low Reynolds Number

arXiv.org Machine LearningJul-28-2025

Metachronal paddling is a swimming strategy in which an organism oscillates sets of adjacent limbs with a constant phase lag, propagating a metachronal wave through its limbs and propelling it forward. This limb coordination strategy is utilized by swimmers across a wide range of Reynolds numbers, which suggests that this metachronal rhythm was selected for its optimality of swimming performance. In this study, we apply reinforcement learning to a swimmer at zero Reynolds number and investigate whether the learning algorithm selects this metachronal rhythm, or if other coordination patterns emerge. We design the swimmer agent with an elongated body and pairs of straight, inflexible paddles placed along the body for various fixed paddle spacings. Based on paddle spacing, the swimmer agent learns qualitatively different coordination patterns. At tight spacings, a back-to-front metachronal wave-like stroke emerges which resembles the commonly observed biological rhythm, but at wide spacings, different limb coordinations are selected. Across all resulting strokes, the fastest stroke is dependent on the number of paddles, however, the most efficient stroke is a back-to-front wave-like stroke regardless of the number of paddles.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Machine Learning

2507.18849

Country: North America > United States > California > Yolo County > Davis (0.04)

Genre: Research Report > New Finding (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

arXiv.org Artificial IntelligenceMay-22-2024

Learning To Play Atari Games Using Dueling Q-Learning and Hebbian Plasticity

Salehin, Md Ashfaq

In this work, an advanced deep reinforcement learning architecture is used to train neural network agents playing atari games. Given only the raw game pixels, action space, and reward information, the system can train agents to play any Atari game. At first, this system uses advanced techniques like deep Q-networks and dueling Q-networks to train efficient agents, the same techniques used by DeepMind to train agents that beat human players in Atari games. As an extension, plastic neural networks are used as agents, and their feasibility is analyzed in this scenario. The plasticity implementation was based on backpropagation and the Hebbian update rule. Plastic neural networks have excellent features like lifelong learning after the initial training, which makes them highly suitable in adaptive learning environments. As a new analysis of plasticity in this context, this work might provide valuable insights and direction for future works. Einforcement learning is a computational technique where an agent learns by directly interacting with its environment without having a complete model of the environment [1]. Reinforcement learning is a very good example of adaptive systems where an agent learns to make decisions and take actions in an environment in order to maximize some reward, which acts as feedback from the environment to the agent. Well-crafted reinforcement learning agents with optimized training loops are known to learn complex tasks, such as playing computer games. In previous work, a CNN-based agent was trained using discounted policy gradients, where all the rewards in an episode were fed to the agent as training data after discounting by a factor [2]. Although this approach served as a good starting point, it is not suitable for learning to control complex environments, such as Atari games. A better implementation is possible using the Q-Learning algorithm, which is based on the Bellman equation [3]. The Bellman equation is based on the Markov decision process [4] and states that the optimal value of a state is equal to the immediate reward plus the discounted expected optimal value of the next state under the optimal policy. While the Bellman equation requires all the reward values and transition probabilities to be known in advance, the Q-Learning algorithm uses Q-Values, which are initialized as random values and optimized gradually.

plastic weight, q-value, reward value, (16 more...)

2405.1396

Country: North America > United States > Indiana (0.04)

Genre: Research Report > Promising Solution (0.46)

Industry: Leisure & Entertainment > Games > Computer Games (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Undirected Networks > Markov Models (0.48)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.35)

arXiv.org Artificial IntelligenceJun-10-2023

INK: Injecting kNN Knowledge in Nearest Neighbor Machine Translation

Zhu, Wenhao, Xu, Jingjing, Huang, Shujian, Kong, Lingpeng, Chen, Jiajun

Neural machine translation has achieved promising results on many translation tasks. However, previous studies have shown that neural models induce a non-smooth representation space, which harms its generalization results. Recently, kNN-MT has provided an effective paradigm to smooth the prediction based on neighbor representations during inference. Despite promising results, kNN-MT usually requires large inference overhead. We propose an effective training framework INK to directly smooth the representation space via adjusting representations of kNN neighbors with a small number of new parameters. The new parameters are then used to refresh the whole representation datastore to get new kNN knowledge asynchronously. This loop keeps running until convergence. Experiments on four benchmark datasets show that \method achieves average gains of 1.99 COMET and 1.0 BLEU, outperforming the state-of-the-art kNN-MT system with 0.02x memory space and 1.9x inference speedup.

artificial intelligence, natural language, representation, (16 more...)

2306.06381

Country:

Asia > China > Shanghai > Shanghai (0.04)
Asia > China > Jiangsu Province > Nanjing (0.04)
Asia > China > Hong Kong (0.04)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language > Machine Translation (1.00)

arXiv.org Artificial IntelligenceMay-26-2023

Simulator-Based Self-Supervision for Learned 3D Tomography Reconstruction

Kosomaa, Onni, Laine, Samuli, Karras, Tero, Aittala, Miika, Lehtinen, Jaakko

We propose a deep learning method for 3D volumetric reconstruction in low-dose helical cone-beam computed tomography. Prior machine learning approaches require reference reconstructions computed by another algorithm for training. In contrast, we train our model in a fully self-supervised manner using only noisy 2D X-ray data. This is enabled by incorporating a fast differentiable CT simulator in the training loop. As we do not rely on reference reconstructions, the fidelity of our results is not limited by their potential shortcomings. We evaluate our method on real helical cone-beam projections and simulated phantoms. Our results show significantly higher visual fidelity and better PSNR over techniques that rely on existing reconstructions. When applied to full-dose data, our method produces high-quality results orders of magnitude faster than iterative techniques.

artificial intelligence, machine learning, projection, (20 more...)

2212.07431

Country: Europe > Finland (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine > Diagnostic Medicine > Imaging (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.87)

#artificialintelligenceApr-7-2023, 23:11:30 GMT

Introduction to Lightning Fabric

Lightning Fabric is a new, open-source library that allows you to quickly and easily scale models while maintaining full control over your training loop. In the past, getting PyTorch code to run efficiently on GPUs and scaling it up to many machines and large datasets was possible with PyTorch Lightning. As time went on, however, we became aware of the need to provide a scaling option that landed somewhere between a raw deep learning framework like PyTorch on the one hand, and a high-level, feature-rich framework like PyTorch Lightning. Lightning Fabric is just that. While PyTorch Lightning provides many features to save time and improve readability and collaboration, there are complex use cases where full control over the training loop is needed.

fabric, lightning fabric, pytorch lightning, (11 more...)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceApr-7-2023, 20:14:43 GMT

Deep Learning with PyTorch (9-Day Mini-Course) - MachineLearningMastery.com Deep Learning with PyTorch (9-Day Mini-Course) - MachineLearningMastery.com

Deep learning is a fascinating field of study and the techniques are achieving world class results in a range of challenging machine learning problems. It can be hard to get started in deep learning. Which library should you use and which techniques should you focus on? In this 9-part crash course you will discover applied deep learning in Python with the easy to use and powerful PyTorch library. This mini-course is intended for practitioners that are already comfortable with programming in Python and knows the basic concept of machine learning. This is a long and useful post. You might want to print it out. Photo by Thomas Kinto, some rights reserved.

dataset, neural network, pytorch, (13 more...)

Genre: Instructional Material > Course Syllabus & Notes (1.00)

Industry: Education (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceMar-28-2023, 17:00:40 GMT

Introduction to PyTorch: from training loop to prediction

That said, let's see what the code for writing a logistic regression model looks like. Our class inherits from nn.Module. This class provides the methods behind the scenes that make the model work. The __init__ method of a class contains the logic that runs when instantiating a class in Python. Here we pass two arguments: the number of features and the number of classes to predict.

neural network, pytorch, training loop, (15 more...)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.78)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.56)

#artificialintelligenceMar-6-2023, 10:05:19 GMT

PyLessons

Time to build our training loop. First, we want to make sure our network is in training mode.

artificial intelligence, deep learning, machine learning, (20 more...)

Genre: Instructional Material (0.49)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

#artificialintelligenceFeb-12-2023, 18:05:49 GMT

Building a Multiclass Classification Model in PyTorch - MachineLearningMastery.com Building a Multiclass Classification Model in PyTorch - MachineLearningMastery.com

PyTorch library is for deep learning. Some applications of deep learning models are to solve regression or classification problems. In this tutorial, you will discover how to use PyTorch to develop and evaluate neural network models for multi-class classification problems. In this tutorial, you will use a standard machine learning dataset called the iris flowers dataset. It is a well-studied dataset and good for practicing machine learning.

multiclass classification model, pytorch, vector, (15 more...)

Genre: Instructional Material (0.34)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)