
Channel-Wise MLPs Improve the Generalization of Recurrent Convolutional Networks

Breslow, Nathan

arXiv.org Artificial Intelligence

We investigate the impact of channel-wise mixing via multi-layer perceptrons (MLPs) on the generalization capabilities of recurrent convolutional networks. Specifically, we compare two architectures: DARC (Depth Aware Recurrent Convolution), which employs a simple recurrent convolutional structure, and DAMP (Depth Aware Multi-layer Perceptron), which extends DARC with a gated MLP for channel mixing. Using the Re-ARC benchmark, we find that DAMP significantly outperforms DARC in both in-distribution and out-of-distribution generalization under exact-match grading criteria. These results suggest that explicit channel mixing through MLPs enables recurrent convolutional networks to learn more robust and generalizable computational patterns. Our findings have implications for neural program synthesis and highlight the potential of DAMP as a target architecture for hypernetwork approaches.
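The channel-mixing contrast the abstract draws can be made concrete with a short PyTorch sketch: a weight-tied recurrent convolution (the DARC-style core) followed by a gated channel-wise MLP (the DAMP-style addition). The layer sizes, the sigmoid gating, and the residual connection are illustrative assumptions, not the authors' released architecture.

```python
import torch
import torch.nn as nn

class RecurrentConvCell(nn.Module):
    """DARC-style core: one convolution applied repeatedly (weight-tied over depth)."""
    def __init__(self, channels: int, steps: int = 4):
        super().__init__()
        self.conv = nn.Conv2d(channels, channels, kernel_size=3, padding=1)
        self.steps = steps

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        h = x
        for _ in range(self.steps):  # recurrence over depth
            h = torch.relu(self.conv(h))
        return h

class GatedChannelMLP(nn.Module):
    """DAMP-style addition: a gated MLP that mixes channels at each spatial location."""
    def __init__(self, channels: int, expansion: int = 4):
        super().__init__()
        hidden = channels * expansion
        # 1x1 convolutions act as channel-wise (position-wise) linear layers.
        self.up_and_gate = nn.Conv2d(channels, 2 * hidden, kernel_size=1)
        self.down = nn.Conv2d(hidden, channels, kernel_size=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        u, g = self.up_and_gate(x).chunk(2, dim=1)
        return x + self.down(u * torch.sigmoid(g))  # gated channel mixing + residual
```

Since a 1x1 convolution is exactly a position-wise MLP over channels, the gated block mixes information across channels at every pixel, while the recurrent convolution mixes it spatially.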


Off-Dynamics Reinforcement Learning via Domain Adaptation and Reward Augmented Imitation

Guo, Yihong, Wang, Yixuan, Shi, Yuanyuan, Xu, Pan, Liu, Anqi

arXiv.org Artificial Intelligence

Training a policy in a source domain for deployment in a target domain under a dynamics shift can be challenging, often resulting in performance degradation. Previous work tackles this challenge by training on the source domain with modified rewards derived from matching the distributions of source and target optimal trajectories. However, the modified reward alone only ensures that the learned policy's behavior in the source domain resembles trajectories produced by the target optimal policies; it does not guarantee optimal performance when the learned policy is actually deployed in the target domain. In this work, we propose to use imitation learning to transfer the policy learned via reward modification to the target domain, so that the new policy can generate the same trajectories in the target domain. Our approach, Domain Adaptation and Reward Augmented Imitation Learning (DARAIL), uses reward modification for domain adaptation and follows the general framework of generative adversarial imitation learning from observation (GAIfO), applying a reward-augmented estimator in the policy optimization step. Theoretically, we present an error bound for our method under a mild assumption on the dynamics shift to justify its motivation. Empirically, our method outperforms the pure modified-reward method without imitation learning, as well as other baselines, in benchmark off-dynamics environments.
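Reading the abstract, DARAIL combines two reward signals: a dynamics-gap correction used while training in the source domain, and a GAIfO-style imitation signal blended into the policy update. A minimal NumPy sketch of those two signals follows; the classifier-based form of the correction and the mixing weight `eta` are assumptions for illustration, not the paper's exact estimator.

```python
import numpy as np

def modified_reward(r, log_p_target_sas, log_p_source_sas):
    """Source-domain reward shaped by the dynamics gap (domain adaptation step)."""
    return r + (log_p_target_sas - log_p_source_sas)

def imitation_reward(disc_logit):
    """GAIfO-style reward from a discriminator scoring (s, s') pairs as 'expert'."""
    return -np.log1p(np.exp(-disc_logit))  # log D(s, s'), numerically stable

def augmented_reward(r_env, disc_logit, eta=0.5):
    """Reward-augmented estimator (assumed form): blend the environment reward
    with the imitation signal during policy optimization."""
    return eta * r_env + (1.0 - eta) * imitation_reward(disc_logit)
```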


Double Actor-Critic with TD Error-Driven Regularization in Reinforcement Learning

Chen, Haohui, Chen, Zhiyong, Liu, Aoxiang, Fang, Wentuo

arXiv.org Artificial Intelligence

To obtain better value estimates in reinforcement learning, we propose a novel algorithm based on the double actor-critic framework with temporal difference error-driven regularization, abbreviated as TDDR. TDDR employs double actors, with each actor paired with a critic, thereby fully leveraging the advantages of double critics. Additionally, TDDR introduces an innovative critic regularization architecture. Compared to classical deterministic policy gradient-based algorithms that lack a double actor-critic structure, TDDR provides superior value estimation. Moreover, unlike existing algorithms with double actor-critic frameworks, TDDR introduces no additional hyperparameters, significantly simplifying design and implementation. Experiments demonstrate that TDDR is strongly competitive with benchmark algorithms in challenging continuous control tasks.
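The abstract does not spell out the update, so the PyTorch sketch below only illustrates the general shape of a double actor-critic target with a TD error-driven regularizer: two target actors each propose an action, both critics evaluate each proposal, and the regularization is weighted by the TD error magnitude rather than by an extra hyperparameter. All of these specifics are assumptions reconstructed from the abstract, not the paper's algorithm.

```python
import torch

def tddr_target(r, s_next, done, gamma, actors_t, critics_t):
    """Bootstrapped target using two target actors and two target critics."""
    with torch.no_grad():
        qs = []
        for actor in actors_t:                      # each actor proposes an action
            a = actor(s_next)
            q = torch.min(critics_t[0](s_next, a),  # min over critics guards
                          critics_t[1](s_next, a))  # against overestimation
            qs.append(q)
        q_next = torch.min(qs[0], qs[1])
        return r + gamma * (1.0 - done) * q_next

def critic_loss(critic, s, a, target, q_other_detached):
    """MSE to the target plus a TD error-driven pull toward the other critic."""
    q = critic(s, a)
    td_error = (target - q).detach()
    weight = torch.tanh(td_error.abs())  # assumed hyperparameter-free weighting
    return ((q - target).pow(2) + weight * (q - q_other_detached).pow(2)).mean()
```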


A Trust Region Approach for Few-Shot Sim-to-Real Reinforcement Learning

Daoudi, Paul, Prieur, Christophe, Robu, Bogdan, Barlier, Merwan, Santos, Ludovic Dos

arXiv.org Machine Learning

Simulation-to-Reality Reinforcement Learning (Sim-to-Real RL) seeks to use simulations to minimize the need for extensive real-world interactions. Specifically, in the few-shot off-dynamics setting, the goal is to acquire, despite a dynamics mismatch, a simulator-based policy that can be effectively transferred to the real world using only a handful of real-world transitions. In this context, conventional RL agents tend to exploit simulation inaccuracies, resulting in policies that excel in the simulator but underperform in the real environment. To address this challenge, we introduce a novel approach, inspired by recent advances in Imitation Learning and Trust Region-based RL algorithms, that incorporates a penalty to constrain the trajectories induced by the simulator-trained policy. We evaluate our method across various environments representing diverse Sim-to-Real conditions where access to the real environment is extremely limited. These experiments include high-dimensional systems relevant to real-world applications. Across most tested scenarios, our proposed method demonstrates performance improvements over existing baselines.

Reinforcement Learning (RL) is often applied in simulation before deploying the learned policy on real systems (Ju et al., 2022; Muratore et al., 2019; Kaspar et al., 2020; Witman et al., 2019). This approach is considered one of the safest and most efficient ways of obtaining a near-optimal policy for complex systems (Jiang et al., 2021; Salvato et al., 2021; Hsu et al., 2023), as many of the challenges of applying RL to real-world systems (Dulac-Arnold et al., 2021) are mitigated. The agent can sample the simulator at will (Kamthe & Deisenroth, 2018; Schwarzer et al., 2021) without having to consider any safety constraints (García & Fernández, 2015; Achiam et al., 2017) during training. However, simulators of complex systems are often inaccurate: many physical laws, such as contact forces, material elasticity, and fluid dynamics, are difficult to model, leading simulators to rely on approximations (Koenig & Howard, 2004; Todorov et al., 2012).
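Returning to the penalty described in the abstract above: the text only states that trajectories induced by the simulator-trained policy are constrained, so the sketch below shows one plausible instantiation in PyTorch, shaping the simulator reward with the log-probability that a transition looks "real" according to a classifier trained on the few available real transitions. The classifier interface and the coefficient `lam` are assumptions for illustration, not the paper's method.

```python
import torch
import torch.nn.functional as F

def penalized_reward(r_sim, clf_logit, lam=1.0):
    """r_sim: simulator reward; clf_logit: classifier score that the
    transition (s, a, s') looks 'real' rather than 'simulated'.

    log-sigmoid of the logit is near 0 for realistic transitions and very
    negative for transitions that exploit simulator inaccuracies, so adding
    it penalizes trajectories the real system could not produce.
    """
    realism = F.logsigmoid(clf_logit)
    return r_sim + lam * realism
```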


Reinforcement Learning for Predicting Traffic Accidents

Cho, Injoon, Rajendran, Praveen Kumar, Kim, Taeyoung, Har, Dongsoo

arXiv.org Artificial Intelligence

As the demand for autonomous driving increases, ensuring safety is paramount. Early accident prediction using deep learning methods for driving safety has recently gained much attention. In this task, an early accident prediction and a point prediction of where the driver should look are determined, with dashcam video as input. We propose to exploit the Double Actors and Regularized Critics (DARC) method, for the first time, on this accident-forecasting platform. We draw inspiration from DARC since it is currently a state-of-the-art reinforcement learning (RL) model for continuous action spaces, making it suitable for accident anticipation. Results show that by utilizing DARC, we can make predictions 5% earlier on average while improving on multiple precision metrics compared to existing methods. These results imply that our RL-based problem formulation could significantly increase the safety of autonomous driving.
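The abstract implies a continuous action space combining a gaze point with an accident score. A hedged NumPy sketch of that formulation follows; the names, normalization, and earliness-shaped reward are all illustrative assumptions, not the paper's code.

```python
import numpy as np

def decode_action(policy_out):
    """policy_out: array of 3 floats in [-1, 1] from a DARC-style actor."""
    x, y, s = policy_out
    fixation = ((x + 1) / 2, (y + 1) / 2)  # predicted gaze point, normalized image coords
    accident_score = (s + 1) / 2           # probability-like accident anticipation score
    return fixation, accident_score

def earliness_reward(accident_score, frames_to_accident, horizon=90):
    """Reward confident predictions made earlier before the labeled accident frame."""
    earliness = np.clip(frames_to_accident / horizon, 0.0, 1.0)
    return accident_score * earliness
```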


Researchers use AI-based test to predict the retinal disease geographic atrophy

#artificialintelligence

As part of a study published in Progress in Retinal and Eye Research, 113 patients were examined using Detection of Apoptosing Retinal Cells (DARC) to detect areas of the eye indicative of the retinal disease geographic atrophy. The study was conducted by experts at Imperial College London. "DARC (Detection of Apoptosing Retinal Cells) is a retinal imaging technology that has been developed within the last 2 decades from basic laboratory science to Phase 2 clinical trials," according to the findings. "It uses ANX776 (fluorescently labelled Annexin A5) to identify stressed and apoptotic cells in the living eye. During its development, DARC has undergone biochemistry optimisation, scale-up and GMP manufacture and extensive preclinical evaluation."


It's all in the research: Using AI to solve issues in health care

#artificialintelligence

The University of Alberta uses SAS Viya to help its researchers expand their capacity for big data analysis and support the use of open source software and other tools popular among students. Conducting research is not a straightforward process, and the terabytes of data cascading into labs (both physical and virtual) require serious horsepower to analyze. Personal desktops and small servers are increasingly coming up short in meeting the demands of artificial intelligence and machine learning projects. Data also comes in various shapes and sizes. Researchers often combine data related to diagnostic imaging, risk prediction, clinical trials and much more.


Eye test uses AI to predict macular degeneration

Daily Mail - Science & tech

A new eye test that uses artificial intelligence (AI) to study retina scans can predict age-related macular degeneration (AMD) three years before symptoms start. The first part of the 'pioneering' test, developed by researchers at University College London, is called DARC. DARC involves injecting dye into a person's bloodstream to illuminate 'stressed' endothelial cells in the retina, so they appear bright white under a fluorescent camera. These 'stressed' retinal cells could lead to abnormalities and later leaking blood vessels – causing AMD, which can severely compromise the central field of vision. The second part of the test uses an AI algorithm, trained to detect whether the highlighted white spots are around the macula – which indicates high AMD risk.