Chowdhary, Girish
Energy Shaping Control of a CyberOctopus Soft Arm
Chang, Heng-Sheng, Halder, Udit, Shih, Chia-Hsien, Tekinalp, Arman, Parthasarathy, Tejaswin, Gribkova, Ekaterina, Chowdhary, Girish, Gillette, Rhanor, Gazzola, Mattia, Mehta, Prashant G.
This paper presents an application of the energy shaping methodology to control a flexible, elastic Cosserat rod model. Recent interest in such continuum models stems from applications in soft robotics, and from the growing recognition of the role of mechanics and embodiment in biological control strategies: octopuses are often regarded as iconic examples of this interplay. Here, the dynamics of the Cosserat rod, modeling a single octopus arm, are treated as a Hamiltonian system and the internal muscle actuators are modeled as distributed forces and couples. The proposed energy shaping control design procedure involves two steps: (1) a potential energy is designed such that its minimizer is the desired equilibrium configuration; (2) an energy shaping control law is implemented to reach the desired equilibrium. By interpreting the controlled Hamiltonian as a Lyapunov function, asymptotic stability of the equilibrium configuration is deduced. The energy shaping control law is shown to require only the deformations of the equilibrium configuration. A forward-backward algorithm is proposed to compute these deformations in an online iterative manner. The overall control design methodology is implemented and demonstrated in a dynamic simulation environment. Results of several bio-inspired numerical experiments involving the control of octopus arms are reported.
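The two-step procedure described in this abstract can be illustrated on a much simpler Hamiltonian system than the Cosserat rod. The sketch below is a toy single-pendulum example (all gains and the desired angle `q_star` are illustrative choices, not from the paper): the control cancels the gravitational potential gradient, substitutes a shaped potential minimized at the desired equilibrium, and injects damping so the controlled Hamiltonian decreases along trajectories.

```python
import math

# Toy energy shaping illustration (a pendulum, not the paper's Cosserat rod).
m, l, g = 1.0, 1.0, 9.81      # pendulum mass, length, gravity
k, c = 5.0, 2.0               # shaped-potential stiffness, damping gain
q_star = 1.0                  # desired equilibrium angle (rad)

def control(q, qdot):
    # u = (gravity cancellation) - (shaped potential gradient) - (damping)
    return m * g * l * math.sin(q) - k * (q - q_star) - c * qdot

q, qdot, dt = 0.0, 0.0, 1e-3
for _ in range(20000):        # simulate 20 s with explicit Euler
    qddot = -(g / l) * math.sin(q) + control(q, qdot) / (m * l ** 2)
    qdot += dt * qddot
    q += dt * qdot
```

The closed loop reduces to a damped linear oscillator around `q_star`, so the state converges to the minimizer of the shaped potential, mirroring the Lyapunov argument sketched in the abstract.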
Learning to Cope with Adversarial Attacks
Lee, Xian Yeow, Havens, Aaron, Chowdhary, Girish, Sarkar, Soumik
The security of Deep Reinforcement Learning (Deep RL) algorithms deployed in real-life applications is of primary concern. In particular, the robustness of RL agents in cyber-physical systems against adversarial attacks is especially vital, since the cost of a malevolent intrusion can be extremely high. Studies have shown that Deep Neural Networks (DNNs), which form the core decision-making unit in most modern RL algorithms, are easily subjected to adversarial attacks. Hence, it is imperative that RL agents deployed in real-life applications have the capability to detect and mitigate adversarial attacks in an online fashion. An example of such a framework is the Meta-Learned Advantage Hierarchy (MLAH) agent, which utilizes a meta-learning framework to learn policies robustly online. Since the mechanisms of this framework are still not fully explored, we conducted multiple experiments to better understand its capabilities and limitations. Our results show that the MLAH agent exhibits interesting coping behaviors to maintain a nominal reward when subjected to different adversarial attacks. Additionally, the framework exhibits a hierarchical coping capability, based on the adaptability of the master policy and the sub-policies themselves. From empirical results, we also observed that as the interval between adversarial attacks increases, the MLAH agent can maintain a higher distribution of rewards, though at the cost of higher instability.
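The hierarchical coping idea can be sketched in miniature. The toy below is only illustrative (the reward threshold, sub-policy names, and switching rule are assumptions, not MLAH's actual advantage-based mechanism): a master policy monitors how far observed rewards fall below the nominal expectation and hands control to a defensive sub-policy when the gap suggests an ongoing attack.

```python
# Hypothetical sketch of a master policy switching sub-policies; the
# threshold and policies below are illustrative stand-ins, not MLAH itself.
NOMINAL_REWARD = 1.0
THRESHOLD = 0.5                     # hypothetical switching threshold

def nominal_policy(obs):            # sub-policy tuned for clean observations
    return obs

def defensive_policy(obs):          # sub-policy that discounts suspect inputs
    return 0.0

def master(recent_rewards):
    # Switch sub-policies based on a running estimate of the reward gap,
    # a stand-in for the advantage-based switching of the real framework.
    gap = NOMINAL_REWARD - sum(recent_rewards) / len(recent_rewards)
    return defensive_policy if gap > THRESHOLD else nominal_policy

clean_choice = master([1.0, 0.9, 1.1])      # no attack detected
attacked_choice = master([0.1, -0.2, 0.0])  # large reward gap
```

With clean rewards the gap stays below the threshold and the nominal sub-policy is kept; a sustained reward drop triggers the defensive sub-policy, which is the coping behavior the experiments above probe.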
Cross-Domain Transfer in Reinforcement Learning using Target Apprentice
Joshi, Girish, Chowdhary, Girish
In this paper, we present a new approach to Transfer Learning (TL) in Reinforcement Learning (RL) for cross-domain tasks. Many of the available techniques approach the transfer architecture as a method of speeding up the target task learning. We propose to adapt and reuse the mapped source task optimal policy directly in related domains. We show that the optimal policy from a related source task can be near-optimal in the target domain, provided an adaptive policy accounts for the model error between the target and the source. The main benefit of this policy augmentation is generalizing policies across multiple related domains without having to re-learn the new tasks. Our results show that this architecture leads to better sample efficiency in the transfer, reducing the sample complexity of target task learning to that of target apprentice learning.
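The core idea, reusing a source policy plus an adaptive correction for model error, can be shown on scalar linear systems. This is a hedged toy (the dynamics, gains, and the exact correction term are illustrative, not the paper's method): the correction cancels the dynamics mismatch so the target closed loop behaves like the source closed loop.

```python
# Toy transfer example on scalar linear systems x' = a*x + b*u.
a_src, a_tgt, b = 0.9, 1.2, 1.0   # source/target dynamics, shared input gain
k = 0.5                           # policy gain learned in the source domain

def source_policy(x):
    return -k * x

def adapted_policy(x):
    # Augment the source policy with a term that cancels the model error
    # (a_tgt - a_src), so the target closed loop matches the source one.
    return source_policy(x) + (a_src - a_tgt) * x / b

x_src = x_tgt = 1.0
for _ in range(50):
    x_src = a_src * x_src + b * source_policy(x_src)
    x_tgt = a_tgt * x_tgt + b * adapted_policy(x_tgt)
```

Both closed loops contract by the same factor `a_src - b*k` per step, so the transferred policy stabilizes the target without re-learning it from scratch.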
Kernel Observers: Systems-Theoretic Modeling and Inference of Spatiotemporally Evolving Processes
Kingravi, Hassan A., Maske, Harshal R., Chowdhary, Girish
We consider the problem of estimating the latent state of a spatiotemporally evolving continuous function using very few sensor measurements. We show that layering a dynamical systems prior over the temporal evolution of the weights of a kernel model is a valid approach to spatiotemporal modeling that does not necessarily require the design of complex nonstationary kernels. Furthermore, we show that such a predictive model can be utilized to determine sensing locations that guarantee that the hidden state of the phenomenon can be recovered with very few measurements. We provide sufficient conditions on the number and spatial location of samples required to guarantee state recovery, and provide a lower bound on the minimum number of samples required to robustly infer the hidden states. Our approach outperforms existing methods in numerical experiments.
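The mechanism can be sketched concretely. In the toy below (centers, weight dynamics, and kernel bandwidth are illustrative choices, not the paper's construction), the function is a weighted sum of RBFs, the weights evolve under a known linear map `A`, and a single sensor yields scalar measurements `y_t = C w_t`; because the pair `(A, C)` is observable here, the hidden weights are recoverable from a short measurement sequence.

```python
import numpy as np

# Hedged sketch of a dynamical prior over kernel weights with one sensor.
rng = np.random.default_rng(0)
centers = np.array([0.0, 0.33, 0.66, 1.0])        # RBF centers
A = np.diag([0.9, 0.8, 0.7, 0.6])                 # weight-dynamics prior

def kernel(x, c):
    return np.exp(-(x - c) ** 2 / 0.1)            # Gaussian RBF

x_sensor = 0.5                                    # one sensing location
C = kernel(x_sensor, centers)[None, :]            # 1 x 4 measurement map

w0 = rng.standard_normal(4)                       # hidden initial weights
ys, w = [], w0.copy()
for _ in range(4):                                # 4 scalar measurements
    ys.append(C @ w)
    w = A @ w

# Stack y_t = C A^t w0 and invert the observability map: one well-placed
# sensor sampled over time recovers the full hidden weight vector.
O = np.vstack([C @ np.linalg.matrix_power(A, t) for t in range(4)])
w0_hat = np.linalg.lstsq(O, np.concatenate(ys), rcond=None)[0]
```

Distinct eigenvalues in `A` and nonzero kernel weights in `C` make the stacked observability matrix full rank, which is the flavor of sufficient condition the abstract refers to.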
Trusting Learning Based Adaptive Flight Control Algorithms
Mühlegg, Maximilian (Technische Universität München) | Holzapfel, Florian (Technische Universität München) | Chowdhary, Girish (Oklahoma State University)
Autonomous unmanned aerial systems (UAS) are envisioned to become increasingly utilized in commercial airspace. In order to be attractive for commercial applications, UAS are required to undergo a quick development cycle, ensure cost effectiveness and work reliably in changing environments. Learning based adaptive control systems have been proposed to meet these demands. These techniques promise more flexibility when compared with traditional linear control techniques. However, no consistent verification and validation (V&V) framework exists for adaptive controllers. The underlying purpose of the V&V processes in certifying control algorithms for aircraft is to build trust in a safety critical system. In the past, most adaptive control algorithms were solely designed to ensure stability of a model system and meet robustness requirements against selective uncertainties and disturbances. However, these assessments do not guarantee the reliable performance of the real system required by the V&V process. The question arises as to how trust can be defined for learning based adaptive control algorithms. From our perspective, self-confidence of an adaptive flight controller will be an integral part of building trust in the system. The notion of self-confidence in the adaptive control context relates to the adaptive controller's estimate of its own capability to operate reliably, and its ability to foresee the need for taking action before undesired behaviors lead to a loss of the system. In this paper we present a pathway to a possible answer to the question of how self-confidence for adaptive controllers can be achieved. In particular, we elaborate how algorithms for diagnosis and prognosis can be integrated to help in this process.
Uninformed-to-Informed Exploration in Unstructured Real-World Environments
Axelrod, Allan (Oklahoma State University) | Chowdhary, Girish (Oklahoma State University)
Conventionally, the process of learning the model (exploration) is initialized as either an uninformed or an informed policy, where the latter leverages observations to guide future exploration. Informed exploration is ideal as it may allow a model to be learned in fewer samples. However, informed exploration cannot be implemented from the onset when a priori knowledge of the sensing domain statistics is not available; such policies would only sample the first set of locations, repeatedly. Hence, we present a theoretically-derived bound for transitioning from uninformed exploration to informed exploration in unstructured real-world environments which may be partially-observable and time-varying. This bound is used in tandem with a sparsified Bayesian nonparametric Poisson Exposure Process, which learns to predict the value of information in partially-observable and time-varying domains. The result is an uninformed-to-informed exploration policy which outperforms baseline algorithms on real-world datasets.
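The transition can be sketched with a toy sampling loop. In the sketch below, the switch count `n0` is a hypothetical stand-in for the paper's theoretical bound, and the rates and noise model are illustrative: the policy samples locations round-robin (uninformed) until every location has `n0` observations, then switches to an informed choice that targets the location with the highest estimated value.

```python
import random

# Hedged sketch of an uninformed-to-informed exploration switch.
random.seed(1)
true_rates = [0.2, 1.5, 0.7]          # per-location event rates (unknown)
counts = [0.0, 0.0, 0.0]
visits = [0, 0, 0]
n0 = 50                               # hypothetical transition bound

def sample(loc):                      # noisy observation of the true rate
    return true_rates[loc] + random.gauss(0.0, 0.1)

step = 0
while min(visits) < n0:               # uninformed phase: round-robin
    loc = step % 3
    counts[loc] += sample(loc)
    visits[loc] += 1
    step += 1

# Informed phase: exploit the learned estimates of value of information.
estimates = [counts[i] / visits[i] for i in range(3)]
informed_choice = max(range(3), key=lambda i: estimates[i])
```

Switching too early would lock exploration onto whichever location looked best in the first few noisy samples, which is the repeated-sampling failure mode the abstract describes; the bound governs when the estimates are trustworthy enough to switch.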