Sofge, Donald
Seasonal Station-Keeping of Short Duration High Altitude Balloons using Deep Reinforcement Learning
Schuler, Tristan K., Prasad, Chinthan, Kiselev, Georgiy, Sofge, Donald
Station-Keeping short-duration high-altitude balloons (HABs) in a region of interest is a challenging path-planning problem due to partially observable, complex, and dynamic wind flows. Deep reinforcement learning is a popular strategy for solving the station-keeping problem. A custom simulation environment was developed to train and evaluate Deep Q-Learning (DQN) for short-duration HAB agents in the simulation. To train the agents on realistic winds, synthetic wind forecasts were generated from aggregated historical radiosonde data to apply horizontal kinematics to simulated agents. The synthetic forecasts were closely correlated with ECWMF ERA5 Reanalysis forecasts, providing a realistic simulated wind field and seasonal and altitudinal variances between the wind models. DQN HAB agents were then trained and evaluated across different seasonal months. To highlight differences and trends in months with vastly different wind fields, a Forecast Score algorithm was introduced to independently classify forecasts based on wind diversity, and trends between station-keeping success and the Forecast Score were evaluated across all seasons.
Fine Tuning Swimming Locomotion Learned from Mosquito Larvae
Rajbhandari, Pranav, Dhileep, Karthick, Ravi, Sridhar, Sofge, Donald
In prior research, we analyzed the backwards swimming motion of mosquito larvae, parameterized it, and replicated it in a Computational Fluid Dynamics (CFD) model. Since the parameterized swimming motion is copied from observed larvae, it is not necessarily the most efficient locomotion for the model of the swimmer. In this project, we further optimize this copied solution for the swimmer model. We utilize Reinforcement Learning to guide local parameter updates. Since the majority of the computation cost arises from the CFD model, we additionally train a deep learning model to replicate the forces acting on the swimmer model. We find that this method is effective at performing local search to improve the parameterized swimming locomotion.
Transformer Guided Coevolution: Improved Team Formation in Multiagent Adversarial Games
Rajbhandari, Pranav, Dasgupta, Prithviraj, Sofge, Donald
Researchers have addressed the team selection problem in multiagent team formation using evolutionary computation-based approaches We consider the problem of team formation within multiagent adversarial [14, 31], albeit for non-adversarial settings like search and games. We propose BERTeam, a novel algorithm that uses reconnaissance. In this paper, we consider the use of a transformer a transformer-based deep neural network with Masked Language based neural network to predict the set of agents which form a team. Model training to select the best team of players from a trained population. We name this technique BERTeam, and investigate its suitability We integrate this with coevolutionary deep reinforcement for team formation in multiagent adversarial games.
Learning Emergent Behavior in Robot Swarms with NEAT
Rajbhandari, Pranav, Sofge, Donald
When researching robot swarms, many studies observe complex group behavior emerging from the individual agents' simple local actions. However, the task of learning an individual policy to produce a desired emergent behavior remains a challenging and largely unsolved problem. We present a method of training distributed robotic swarm algorithms to produce emergent behavior. Inspired by the biological evolution of emergent behavior in animals, we use an evolutionary algorithm to train a 'population' of individual behaviors to approximate a desired group behavior. We perform experiments using simulations of the Georgia Tech Miniature Autonomous Blimps (GT-MABs) aerial robotics platforms conducted in the CoppeliaSim simulator. Additionally, we test on simulations of Anki Vector robots to display our algorithm's effectiveness on various modes of actuation. We evaluate our algorithm on various tasks where a somewhat complex group behavior is required for success. These tasks include an Area Coverage task, a Surround Target task, and a Wall Climb task. We compare behaviors evolved using our algorithm against 'designed policies', which we create in order to exhibit the emergent behaviors we desire.
Path-Based Sensors: Will the Knowledge of Correlation in Random Variables Accelerate Information Gathering?
Srivastava, Alkesh K., Kontoudis, George P., Sofge, Donald, Otte, Michael
Effective communication is crucial for deploying robots in mission-specific tasks, but inadequate or unreliable communication can greatly reduce mission efficacy, for example in search and rescue missions where communication-denied conditions may occur. In such missions, robots are deployed to locate targets, such as human survivors, but they might get trapped at hazardous locations, such as in a trapping pit or by debris. Thus, the information the robot collected is lost owing to the lack of communication. In our prior work, we developed the notion of a path-based sensor. A path-based sensor detects whether or not an event has occurred along a particular path, but it does not provide the exact location of the event. Such path-based sensor observations are well-suited to communication-denied environments, and various studies have explored methods to improve information gathering in such settings. In some missions it is typical for target elements to be in close proximity to hazardous factors that hinder the information-gathering process. In this study, we examine a similar scenario and conduct experiments to determine if additional knowledge about the correlation between hazards and targets improves the efficiency of information gathering. To incorporate this knowledge, we utilize a Bayesian network representation of domain knowledge and develop an algorithm based on this representation. Our empirical investigation reveals that such additional information on correlation is beneficial only in environments with moderate hazard lethality, suggesting that while knowledge of correlation helps, further research and development is necessary for optimal outcomes.
Acoustic Beamforming for Object-relative Distance Estimation and Control in Unmanned Air Vehicles using Propulsion System Noise
Sharma, Alisha, Geder, Jason, Lingevitch, Joseph, Martin, Theodore, Lofaro, Daniel, Sofge, Donald
Unmanned air vehicles often produce significant noise from their propulsion systems. Using this broadband signal as "acoustic illumination" for an auxiliary sensing system could make vehicles more robust at a minimal cost. We present an acoustic beamforming-based algorithm that estimates object-relative distance with a small two-microphone array using the generated propulsion system noise of a vehicle. We demonstrate this approach in several closed-loop distance feedback control tests with a mounted quad-rotor vehicle in a noisy environment and show accurate object-relative distance estimates more than 2x further than the baseline channel-based approach. We conclude that this approach is robust to several practical vehicle and noise situations and shows promise for use in more complex operating environments.
Artificial Intelligence for the Internet of Everything
Lawless, W. F. (Paine College) | Mittu, Ranjeev (U.S. Naval Research Laboratory) | Sofge, Donald ( U.S. Naval Research Laboratory )
For the Internet of Everything (IoE), from an AI perspective, we discuss the meaning, value and effect that the internet of things (IoT) is expected to have on ordinary life, in industry (IIoT), on the battlefield (IoBT), in the medical field (IoMT) and with intelligent-agent feedback in the form of constructive and destructive interference (IoIT). We consider the topic open-ended but with an AI perspective that addresses how the IoE affects sensing, perception, cognition and behavior, or causal relations whether the context is clear or uncertain for mundane decisions, complex decisions on the battlefield, life and death decisions in the medical arena, or decisions affected by intelligent agents and machines. We pay attention to theoretical perspectives for how these “things” may affect individuals, teams and society; and in turn how they may affect these “things”. We are most interested in what may happen when these “things” begin to think. Our ultimate goal is to use AI to advance autonomy and autonomous characteristics to improve the performance of individual agents and hybrid teams of humans, machines, and robots for the betterment of society.
Reports of the 2013 AAAI Spring Symposium Series
Markman, Vita (Disney Interactive Studios) | Stojanov, Georgi (American University of Paris) | Indurkhya, Bipin (International Institute of Information Technology) | Kido, Takashi (Rikengenesis) | Takadama, Keiki (University of Electro-Communications) | Konidaris, George (Massachusetts Institute of Technology) | Eaton, Eric (Bryn Mawr College) | Matsumura, Naohiro (Osaka University) | Fruchter, Renate (Stanford University) | Sofge, Donald (Naval Research Laboratory) | Lawless, William (Paine College) | Madani, Omid (Google) | Sukthankaris, Rahul (Google)
The Association for the Advancement of Artificial Intelligence was pleased to present the AAAI 2013 Spring Symposium Series, held Monday through Wednesday, March 25-27, 2013. The titles of the eight symposia were Analyzing Microtext, Creativity and (Early) Cognitive Development, Data Driven Wellness: From Self-Tracking to Behavior Change, Designing Intelligent Robots: Reintegrating AI II, Lifelong Machine Learning, Shikakeology: Designing Triggers for Behavior Change, Trust and Autonomous Systems, and Weakly Supervised Learning from Multimedia. This report contains summaries of the symposia, written, in most cases, by the cochairs of the symposium.
Reports of the 2013 AAAI Spring Symposium Series
Markman, Vita (Disney Interactive Studios) | Stojanov, Georgi (American University of Paris) | Indurkhya, Bipin (International Institute of Information Technology) | Kido, Takashi (Rikengenesis) | Takadama, Keiki (University of Electro-Communications) | Konidaris, George (Massachusetts Institute of Technology) | Eaton, Eric (Bryn Mawr College) | Matsumura, Naohiro (Osaka University) | Fruchter, Renate (Stanford University) | Sofge, Donald (Naval Research Laboratory) | Lawless, William (Paine College) | Madani, Omid (Google) | Sukthankaris, Rahul (Google)
The Association for the Advancement of Artificial Intelligence was pleased to present the AAAI 2013 Spring Symposium Series, held Monday through Wednesday, March 25-27, 2013. The titles of the eight symposia were Analyzing Microtext, Creativity and (Early) Cognitive Development, Data Driven Wellness: From Self-Tracking to Behavior Change, Designing Intelligent Robots: Reintegrating AI II, Lifelong Machine Learning, Shikakeology: Designing Triggers for Behavior Change, Trust and Autonomous Systems, and Weakly Supervised Learning from Multimedia. This report contains summaries of the symposia, written, in most cases, by the cochairs of the symposium.
Reports of the AAAI 2012 Spring Symposia
Alani, Harith (The Open University) | An, Bo (University of Southern California) | Jain, Manish (University of Southern California) | Kido, Takashi (Rikengenesis) | Konidaris, George (Massachusetts Institute of Technology) | Lawless, William (Paine College) | Martin, David (Apple Computer) | Pantofaru, Caroline (Willow Garage, Inc.) | Sofge, Donald (Naval Research Laboratory) | Takadama, Keiki (University of Electro-Communications) | Tambe, Milind (University of Southern California) | Vitvar, Tomas (Czech Technical University)
The Association for the Advancement of Artificial Intelligence, in cooperation with Stanford University's Department of Computer Science, was pleased to present the 2012 Spring Symposium Series, held Monday through Wednesday, March 26–28, 2012 at Stanford University, Stanford, California USA. The six symposia held were AI, The Fundamental Social Aggregation Challenge (cochaired by W. F. Lawless, Don Sofge, Mark Klein, and Laurent Chaudron); Designing Intelligent Robots (cochaired by George Konidaris, Byron Boots, Stephen Hart, Todd Hester, Sarah Osentoski, and David Wingate); Game Theory for Security, Sustainability, and Health (cochaired by Bo An and Manish Jain); Intelligent Web Services Meet Social Computing (cochaired by Tomas Vitvar, Harith Alani, and David Martin); Self-Tracking and Collective Intelligence for Personal Wellness (cochaired by Takashi Kido and Keiki Takadama); and Wisdom of the Crowd (cochaired by Caroline Pantofaru, Sonia Chernova, and Alex Sorokin). The papers of the six symposia were published in the AAAI technical report series.