Beyond Listenership: AI-Predicted Interventions Drive Improvements in Maternal Health Behaviours
Dasgupta, Arpan, Gharat, Sarvesh, Madhiwalla, Neha, Hegde, Aparna, Tambe, Milind, Taneja, Aparna
Automated voice calls with health information are a proven method for disseminating maternal and child health information among beneficiaries and are deployed in several programs around the world. However, these programs often suffer from beneficiary drop-offs and poor engagement. In previous work, through real-world trials, we showed that an AI model, specifically a restless bandit model, could identify beneficiaries who would benefit most from live service-call interventions, preventing drop-offs and boosting engagement. However, one key question has remained open: does such improved listenership via AI-targeted interventions translate into improved knowledge and health behaviors among beneficiaries? We present a first study that not only shows listenership improvements due to AI interventions but also links these improvements to health behavior changes. Specifically, we demonstrate that AI-scheduled interventions, which enhance listenership, lead to statistically significant improvements in beneficiaries' health behaviors, such as taking iron or calcium supplements in the postnatal period, as well as in their understanding of critical health topics during pregnancy and infancy. This underscores the potential of AI to drive meaningful improvements in maternal and child health.
- Asia > India > Maharashtra > Mumbai (0.04)
- Africa > South Africa (0.04)
- Africa > Nigeria (0.04)
- Research Report > Strength High (1.00)
- Research Report > Experimental Study (1.00)
- Research Report > New Finding (0.68)
The Bandit Whisperer: Communication Learning for Restless Bandits
Zhao, Yunfan, Wang, Tonghan, Nagaraj, Dheeraj, Taneja, Aparna, Tambe, Milind
Applying Reinforcement Learning (RL) to Restless Multi-Arm Bandits (RMABs) offers a promising avenue for addressing allocation problems with resource constraints and temporal dynamics. However, classic RMAB models largely overlook the challenges of (systematic) data errors - a common occurrence in real-world scenarios due to factors like varying data collection protocols and intentional noise for differential privacy. We demonstrate that conventional RL algorithms used to train RMABs can struggle to perform well in such settings. To solve this problem, we propose the first communication learning approach in RMABs, where we study which arms, when involved in communication, are most effective in mitigating the influence of such systematic data errors. In our setup, the arms receive Q-function parameters from similar arms as messages to guide behavioral policies, steering Q-function updates. We learn communication strategies by considering the joint utility of messages across all pairs of arms and using a Q-network architecture that decomposes the joint utility. Both theoretical and empirical evidence validate the effectiveness of our method in significantly improving RMAB performance across diverse problems.
- Asia > India (0.04)
- Oceania > New Zealand (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (5 more...)
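The communication idea in the abstract above — arms receiving Q-function parameters from similar arms as messages to counteract systematic data errors — can be illustrated with a minimal sketch. Everything here (the `Arm` class, the blending rule, the learning rate) is an illustrative assumption, not the paper's actual algorithm:

```python
# Minimal sketch: each "arm" keeps its own Q-table, and an arm with
# systematically biased data blends in Q-values received from a similar,
# cleaner arm as a "message". All names and rules are illustrative.

class Arm:
    def __init__(self, n_states=2, n_actions=2):
        # q[s][a]: estimated value of taking action a in state s
        self.q = [[0.0] * n_actions for _ in range(n_states)]

    def update(self, s, a, reward, lr=0.1):
        # Bandit-style Q update from this arm's own (possibly noisy) data
        self.q[s][a] += lr * (reward - self.q[s][a])

    def receive_message(self, other, weight=0.5):
        # Pull Q-parameters toward those of a similar neighbouring arm
        for s in range(len(self.q)):
            for a in range(len(self.q[s])):
                self.q[s][a] = (1 - weight) * self.q[s][a] + weight * other.q[s][a]

# Usage: an arm whose rewards are systematically under-reported is
# corrected toward a similar arm with accurate data.
clean, noisy = Arm(), Arm()
for _ in range(200):
    clean.update(0, 1, reward=1.0)   # accurate data
    noisy.update(0, 1, reward=0.2)   # systematically biased data
noisy.receive_message(clean, weight=0.5)
```

The paper's contribution is in *learning* which arms should communicate, via a Q-network that decomposes the joint utility of messages; the fixed similarity and blend weight above stand in for that learned strategy.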
Generation of Games for Opponent Model Differentiation
Milec, David, Lisý, Viliam, Kiekintveld, Christopher
Protecting against adversarial attacks is a common multiagent problem. Attackers in the real world are predominantly human actors, and protection methods often incorporate opponent models to improve performance when facing humans. Previous results show that modeling human behavior can significantly improve the performance of the algorithms. However, modeling humans correctly is a complex problem: the models are often simplified, assuming either that humans make mistakes according to some fixed distribution, or that parameters trained on a whole population apply to each individual sampled from it. In this work, we use data gathered by psychologists who identified personality types that increase the likelihood of performing malicious acts. In previous work, however, tests on a handmade game could not reveal strategic differences between the models. We created a novel model that links its parameters to psychological traits, optimized over parametrized games, and generated games in which the differences are profound. Our work can help with automatic game generation when we need a game in which certain models will behave differently, and with identifying situations in which the models do not align.
- Europe > Czechia > Prague (0.05)
- North America > United States > Texas (0.04)
- North America > United States > California > Los Angeles County > Los Angeles (0.04)
- Information Technology > Security & Privacy (0.67)
- Government > Military (0.67)
- Transportation > Infrastructure & Services (0.46)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (0.46)
Multi-defender Security Games with Schedules
Song, Zimeng, Ling, Chun Kai, Fang, Fei
Stackelberg Security Games are often used to model strategic interactions in high-stakes security settings. The majority of existing models focus on single-defender settings where a single entity assumes command of all security assets. However, many realistic scenarios feature multiple heterogeneous defenders with their own interests and priorities embedded in a more complex system. Furthermore, defenders rarely choose targets to protect directly. Instead, they have a multitude of defensive resources or schedules at their disposal, each with different protective capabilities. In this paper, we study security games featuring multiple defenders and schedules simultaneously. We show that, unlike in prior work on multi-defender security games, the introduction of schedules can cause non-existence of equilibrium even in rather restricted environments. We prove that under the mild restriction that any subset of a schedule is also a schedule, an equilibrium is guaranteed to exist and can be computed in polynomial time in games with two defenders. Under additional assumptions, our algorithm can be extended to games with more than two defenders, and its computation scales up in special classes of games with compactly represented schedules, such as those used in patrolling applications. Experimental results suggest that our methods scale gracefully with game size, making our algorithms amongst the few that can tackle multiple heterogeneous defenders.
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- North America > United States > California > Los Angeles County > Los Angeles (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
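The structural condition in the abstract above — that any subset of a schedule is also a schedule — is just subset-closure of the schedule set. A minimal sketch of closing an arbitrary schedule set under this restriction (the representation of schedules as frozensets of target names is an assumption for illustration):

```python
from itertools import combinations

def subset_closure(schedules):
    # Close a set of schedules under taking subsets: for every schedule,
    # add all of its subsets, including the empty schedule.
    closed = set()
    for sched in schedules:
        for r in range(len(sched) + 1):
            for sub in combinations(sorted(sched), r):
                closed.add(frozenset(sub))
    return closed

# Two schedules over hypothetical targets t1..t3
schedules = [frozenset({"t1", "t2"}), frozenset({"t3"})]
closed = subset_closure(schedules)
# closed contains {}, {t1}, {t2}, {t1, t2}, {t3}
```

Intuitively, subset-closure lets a defender "scale back" any schedule, which is what rules out the equilibrium non-existence the paper demonstrates for general schedule sets.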
Learning to Defend by Attacking (and Vice-Versa): Transfer of Learning in Cybersecurity Games
Malloy, Tyler, Gonzalez, Cleotilde
Designing cyber defense systems to account for cognitive biases in human decision making has demonstrated significant success in improving performance against human attackers. However, much of the attention in this area has focused on relatively simple accounts of biases in human attackers, and little is known about adversarial behavior or how defenses could be improved by disrupting attackers' behavior. In this work, we present a novel model of human decision-making inspired by Instance-Based Learning Theory, Theory of Mind, and Transfer of Learning. This model learns from both roles in a security scenario, defender and attacker, and predicts the opponent's beliefs, intentions, and actions. The proposed model can better defend against attacks from a wide range of opponents compared to alternatives that attempt to perform optimally without accounting for human biases. Additionally, the proposed model performs better against a range of human-like behavior by explicitly modeling human transfer of learning, which has not yet been applied to cyber defense scenarios. Results from simulation experiments demonstrate the potential usefulness of cognitively inspired models of agents trained in attack and defense roles, and how these insights could potentially be used in real-world cybersecurity.
- North America > United States > California > Los Angeles County > Los Angeles (0.14)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- Oceania > Australia (0.04)
- Europe > Germany > Bavaria > Lower Franconia > Würzburg (0.04)
- Information Technology > Security & Privacy (1.00)
- Government > Military > Cyberwarfare (0.61)
- Information Technology > Artificial Intelligence > Machine Learning > Transfer Learning (0.92)
- Information Technology > Artificial Intelligence > Cognitive Science > Simulation of Human Behavior (0.89)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Analogical Reasoning (0.83)
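The Instance-Based Learning idea named in the abstract above can be sketched minimally: the agent stores past (option, outcome) instances and values an option by a recency-weighted blend of stored outcomes. The class name, the power-law decay rule, and the parameters below are assumptions for illustration, not the authors' model:

```python
# Minimal Instance-Based Learning (IBL) sketch: value an option by a
# recency-weighted blend of previously observed outcomes for it.

class IBLAgent:
    def __init__(self, decay=0.5):
        self.decay = decay
        self.memory = []  # (step, option, outcome) triples

    def store(self, step, option, outcome):
        self.memory.append((step, option, outcome))

    def blended_value(self, option, now):
        # Power-law recency: more recent instances weigh more.
        weights, values = [], []
        for step, opt, outcome in self.memory:
            if opt == option and now > step:
                weights.append((now - step) ** -self.decay)
                values.append(outcome)
        if not weights:
            return 0.0
        return sum(w * v for w, v in zip(weights, values)) / sum(weights)

# A defender remembers one early success and one recent failure of "block";
# the blended value is pulled toward the more recent failure.
agent = IBLAgent()
agent.store(1, "block", 1.0)   # early success
agent.store(2, "block", 0.0)   # recent failure
v = agent.blended_value("block", now=3)
```

The paper layers Theory of Mind and Transfer of Learning on top of this kind of memory: the same instance store is consulted from both the attacker's and the defender's perspective to predict the opponent's next move.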
Targets in Reinforcement Learning to solve Stackelberg Security Games
Bandyopadhyay, Saptarashmi, Zhu, Chenqi, Daniel, Philip, Morrison, Joshua, Shay, Ethan, Dickerson, John
Reinforcement Learning (RL) algorithms have been successfully applied to real-world problems such as illegal smuggling, poaching, deforestation, climate change and airport security. These scenarios can be framed as Stackelberg security games (SSGs), in which defenders and attackers compete to control target resources. An algorithm's competency is assessed by which agent controls the targets. This review investigates the modeling of SSGs in RL, with a focus on possible improvements to target representations in RL algorithms.
- North America > United States > Maryland > Prince George's County > College Park (0.14)
- North America > United States > California > San Francisco County > San Francisco (0.14)
- North America > United States > Massachusetts > Suffolk County > Boston (0.04)
- (3 more...)
Robust Solutions for Multi-Defender Stackelberg Security Games
Mutzari, Dolev, Aumann, Yonatan, Kraus, Sarit
Multi-defender Stackelberg Security Games (MSSG) have recently gained increasing attention in the literature. However, the solutions offered to date are highly sensitive, wherein even small perturbations in the attacker's utility or slight uncertainties thereof can dramatically change the defenders' resulting payoffs and alter the equilibrium. In this paper, we introduce a robust model for MSSGs, which admits solutions that are resistant to small perturbations or uncertainties in the game's parameters. First, we formally define the notion of robustness, as well as the robust MSSG model. Then, for the non-cooperative setting, we prove the existence of a robust approximate equilibrium in any such game, and provide an efficient construction thereof. For the cooperative setting, we show that any such game admits a robust approximate alpha-core, provide an efficient construction thereof, and prove that stronger types of the core may be empty. Interestingly, the robust solutions can substantially increase the defenders' utilities over those of the non-robust ones.
- North America > United States > California > Los Angeles County > Los Angeles (0.04)
- Asia > Middle East > Israel (0.04)
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Information Technology > Game Theory (1.00)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (0.46)
- Information Technology > Artificial Intelligence > Representation & Reasoning > Uncertainty > Bayesian Inference (0.46)
- Information Technology > Artificial Intelligence > Machine Learning > Learning Graphical Models > Directed Networks > Bayesian Learning (0.46)
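The robustness notion in the abstract above — solutions whose defender payoffs survive small perturbations of the attacker's utility — can be illustrated with a brute-force sketch. Deciding robustness by enumerating the corners of the perturbation box, and all function and parameter names here, are illustrative assumptions, not the paper's construction:

```python
from itertools import product

def attacker_best_target(att_utils):
    # The attacker plays a best response: the target with highest utility.
    return max(range(len(att_utils)), key=lambda t: att_utils[t])

def is_robust(def_utils, att_utils, delta, eps):
    # A profile is treated as (delta, eps)-robust if perturbing each
    # attacker utility by at most delta can cost the defender at most eps.
    # Any best-response flip achievable inside the box [-delta, +delta]^n
    # is achievable at one of its corners, so corners suffice to check.
    base = def_utils[attacker_best_target(att_utils)]
    worst = base
    for signs in product((-delta, delta), repeat=len(att_utils)):
        perturbed = [u + s for u, s in zip(att_utils, signs)]
        worst = min(worst, def_utils[attacker_best_target(perturbed)])
    return base - worst <= eps

# Usage: with a small perturbation bound the attacker's best response
# cannot flip, so the defender's payoff is stable; with a larger bound
# the flip to the defender's bad target becomes possible.
stable = is_robust(def_utils=[5, 1], att_utils=[0.55, 0.40], delta=0.05, eps=0.5)
fragile = is_robust(def_utils=[5, 1], att_utils=[0.55, 0.40], delta=0.2, eps=0.5)
```

This brute force is exponential in the number of targets; the paper's contribution is proving existence of robust approximate equilibria and alpha-cores and constructing them efficiently.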
Computer Conservation: Lily Xu Uses Artificial Intelligence To Stop Poaching Around the World
Lily Xu knew from a young age how much the environment and conservation mattered to her. By 9 years old, she'd already decided to eat vegetarian because, as she put it, "I didn't want to hurt animals." Xu grew up believing her passions would always be separate from her professional interest in computer science. Then she became a graduate student in Milind Tambe's Teamcore Lab, and everything changed. Xu is now doing award-winning research into using machine learning and artificial intelligence to help conservation and anti-poaching efforts around the world.
- Asia > Cambodia (0.08)
- North America > United States > Rhode Island (0.05)
- North America > United States > Maryland (0.05)
- North America > United States > District of Columbia > Washington (0.05)
PAWS anti-poaching AI predicts where illegal hunters will show up next
The illegal animal trade is a global scourge but a lucrative one, worth $8 billion to $10 billion annually, according to the United Nations Office on Drugs and Crime (UNODC) -- trailing only human, drug and weapons trafficking in value. With so much money to be made, conservationists and wildlife rangers face overwhelming odds against well-organized poaching operations fueled by incessant demand for illicit animal products. The results of this protracted conflict have been nothing short of devastating for the species caught in the middle. At the start of the 20th century, more than 100,000 tigers are estimated to have roamed throughout Southeast Asia. Today, due to a combination of habitat loss and aggressive poaching, fewer than 4,000 remain in the wild.
Google lab to boost AI research in India
Google has announced Google Research India, an artificial intelligence research team in Bangalore, Karnataka, that will focus on advancing computer science and applying AI research to solve big problems in healthcare, agriculture and education, among other areas. The company said artificial intelligence is opening up the next phase of the technology revolution, and that India, with its world-class engineering talent, strong computer science programs and entrepreneurial drive, has the potential to lead the way in using it to tackle big challenges. In fact, there are already many examples of this happening in India, from detecting diabetic eye disease to improving flood forecasting and teaching kids to read. To take this trend further, Google has set up the Google Research India lab, which will focus on two pillars: advancing fundamental computer science and AI research by building a strong team and partnering with the research community across the country, and applying this research to tackle big problems in core areas. Google Research India will be headed by Manish Gupta, a computer scientist and fellow of the Association for Computing Machinery, with a background in deep learning across video analysis and education, compilers and computer systems.