AITopics | coup

Collaborating Authors

coup

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Reward Transfer from Inverse Reinforcement Learning: A Coupled Minimax Approach

Hao, Guang-Yuan, van der Laan, Lars, Bibaut, Aurélien, Kallus, Nathan

arXiv.org Machine LearningMay-28-2026

Expert demonstrations, such as those from car drivers, help navigate environments with unknown rewards, but are often collected in controlled settings, such as closed-course test tracks, while learned control policies must be deployed in new environments, such as city streets. We can imitate experts to perform well in the same source environment where demonstrations are observed, and we may even use inverse reinforcement learning (IRL) to improve on simple behavior cloning (Ng and Russell, 2000; Abbeel and Ng, 2004; Ziebart et al., 2008; Fu et al., 2018; Geng et al., 2020). But the target environment may have a different transition law, discount factor, or soft-control regularization. For this, IRL is crucial: we can learn a reward from demonstrations in the source environment and transfer it to the target environment, learning a policy that optimizes the same reward function in a new setting (Fu et al., 2018; Schlaginhaufen and Kamgarpour, 2024). In this paper, we characterize how well this transfer can be done and which approaches are preferable. In particular, we show the value in a coupled approach that takes the target environment into account even when learning from the source. In ordinary offline control, the Bellman equation uses a known reward, so the main statistical error comes from target transitions.

artificial intelligence, machine learning, reinforcement learning, (18 more...)

arXiv.org Machine Learning

2605.27834

Genre: Research Report (0.63)

Industry: Health & Medicine (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)

Add feedback

What's happening in Myanmar's civil war as military holds elections?

Al JazeeraDec-27-2025, 17:45:00 GMT

What's happening in Myanmar's civil war as military holds elections? Voters in parts of Myanmar are heading to the polls on Sunday for an election that critics view as a bid by the country's generals to legitimise military rule, nearly five years after they overthrew the government of Nobel Laureate Aung San Suu Kyi. The multi-phased election is unfolding amid a raging civil war, with ethnic armed groups and opposition militias fighting the military for control of vast stretches of territory, stretching from the borderlands with Bangladesh and India in the west, across the central plains, to the frontiers with China and Thailand in the north and east. Another third will be covered during a second and third phase in January, while voting has been cancelled altogether in the remainder. Fighting, including air raids and arson, has intensified in several areas.

china, military, myanmar, (13 more...)

Al Jazeera

Country:

North America > United States (0.29)
Asia > Thailand (0.25)
Asia > India (0.25)
(16 more...)

Industry:

Government > Military (1.00)
Government > Regional Government (0.70)

Technology: Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles > Drones (0.96)

Add feedback

Practical, Utilitarian Algorithm Configuration

Graham, Devon, Velez, Eros Rojas, Leyton-Brown, Kevin

arXiv.org Artificial IntelligenceNov-17-2025

Utilitarian algorithm configuration identifies a parameter setting for a given algorithm that maximizes a user's utility. Utility functions offer a theoretically well-grounded approach to optimizing decision-making under uncertainty and are flexible enough to capture a user's preferences over algorithm runtimes (e.g., they can describe a sharp cutoff after which a solution is no longer required, a per-hour cost for compute, or diminishing returns from algorithms that take longer to run). COUP is a recently-introduced utilitarian algorithm configuration procedure which was designed mainly to offer strong theoretical guarantees about the quality of the configuration it returns, with less attention paid to its practical performance. This paper closes that gap, bringing theoretically-grounded, utilitarian algorithm configuration to the point where it is competitive with widely used, heuristic configuration procedures that offer no performance guarantees. We present a series of improvements to COUP that improve its empirical performance without degrading its theoretical guarantees and demonstrate their benefit experimentally. Using a case study, we also illustrate ways of exploring the robustness of a given solution to the algorithm selection problem to variations in the utility function.

artificial intelligence, data mining, machine learning, (19 more...)

arXiv.org Artificial Intelligence

2510.14683

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Ensemble Learning (0.68)
Information Technology > Data Science > Data Mining > Big Data (0.46)

Add feedback

Strategy Masking: A Method for Guardrails in Value-based Reinforcement Learning Agents

Keane, Jonathan, Keyser, Sam, Kedziora, Jeremy

arXiv.org Artificial IntelligenceJan-9-2025

The use of reward functions to structure AI learning and decision making is core to the current reinforcement learning paradigm; however, without careful design of reward functions, agents can learn to solve problems in ways that may be considered ``undesirable" or ``unethical. Without thorough understanding of the incentives a reward function creates, it can be difficult to impose principled yet general control mechanisms over its behavior. In this paper, we study methods for constructing guardrails for AI agents that use reward functions to learn decision making. We introduce a novel approach, which we call strategy masking, to explicitly learn and then suppress undesirable AI agent behavior. We apply our method to study lying in AI agents and show that strategy masking can effectively modify agent behavior by suppressing, or actively penalizing, the reward dimension for lying such that agents act more honestly while not compromising their ability to perform effectively.

artificial intelligence, machine learning, reinforcement learning, (17 more...)

arXiv.org Artificial Intelligence

2501.05501

Country: North America > United States (0.29)

Genre: Research Report (0.84)

Industry: Leisure & Entertainment (0.93)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Agents (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Utilitarian Algorithm Configuration for Infinite Parameter Spaces

Graham, Devon, Leyton-Brown, Kevin

arXiv.org Artificial IntelligenceMay-28-2024

Utilitarian algorithm configuration is a general-purpose technique for automatically searching the parameter space of a given algorithm to optimize its performance, as measured by a given utility function, on a given set of inputs. Recently introduced utilitarian configuration procedures offer optimality guarantees about the returned parameterization while provably adapting to the hardness of the underlying problem. However, the applicability of these approaches is severely limited by the fact that they only search a finite, relatively small set of parameters. They cannot effectively search the configuration space of algorithms with continuous or uncountable parameters. In this paper we introduce a new procedure, which we dub COUP (Continuous, Optimistic Utilitarian Procrastination). COUP is designed to search infinite parameter spaces efficiently to find good configurations quickly. Furthermore, COUP maintains the theoretical benefits of previous utilitarian configuration procedures when applied to finite parameter spaces but is significantly faster, both provably and experimentally.

configuration, coup, procedure, (14 more...)

arXiv.org Artificial Intelligence

2405.18246

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > Switzerland > Basel-City > Basel (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.93)
Information Technology > Data Science > Data Mining > Big Data (0.47)

Add feedback

OpenAI's Chief Scientist Made a Tragic Miscalculation

The Atlantic - TechnologyNov-21-2023, 17:38:52 GMT

Ilya Sutskever, bless his heart. Until recently, to the extent that Sutskever was known at all, it was as a brilliant artificial-intelligence researcher. He was the star student who helped Geoffrey Hinton, one of the "godfathers of AI," kick off the so-called deep-learning revolution. In 2015, after a short stint at Google, Sutskever co-founded OpenAI and eventually became its chief scientist; so important was he to the company's success that Elon Musk has taken credit for recruiting him. Still, apart from niche podcast appearances and the obligatory hour-plus back-and-forth with Lex Fridman, Sutskever didn't have much of a public profile before this past weekend.

altman, openai, sutskever, (12 more...)

The Atlantic - Technology

Country:

North America > United States > California > San Francisco County > San Francisco (0.05)
Europe > Middle East (0.05)
Asia > Middle East (0.05)
Africa > Middle East (0.05)

Industry: Information Technology (0.90)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.96)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.82)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.72)

Add feedback

Microsoft 'pulled off a coup' of its own hiring Sam Altman, analysts say

Washington Post - Technology NewsNov-20-2023, 19:16:34 GMT

"If many OpenAI employees choose to migrate to Microsoft to join Mr. Altman and Mr. Brockman, then not only would Microsoft hold a license to OpenAI's (intellectual property) up to (artificial general intelligence, an AI-system that's generally smarter than humans), but Microsoft would also be effectively acquiring OpenAI's core differentiation -- its ambitious and experienced technical talent," Havemeyer added.

coup, microsoft, own hiring sam altman, (1 more...)

Washington Post - Technology News

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

Sam Altman was 'shocked and saddened' after he was fired as CEO of OpenAI

EngadgetNov-18-2023, 05:15:38 GMT

Sam Altman and Greg Brockman were "shocked and saddened by what the board did" and are still trying to figure out what exactly happened. The former CEO and the former President of OpenAI have published a post on X, sharing the details of what they do know and how they found out the former was being fired. Apparently, company co-founder Ilya Sutskever invited Altman for a meeting at noon on Friday, which was then attended by the whole board except for Brockman. It was at that meeting that Altman found out he was being fired and that OpenAI was going to announce it "very soon." Shortly after that, Sutskever reportedly invited Brockman to a separate Google Meet conference, where he was told that Altman had gotten fired and that he was being removed from the board.

altman, openai, sam altman, (11 more...)

Engadget

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.94)

Add feedback

Driven from city life to jungle insurgency

The Japan TimesMar-31-2022, 04:20:33 GMT

On jungle crests about 1 mile from the front lines in eastern Myanmar, a former hotel banquet coordinator slipped his index finger onto the trigger of an assault rifle. A dentist recalled picking larvae from a young fighter's infected bullet wound. A marketing manager described the adapted commercial drones she is directing to foil the enemy. More than a year after Myanmar's military seized full control in a coup -- imprisoning the nation's elected leaders, killing more than 1,700 civilians and arresting at least 13,000 more -- the country is at war, with some unlikely combatants in the fray. On one side is a military junta that, apart from a brief interlude of semidemocratic governance, has ruled with brutal force for a half-century.

myanmar, national unity government, tatmadaw, (12 more...)

The Japan Times

Country:

Asia > Myanmar > Yangon Region > Yangon (0.05)
North America > United States (0.04)
Europe > Ukraine (0.04)
(6 more...)

Industry:

Health & Medicine (1.00)
Law Enforcement & Public Safety (0.95)
Government > Military > Army (0.52)

Technology:

Information Technology > Artificial Intelligence > Robots (0.34)
Information Technology > Communications (0.30)

Add feedback

Oh, This Game Set in Latin America Has a Coup? How Original

WIREDOct-29-2021, 12:00:00 GMT

For quite some time, I've felt a deep unease playing shooting games set in the modern world. While I'm always delighted to have 11-year-olds pulverize me in Fortnite, or to drop into a zombie-infested city for make-believe fun, when it comes to more realistic shooters I get hung up on the details. For games in the Call of Duty or Tom Clancy franchises, these details usually entail an express ride through a soul-crushing wheel of stereotypes and a kaleidoscope of ahistorical musings extracted from a fictional mashup of the Cold War and the war on drugs. Likewise, as a historian of Latin America and someone who grew up in a Mexican-American community on the US–Mexico border, the genre's ongoing obsession with depicting everything south of my hometown as simultaneously exotic, corrupt, and tyrannical is tedious at best and enraging at worst. So when the reviews for Far Cry 6 started trickling into cyberspace, I wasn't surprised to read that the it rehashed all of the worst stereotypes we've come to expect from video games set in Latin America.

cry 6, latin america, video game, (6 more...)

WIRED

Country:

South America (0.93)
North America > Central America (0.93)
North America > Mexico (0.26)
(2 more...)

Industry: Leisure & Entertainment > Games > Computer Games (0.82)

Technology: Information Technology > Artificial Intelligence (0.46)

Add feedback