Goto

Collaborating Authors

 human team


Evaluating AI cyber capabilities with crowdsourced elicitation

arXiv.org Artificial Intelligence

As AI systems become increasingly capable, understanding their offensive cyber potential is critical for informed governance and responsible deployment. However, it's hard to accurately bound their capabilities, and some prior evaluations dramatically underestimated them. The art of extracting maximum task-specific performance from AIs is called "AI elicitation", and today's safety organizations typically conduct it in-house. In this paper, we explore crowdsourcing elicitation efforts as an alternative to in-house elicitation work. We host open-access AI tracks at two Capture The Flag (CTF) competitions: AI vs. Humans (400 teams) and Cyber Apocalypse (8000 teams). The AI teams achieve outstanding performance at both events, ranking top-5% and top-10% respectively for a total of \$7500 in bounties. This impressive performance suggests that open-market elicitation may offer an effective complement to in-house elicitation. We propose elicitation bounties as a practical mechanism for maintaining timely, cost-effective situational awareness of emerging AI capabilities. Another advantage of open elicitations is the option to collect human performance data at scale. Applying METR's methodology, we found that AI agents can reliably solve cyber challenges requiring one hour or less of effort from a median human CTF participant.


GRACE: A Granular Benchmark for Evaluating Model Calibration against Human Calibration

arXiv.org Artificial Intelligence

Language models are often miscalibrated, leading to confidently incorrect answers. We introduce GRACE, a benchmark for language model calibration that incorporates comparison with human calibration. GRACE consists of question-answer pairs, in which each question contains a series of clues that gradually become easier, all leading to the same answer; models must answer correctly as early as possible as the clues are revealed. This setting permits granular measurement of model calibration based on how early, accurately, and confidently a model answers. After collecting these questions, we host live human vs. model competitions to gather 1,749 data points on human and model teams' timing, accuracy, and confidence. We propose a metric, CalScore, that uses GRACE to analyze model calibration errors and identify types of model miscalibration that differ from human behavior. We find that although humans are less accurate than models, humans are generally better calibrated. Since state-of-the-art models struggle on GRACE, it effectively evaluates progress on improving model calibration.


Probabilistic Modeling of Human Teams to Infer False Beliefs

arXiv.org Artificial Intelligence

We develop a probabilistic graphical model (PGM) for artificially intelligent (AI) agents to infer human beliefs during a simulated urban search and rescue (USAR) scenario executed in a Minecraft environment with a team of three players. The PGM approach makes observable states and actions explicit, as well as beliefs and intentions grounded by evidence about what players see and do over time. This approach also supports inferring the effect of interventions, which are vital if AI agents are to assist human teams. The experiment incorporates manipulations of players' knowledge, and the virtual Minecraft-based testbed provides access to several streams of information, including the objects in the players' field of view. The participants are equipped with a set of marker blocks that can be placed near room entrances to signal the presence or absence of victims in the rooms to their teammates. In each team, one of the members is given a different legend for the markers than the other two, which may mislead them about the state of the rooms; that is, they will hold a false belief. We extend previous works in this field by introducing ToMCAT, an AI agent that can reason about individual and shared mental states. We find that the players' behaviors are affected by what they see in their in-game field of view, their beliefs about the meaning of the markers, and their beliefs about which meaning the team decided to adopt. In addition, we show that ToMCAT's beliefs are consistent with the players' actions and that it can infer false beliefs with accuracy significantly better than chance and comparable to inferences made by human observers.


Yes, ChatGPT Is Sentient -- Because It's Really Humans in the Loop

#artificialintelligence

OpenAI, recently released a new AI program called ChatGPT. It left the internet gobsmacked, though some were skeptical, and concerned about its abilities. The really amazing thing is ChatGPT's humanlike responses. They gives an observer an unnerving suspicion that the AI is actually sentient. Maybe it is actually sentient.


Why AI and autonomous response are crucial for cybersecurity (VB On-Demand)

#artificialintelligence

Today, cybersecurity is in a state of continuous growth and improvement. In this on-demand webinar, learn how two organizations use a continuous AI feedback loop to identify vulnerabilities, harden defenses and improve the outcomes of their cybersecurity programs. The security risk landscape is in tremendous flux, and the traditional on-premises approach to cybersecurity is no longer enough. Remote work has become the norm, and outside the office walls, employees are letting down their personal security defenses. Cyber risks introduced by the supply chain via third parties are still a major vulnerability, so organizations need to think about not only their defenses but those of their suppliers to protect their priority assets and information from infiltration and exploitation.


Inside the Air Force Training Program that Will Pit Human Pilots Against AI

#artificialintelligence

Air Force fighter pilots will soon face new opponents in their training: artificial intelligence-based enemy pilots that can match humans based on their personal learning needs. After steering the production of numerous AI-enabled pilot agents for years, Aptima, Inc. confirmed it landed a four-year contract with the Air Force Research Laboratory to build an "automated librarian" that will categorize those AI pilots and pair them with military trainees in scenarios that are right to advance their skillsets. "The best case outcome is that AFRL determines that the products of this research are so promising that they create a library into which AI training technologies are shelved like books are shelved and they refine the sort of librarian that we're trying to build here so that it can sweep through that enormous library of AI, sweep through a library of scenarios--and for each individual student--pick out just the right pairing to advance them to expertise reliably and more quickly than we can do today," Aptima's Chief Scientist Jared Freeman told Nextgov during an interview on Tuesday. Freeman joined the company in 1999, four years after its launch. Aptima's project portfolio has grown increasingly diverse since then, he noted. Now, much of it concerns AI support for human teams, like forming and measuring them, and helping people and AI to manage those groups.


Cooperative Assistance in Robotic Surgery through Multi-Agent Reinforcement Learning

arXiv.org Artificial Intelligence

Cognitive cooperative assistance in robot-assisted surgery holds the potential to increase quality of care in minimally invasive interventions. Automation of surgical tasks promises to reduce the mental exertion and fatigue of surgeons. In this work, multi-agent reinforcement learning is demonstrated to be robust to the distribution shift introduced by pairing a learned policy with a human team member. Multi-agent policies are trained directly from images in simulation to control multiple instruments in a sub task of the minimally invasive removal of the gallbladder. These agents are evaluated individually and in cooperation with humans to demonstrate their suitability as autonomous assistants. Compared to human teams, the hybrid teams with artificial agents perform better considering completion time (44.4% to 71.2% shorter) as well as number of collisions (44.7% to 98.0% fewer). Path lengths, however, increase under control of an artificial agent (11.4% to 33.5% longer). A multi-agent formulation of the learning problem was favored over a single-agent formulation on this surgical sub task, due to the sequential learning of the two instruments. This approach may be extended to other tasks that are difficult to formulate within the standard reinforcement learning framework. Multi-agent reinforcement learning may shift the paradigm of cognitive robotic surgery towards seamless cooperation between surgeons and assistive technologies.


Can AI be used in cybersecurity? You asked, we answered!

#artificialintelligence

How AI enhances security for IoT environments. Elon Musk's prediction that AI will outsmart humans in less than 5 years is a bold statement, predicting that machines will possess super-human qualities which help boost organizations' profits and goals. For many, these ideas belong in sci-fi fantasies rather than as a future fixture of working practices. In the broadest sense, there are no signs that AI comes close to human consciousness or sentience. When we talk about the power of AI, it's more helpful to consider the specific use cases and sectors where it will, and is having, a transformative effect – and there is one area in particular where AI has been seen to mimic the capabilities of complex human thought processes: cyber security.


Can AI be used in cybersecurity? You asked, we answered!

#artificialintelligence

Elon Musk's prediction that AI will outsmart humans in less than 5 years is a bold statement, predicting that machines will possess super-human qualities which help boost organizations' profits and goals. For many, these ideas belong in sci-fi fantasies rather than as a future fixture of working practices. In the broadest sense, there are no signs that AI comes close to human consciousness or sentience. When we talk about the power of AI, it's more helpful to consider the specific use cases and sectors where it will, and is having, a transformative effect – and there is one area in particular where AI has been seen to mimic the capabilities of complex human thought processes: cyber security. For organizations seeing more and more attacks against their digital infrastructure, cyber security is a top priority.


AI Emerges As A Major Player In The Race To Find Covid-19 Therapies And Vaccines

#artificialintelligence

Covid-19 is the new Manhattan Project and AI emerges as a major player in it. Covid-19 research has quickly created unprecedented amounts of publicly available research data from federal governments, industry, and university research labs at record rates. For example, the Covid-19 Open Research Dataset (CORD-19) created by the Allen Institute for AI in collaboration with government agencies, universities, and industry partners started with 13,000 Covid-19 scholarly articles. Two months later, it had grown to over 128K articles. Research data on a topic normally takes years, not months to grow that large.