Emerging Practices in Frontier AI Safety Frameworks
Buhl, Marie Davidsen, Bucknall, Ben, Masterson, Tammy
–arXiv.org Artificial Intelligence
At the AI Seoul Summit in 2024, a number o f AI developers signed on to the Frontier AI Safety Commitments, agreeing to develop a safety framework outlining how they will manage severe risks that their frontier AI systems may pose ( DSIT, 2024) . Since then, a research field has begun to emerge, with a diverse array of researchers from companies, governments, academi a and other third - party research organi s ations publishing work on how to write and implement an effective safety framework . S ignatories to the commitments are due to publish safety frameworks shortly, in time for the Paris AI Action Summit. This paper summarises emerging practice s - practices that appear promising and are gaining expert recognition - for safety frameworks as identified by this new research field. We draw on both the safety frameworks published so far, literature and standards on frontier AI risk management (as well as risk management more broadly), internal research by the UK AI Safety Institute, and the Frontier AI Safety Commitments themselves.
arXiv.org Artificial Intelligence
Feb-5-2025
- Country:
- Asia
- India > Tamil Nadu
- Chennai (0.04)
- South Korea > Seoul
- Seoul (0.24)
- India > Tamil Nadu
- Europe > Latvia
- Lubāna Municipality > Lubāna (0.04)
- North America > United States
- Florida > Palm Beach County
- Boca Raton (0.04)
- New York (0.04)
- Florida > Palm Beach County
- Oceania > Papua New Guinea
- Gulf Province > Kerema (0.04)
- Asia
- Genre:
- Research Report (1.00)
- Industry:
- Government (1.00)
- Information Technology > Security & Privacy (1.00)
- Technology: