Responsible Reporting for Frontier AI Development
Kolt, Noam, Anderljung, Markus, Barnhart, Joslyn, Brass, Asher, Esvelt, Kevin, Hadfield, Gillian K., Heim, Lennart, Rodriguez, Mikel, Sandbrink, Jonas B., Woodside, Thomas
–arXiv.org Artificial Intelligence
Mitigating the risks from frontier AI systems requires up-to-date and reliable information about those systems. Organizations that develop and deploy frontier systems have significant access to such information. By reporting safety-critical information to actors in government, industry, and civil society, these organizations could improve visibility into new and emerging risks posed by frontier systems. Equipped with this information, developers could make better informed decisions on risk management, while policymakers could design more targeted and robust regulatory infrastructure. We outline the key features of responsible reporting and propose mechanisms for implementing them in practice. Evaluate current models for novel risks (including risks discovered by other organizations) Update model safeguards and risk mitigations Developer Other developers (e.g., revise scaling policy, security practices) Documents and Evaluates information Consult with domain experts in government reports safety and decides on (e.g., experts in national security, public health) information response plan Solicit additional information from developer Government actor (e.g., design decisions, organizational processes) Request or conduct further safety evaluations (incl. in collaboration with independent auditors) Domain experts in
arXiv.org Artificial Intelligence
Apr-3-2024
- Country:
- North America
- United States
- Texas (0.04)
- Minnesota (0.04)
- Massachusetts (0.04)
- Arizona (0.04)
- New York > New York County
- New York City (0.05)
- Georgia > Fulton County
- Atlanta (0.04)
- California > Santa Clara County
- Palo Alto (0.04)
- Canada > Ontario
- Toronto (0.14)
- United States
- Europe > United Kingdom
- England > Oxfordshire > Oxford (0.14)
- Africa > Eswatini
- North America
- Genre:
- Research Report (0.64)
- Industry:
- Law > Statutes (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government
- Technology: