Responsible Reporting for Frontier AI Development

Kolt, Noam, Anderljung, Markus, Barnhart, Joslyn, Brass, Asher, Esvelt, Kevin, Hadfield, Gillian K., Heim, Lennart, Rodriguez, Mikel, Sandbrink, Jonas B., Woodside, Thomas

Apr-3-2024–arXiv.org Artificial Intelligence

Mitigating the risks from frontier AI systems requires up-to-date and reliable information about those systems. Organizations that develop and deploy frontier systems have significant access to such information. By reporting safety-critical information to actors in government, industry, and civil society, these organizations could improve visibility into new and emerging risks posed by frontier systems. Equipped with this information, developers could make better informed decisions on risk management, while policymakers could design more targeted and robust regulatory infrastructure. We outline the key features of responsible reporting and propose mechanisms for implementing them in practice. Evaluate current models for novel risks (including risks discovered by other organizations) Update model safeguards and risk mitigations Developer Other developers (e.g., revise scaling policy, security practices) Documents and Evaluates information Consult with domain experts in government reports safety and decides on (e.g., experts in national security, public health) information response plan Solicit additional information from developer Government actor (e.g., design decisions, organizational processes) Request or conduct further safety evaluations (incl. in collaboration with independent auditors) Domain experts in

arxiv, developer, information, (15 more...)

arXiv.org Artificial Intelligence

Apr-3-2024

arXiv.org PDF

Add feedback

Country:
- North America
  - United States
    - Texas (0.04)
    - Minnesota (0.04)
    - Massachusetts (0.04)
    - Arizona (0.04)
    - New York > New York County
      - New York City (0.05)
    - Georgia > Fulton County
      - Atlanta (0.04)
    - California > Santa Clara County
      - Palo Alto (0.04)
  - Canada > Ontario
    - Toronto (0.14)
- Europe > United Kingdom
  - England > Oxfordshire > Oxford (0.14)
- Africa > Eswatini
  - Manzini > Manzini (0.04)

Genre:
- Research Report (0.64)

Industry:
- Law > Statutes (1.00)
- Information Technology > Security & Privacy (1.00)
- Government > Regional Government
  - North America Government > United States Government (1.00)

Technology:
- Information Technology
  - Security & Privacy (1.00)
  - Artificial Intelligence
    - Natural Language > Large Language Model (1.00)
    - Issues > Social & Ethical Issues (1.00)
    - Machine Learning > Neural Networks
      - Deep Learning (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found