An AI System Evaluation Framework for Advancing AI Safety: Terminology, Taxonomy, Lifecycle Mapping

Xia, Boming, Lu, Qinghua, Zhu, Liming, Xing, Zhenchang

May-15-2024–arXiv.org Artificial Intelligence

The advent of advanced AI underscores the urgent need for comprehensive safety evaluations, necessitating collaboration across communities (i.e., AI, software engineering, and governance). However, divergent practices and terminologies across these communities, combined with the complexity of AI systems-of which models are only a part-and environmental affordances (e.g., access to tools), obstruct effective communication and comprehensive evaluation. This paper proposes a framework for AI system evaluation comprising three components: 1) harmonised terminology to facilitate communication across communities involved in AI safety evaluation; 2) a taxonomy identifying essential elements for AI system evaluation; 3) a mapping between AI lifecycle, stakeholders, and requisite evaluations for accountable AI supply chain. This framework catalyses a deeper discourse on AI system evaluation beyond model-centric approaches.

ai model, ai system, evaluation, (10 more...)

arXiv.org Artificial Intelligence

May-15-2024

arXiv.org PDF

Add feedback

Country:
- North America > United States (0.28)
- Oceania > Australia
  - New South Wales > Sydney (0.05)
  - Australian Capital Territory > Canberra (0.04)
- Africa > Eswatini
  - Manzini > Manzini (0.04)

Genre:
- Research Report (0.40)

Industry:
- Law (1.00)
- Information Technology > Security & Privacy (1.00)
- Government (0.94)

Technology:
- Information Technology > Artificial Intelligence
  - Natural Language (1.00)
  - Issues > Social & Ethical Issues (1.00)

Duplicate Docs Excel Report

Title
None found

Similar Docs Excel Report more

Title	Similarity	Source
None found