Trustworthy, Responsible, and Safe AI: A Comprehensive Architectural Framework for AI Safety with Challenges and Mitigations
Chen, Chen, Liu, Ziyao, Jiang, Weifeng, Goh, Si Qi, Lam, Kwok-Yan
–arXiv.org Artificial Intelligence
AI Safety is an emerging area of critical importance to the safe adoption and deployment of AI systems. With the rapid proliferation of AI and especially with the recent advancement of Generative AI (or GAI), the technology ecosystem behind the design, development, adoption, and deployment of AI systems has drastically changed, broadening the scope of AI Safety to address impacts on public safety and national security. In this paper, we propose a novel architectural framework for understanding and analyzing AI Safety; defining its characteristics from three perspectives: Trustworthy AI, Responsible AI, and Safe AI. We provide an extensive review of current research and advancements in AI safety from these perspectives, highlighting their key challenges and mitigation approaches. Through examples from state-of-the-art technologies, particularly Large Language Models (LLMs), we present innovative mechanism, methodologies, and techniques for designing and testing AI safety. Our goal is to promote advancement in AI safety research, and ultimately enhance people's trust in digital transformation.
arXiv.org Artificial Intelligence
Sep-12-2024
- Country:
- Africa
- Central African Republic > Ombella-M'Poko
- Bimbo (0.04)
- Eswatini > Manzini
- Manzini (0.04)
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Central African Republic > Ombella-M'Poko
- Asia
- Nepal (0.04)
- Indonesia > Bali (0.04)
- Macao (0.04)
- Japan > Kyūshū & Okinawa
- Okinawa (0.04)
- Middle East
- China
- Beijing > Beijing (0.04)
- Hong Kong (0.04)
- Jiangsu Province > Yancheng (0.04)
- South Korea > Seoul
- Seoul (0.04)
- Singapore (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Europe
- Poland > Lesser Poland Province
- Kraków (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Netherlands > South Holland
- Delft (0.04)
- Belgium
- Brussels-Capital Region > Brussels (0.04)
- Flanders > Flemish Brabant
- Leuven (0.04)
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Italy
- France
- Auvergne-Rhône-Alpes > Lyon
- Lyon (0.04)
- Île-de-France > Paris
- Paris (0.04)
- Auvergne-Rhône-Alpes > Lyon
- Portugal > Lisbon
- Lisbon (0.04)
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- Greater London > London (0.04)
- Oxfordshire > Oxford (0.13)
- Finland
- Northern Ostrobothnia > Oulu (0.04)
- Uusimaa > Helsinki (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Germany
- Berlin (0.04)
- North Rhine-Westphalia > Düsseldorf Region
- Düsseldorf (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Austria (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Poland > Lesser Poland Province
- North America
- Canada
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.14)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Ontario > Toronto (0.04)
- Quebec > Montreal (0.04)
- Alberta > Census Division No. 15
- Dominican Republic (0.04)
- United States
- California
- Los Angeles County
- Long Beach (0.04)
- Pasadena (0.04)
- Orange County > Irvine (0.04)
- San Diego County > San Diego (0.04)
- San Francisco County > San Francisco (0.14)
- Los Angeles County
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- Tennessee > Shelby County
- Memphis (0.04)
- District of Columbia > Washington (0.04)
- Washington > King County
- Georgia > Fulton County
- Atlanta (0.04)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.13)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- New York
- Bronx County > New York City (0.04)
- Erie County > Buffalo (0.04)
- Kings County > New York City (0.13)
- New York County > New York City (0.14)
- Queens County > New York City (0.04)
- Richmond County > New York City (0.04)
- Hawaii > Honolulu County
- Honolulu (0.04)
- Maryland > Baltimore (0.04)
- Colorado > Denver County
- Denver (0.13)
- Nevada > Clark County
- Las Vegas (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Texas > Travis County
- Austin (0.04)
- California
- Canada
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Victoria > Melbourne (0.13)
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Africa
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.92)
- New Finding (0.92)
- Industry:
- Education (1.00)
- Government
- Military (1.00)
- Regional Government > North America Government
- United States Government (1.00)
- Voting & Elections (0.92)
- Health & Medicine > Therapeutic Area
- Psychiatry/Psychology (0.67)
- Information Technology
- Security & Privacy (1.00)
- Services (1.00)
- Law
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Media > News (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Applied AI (1.00)
- Issues > Social & Ethical Issues (1.00)
- Machine Learning
- Neural Networks > Deep Learning
- Generative AI (0.48)
- Reinforcement Learning (1.00)
- Neural Networks > Deep Learning
- Natural Language
- Chatbot (1.00)
- Large Language Model (1.00)
- Representation & Reasoning > Agents (1.00)
- Information Technology > Artificial Intelligence