Trustworthy, Responsible, and Safe AI: A Comprehensive Architectural Framework for AI Safety with Challenges and Mitigations
Chen, Chen, Liu, Ziyao, Jiang, Weifeng, Goh, Si Qi, Lam, Kwok-Yan
–arXiv.org Artificial Intelligence
AI Safety is an emerging area of critical importance to the safe adoption and deployment of AI systems. With the rapid proliferation of AI and especially with the recent advancement of Generative AI (or GAI), the technology ecosystem behind the design, development, adoption, and deployment of AI systems has drastically changed, broadening the scope of AI Safety to address impacts on public safety and national security. In this paper, we propose a novel architectural framework for understanding and analyzing AI Safety; defining its characteristics from three perspectives: Trustworthy AI, Responsible AI, and Safe AI. We provide an extensive review of current research and advancements in AI safety from these perspectives, highlighting their key challenges and mitigation approaches. Through examples from state-of-the-art technologies, particularly Large Language Models (LLMs), we present innovative mechanism, methodologies, and techniques for designing and testing AI safety. Our goal is to promote advancement in AI safety research, and ultimately enhance people's trust in digital transformation.
arXiv.org Artificial Intelligence
Sep-12-2024
- Country:
- South America > Colombia
- Meta Department > Villavicencio (0.04)
- Oceania > Australia
- Victoria > Melbourne (0.13)
- New South Wales > Sydney (0.04)
- North America
- Dominican Republic (0.04)
- United States
- Maryland > Baltimore (0.04)
- District of Columbia > Washington (0.04)
- Texas > Travis County
- Austin (0.04)
- Minnesota > Hennepin County
- Minneapolis (0.14)
- Nevada > Clark County
- Las Vegas (0.04)
- Colorado > Denver County
- Denver (0.13)
- Hawaii > Honolulu County
- Honolulu (0.04)
- New York
- New York County > New York City (0.14)
- Kings County > New York City (0.13)
- Richmond County > New York City (0.04)
- Queens County > New York City (0.04)
- Bronx County > New York City (0.04)
- Erie County > Buffalo (0.04)
- Virginia > Arlington County
- Arlington (0.04)
- Louisiana > Orleans Parish
- New Orleans (0.04)
- Pennsylvania > Philadelphia County
- Philadelphia (0.13)
- New Mexico > Santa Fe County
- Santa Fe (0.04)
- Georgia > Fulton County
- Atlanta (0.04)
- Washington > King County
- Tennessee > Shelby County
- Memphis (0.04)
- Alaska > Anchorage Municipality
- Anchorage (0.04)
- California
- San Francisco County > San Francisco (0.14)
- San Diego County > San Diego (0.04)
- Orange County > Irvine (0.04)
- Los Angeles County
- Long Beach (0.04)
- Pasadena (0.04)
- Canada
- Ontario > Toronto (0.04)
- Quebec > Montreal (0.04)
- British Columbia > Metro Vancouver Regional District
- Vancouver (0.14)
- Alberta > Census Division No. 15
- Improvement District No. 9 > Banff (0.14)
- Europe
- Austria (0.04)
- Sweden > Stockholm
- Stockholm (0.04)
- Slovenia > Central Slovenia
- Municipality of Ljubljana > Ljubljana (0.04)
- Middle East > Malta
- Eastern Region > Northern Harbour District > St. Julian's (0.04)
- Germany
- Berlin (0.04)
- North Rhine-Westphalia > Düsseldorf Region
- Düsseldorf (0.04)
- Denmark > Capital Region
- Copenhagen (0.04)
- Finland
- Uusimaa > Helsinki (0.04)
- Northern Ostrobothnia > Oulu (0.04)
- United Kingdom > England
- Oxfordshire > Oxford (0.13)
- Greater London > London (0.04)
- Cambridgeshire > Cambridge (0.04)
- Latvia > Lubāna Municipality
- Lubāna (0.04)
- Portugal > Lisbon
- Lisbon (0.04)
- France
- Île-de-France > Paris
- Paris (0.04)
- Auvergne-Rhône-Alpes > Lyon
- Lyon (0.04)
- Île-de-France > Paris
- Italy
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Belgium
- Brussels-Capital Region > Brussels (0.04)
- Flanders > Flemish Brabant
- Leuven (0.04)
- Netherlands > South Holland
- Delft (0.04)
- Croatia > Dubrovnik-Neretva County
- Dubrovnik (0.04)
- Ireland > Leinster
- County Dublin > Dublin (0.04)
- Poland > Lesser Poland Province
- Kraków (0.04)
- Asia
- Singapore (0.04)
- Indonesia > Bali (0.04)
- Macao (0.04)
- Nepal (0.04)
- Taiwan > Taiwan Province
- Taipei (0.04)
- Myanmar > Tanintharyi Region
- Dawei (0.04)
- South Korea > Seoul
- Seoul (0.04)
- China
- Hong Kong (0.04)
- Jiangsu Province > Yancheng (0.04)
- Beijing > Beijing (0.04)
- Middle East
- Japan > Kyūshū & Okinawa
- Okinawa (0.04)
- Africa
- Rwanda > Kigali
- Kigali (0.04)
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Eswatini > Manzini
- Manzini (0.04)
- Central African Republic > Ombella-M'Poko
- Bimbo (0.04)
- Rwanda > Kigali
- South America > Colombia
- Genre:
- Overview (1.00)
- Research Report
- Experimental Study (0.92)
- New Finding (0.92)
- Industry:
- Media > News (1.00)
- Law Enforcement & Public Safety > Crime Prevention & Enforcement (1.00)
- Education (1.00)
- Law
- Information Technology
- Services (1.00)
- Security & Privacy (1.00)
- Health & Medicine > Therapeutic Area
- Psychiatry/Psychology (0.67)
- Government
- Military (1.00)
- Voting & Elections (0.92)
- Regional Government > North America Government
- United States Government (1.00)
- Technology:
- Information Technology > Artificial Intelligence
- Representation & Reasoning > Agents (1.00)
- Issues > Social & Ethical Issues (1.00)
- Applied AI (1.00)
- Natural Language
- Large Language Model (1.00)
- Chatbot (1.00)
- Machine Learning
- Reinforcement Learning (1.00)
- Neural Networks > Deep Learning
- Generative AI (0.48)
- Information Technology > Artificial Intelligence