GAIA: Categorical Foundations of Generative AI
arXiv.org Artificial Intelligence
Figure 1: We propose a hierarchical Generative AI Architecture (GAIA) using higher-order category theory.

Generative AI has become a dominant paradigm for building intelligent systems in the last few years, ranging from large language models built on the widely used Transformer model (Vaswani et al., 2017) or, more recently, on structured state space sequence models (Gu et al., 2022; Yin et al., 2023), to image diffusion algorithms (Song and Ermon, 2019; Yin et al., 2023). We can broadly define the problem of generative AI as the construction, maintenance, and deployment of foundation models (Bommasani et al., 2022), which serve as a storehouse of human knowledge and provide the basic infrastructure for AI across some set of applications. A fundamental question, therefore, is what mathematical basis underlies foundation models. We propose a mathematical framework for a Generative AI Architecture (GAIA) (see Figure 1) based on the hypothesis that category theory (MacLane, 1971; Riehl, 2017; Lurie, 2009) provides a universal mathematical language for foundation models.
Feb-28-2024
- Country:
- Africa
- Ethiopia > Addis Ababa
- Addis Ababa (0.04)
- Rwanda > Kigali
- Kigali (0.04)
- Asia > Japan
- Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.04)
- Europe
- Greece > Central Macedonia
- Thessaloniki (0.04)
- United Kingdom > England
- Cambridgeshire > Cambridge (0.04)
- North America
- Canada > British Columbia
- United States
- California
- Los Angeles County > Long Beach (0.04)
- San Francisco County > San Francisco (0.04)
- Santa Clara County > Palo Alto (0.04)
- Illinois > Cook County
- Chicago (0.04)
- Massachusetts > Hampshire County
- Amherst (0.04)
- New Jersey > Mercer County
- Princeton (0.04)
- New York (0.04)
- Genre:
- Instructional Material (0.67)
- Research Report (0.64)
- Technology: