Data-Centric AI Requires Rethinking Data Notion
Hajij, Mustafa, Zamzmi, Ghada, Ramamurthy, Karthikeyan Natesan, Saenz, Aldo Guzman
The transition towards data-centric AI requires revisiting data notions from mathematical and implementational standpoints to obtain unified data-centric machine learning packages. Towards this end, this work proposes unifying principles offered by categorical and cochain notions of data, and discusses the importance of these principles in data-centric AI transition. In the categorical notion, data is viewed as a mathematical structure that we act upon via morphisms to preserve this structure. As for cochain notion, data can be viewed as a function defined in a discrete domain of interest and acted upon via operators. While these notions are almost orthogonal, they provide a unifying definition to view data, ultimately impacting the way machine learning packages are developed, implemented, and utilized by practitioners.
Oct-13-2021
- Country:
- South America > Chile
- North America > United States
- New York (0.04)
- Florida (0.04)
- California (0.04)
- Genre:
- Research Report (0.40)
- Industry:
- Education (0.55)
- Technology: