DataCI: A Platform for Data-Centric AI on Streaming Data
Zhang, Huaizheng, Huang, Yizheng, Li, Yuanming
–arXiv.org Artificial Intelligence
We introduce DataCI, a comprehensive open-source platform designed specifically for data-centric AI in dynamic streaming data settings. DataCI provides 1) an infrastructure with rich APIs for seamless streaming dataset management, data-centric pipeline development and evaluation on streaming scenarios, 2) an carefully designed versioning control function to track the pipeline lineage, and 3) an intuitive graphical interface for a better interactive user experience. Preliminary studies and demonstrations attest to the easy-to-use and effectiveness of DataCI, highlighting its potential to revolutionize the practice of data-centric AI in streaming data contexts.
arXiv.org Artificial Intelligence
Jul-3-2023
- Country:
- North America > United States > Hawaii > Honolulu County > Honolulu (0.04)
- Genre:
- Research Report (0.40)
- Technology:
- Information Technology
- Artificial Intelligence (1.00)
- Communications > Networks (0.91)
- Data Science (0.95)
- Human Computer Interaction (0.70)
- Information Technology