cocopie
CoCoPIE
Many believe the company that enables real intelligence on end devices (such as mobile and IoT devices) will define the future of computing. Racing toward this goal, many companies, whether tech giants such as Google, Microsoft, Amazon, Apple and Facebook, or startups spent tens of billions of dollars each year on R&D. Assuming hardware is the major constraint for enabling real-time mobile intelligence, more companies dedicate their main efforts to developing specialized hardware accelerators for machine learning and inference. Billions of dollars have been spent to fuel this intelligent hardware race. This article challenges the view.
CoCoPIE: A software solution for putting real artificial intelligence in smaller spaces
Bit by bit, byte by byte, artificial intelligence has been working its way into public consciousness and into everyday computer use. Artificial intelligence and deep learning have been deeply woven into more and more aspects of end-user computing. Smartphones and other mobile devices use AI as well. Up until now, the artificial intelligence work has been done in the cloud, but a new approach to software design aims to arm mobile devices with real artificial-intelligence capability. "A mobile device is very resource-constrained," explained William & Mary computer scientist Bin Ren.
CoCoPIE: Making Mobile AI Sweet As PIE --Compression-Compilation Co-Design Goes a Long Way
Liu, Shaoshan, Ren, Bin, Shen, Xipeng, Wang, Yanzhi
Assuming hardware is the major constraint for enabling real-time mobile intelligence, the industry has mainly dedicated their efforts to developing specialized hardware accelerators for machine learning and inference. This article challenges the assumption. By drawing on a recent real-time AI optimization framework CoCoPIE, it maintains that with effective compression-compiler co-design, it is possible to enable real-time artificial intelligence on mainstream end devices without special hardware. CoCoPIE is a software framework that holds numerous records on mobile AI: the first framework that supports all main kinds of DNNs, from CNNs to RNNs, transformer, language models, and so on; the fastest DNN pruning and acceleration framework, up to 180X faster compared with current DNN pruning on other frameworks such as TensorFlow-Lite; making many representative AI applications able to run in real-time on off-the-shelf mobile devices that have been previously regarded possible only with special hardware support; making off-the-shelf mobile devices outperform a number of representative ASIC and FPGA solutions in terms of energy efficiency and/or performance.