Ternary Hybrid Neural-Tree Networks for Highly Constrained IoT Applications
Gope, Dibakar; Dasika, Ganesh; Mattina, Matthew
Machine learning-based applications are increasingly prevalent in IoT devices. The power and storage constraints of these devices make it particularly challenging to run modern neural networks, limiting the number of new applications that can be deployed on an IoT system. A number of compression techniques have been proposed, each with its own trade-offs. We propose a hybrid network that combines the strengths of current neural- and tree-based learning techniques with ternary quantization, and present a detailed analysis of the associated model design space. Using this hybrid model, we obtained an 11.1% reduction in the number of computations, a 52.2% reduction in model size, and a 30.6% reduction in overall memory footprint relative to a state-of-the-art keyword-spotting neural network, with negligible loss in accuracy.
Mar-4-2019