Deploy Large-Scale Deep Neural Networks in Resource Constrained IoT Devices with Local Quantization Region