Single-bit-per-weight deep convolutional neural networks without batch-normalization layers for embedded systems