Efficient FPGA Implementation of Time-Domain Popcount for Low-Complexity Machine Learning