Data-dependent compression of random features for large-scale kernel approximation