Clustering units in neural networks: upstream vs downstream information