Concentration of measure for non-linear random matrices with applications to neural networks and non-commutative polynomials