It's Not That Simple. An Analysis of Simple Test-Time Scaling