Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters