Through the Judge's Eyes: Inferred Thinking Traces Improve Reliability of LLM Raters

Open in new window