Street-Level AI: Are Large Language Models Ready for Real-World Judgments?