Assessing AI Utility: The Random Guesser Test for Sequential Decision-Making Systems