AGI-Elo: How Far Are We From Mastering A Task?