Why Did Apple Fall To The Ground: Evaluating Curiosity In Large Language Model