Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones?