When can isotropy help adapt LLMs' next word prediction to numerical domains?