Entropy-Reinforced Planning with Large Language Models for Drug Discovery