Agents Explore the Environment Beyond Good Actions to Improve Their Model for Better Decisions