Reinforcement Learning-Based Coverage Path Planning with Implicit Cellular Decomposition