Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes