Appendix of Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

A Proofs