SPRINQL: Sub-optimal Demonstrations driven Offline Imitation Learning
–Neural Information Processing Systems
We focus on offline imitation learning (IL), which aims to mimic an expert's behavior using demonstrations without any interaction with the environment. One of the main challenges in offline IL is the limited support of expert demonstrations, which typically cover only a small fraction of the state-action space. While it may not be feasible to obtain numerous expert demonstrations, it is often possible to gather a larger set of sub-optimal demonstrations. For example, in treatment optimization problems, there are varying levels of doctor treatments available for different chronic conditions. These range from treatment specialists and experienced general practitioners to less experienced general practitioners.
Neural Information Processing Systems
Mar-27-2025, 15:31:19 GMT
- Country:
- North America > United States (0.46)
- Genre:
- Research Report > Experimental Study (1.00)
- Technology: