Training Plug-n-Play Knowledge Modules with Deep Context Distillation