Adapting Pretrained ViTs with Convolution Injector for Visuo-Motor Control