Simultaneous Tactile-Visual Perception for Learning Multimodal Robot Manipulation