Two-stream network-driven vision-based tactile sensor for object feature extraction and fusion perception