Learning Bottleneck Transformer for Event Image-Voxel Feature Fusion based Classification