Multi-language identification and transcription in Video Indexer