Generating Natural-Language Video Descriptions Using Text-Mined Knowledge