Multimedia-Aware Question Answering: A Review of Retrieval and Cross-Modal Reasoning Architectures