Multigranular Evaluation for Brain Visual Decoding
–arXiv.org Artificial Intelligence
Existing evaluation protocols for brain visual decoding predominantly rely on coarse metrics that obscure inter-model differences, lack neuroscientific foundation, and fail to capture fine-grained visual distinctions. To address these limitations, we introduce BASIC, a unified, multigranular evaluation framework that jointly quantifies structural fidelity, inferential alignment, and contextual coherence between decoded and ground-truth images. For the structural level, we introduce a hierarchical suite of segmentation-based metrics, including foreground, semantic, instance, and component masks, anchored in granularity-aware correspondence across mask structures. For the semantic level, we extract structured scene representations encompassing objects, attributes, and relationships using multimodal large language models, enabling detailed, scalable, and context-rich comparisons with ground-truth stimuli. We benchmark a diverse set of visual decoding methods across multiple stimulus-neuroimaging datasets within this unified evaluation framework. Together, these criteria provide a more discriminative, interpretable, and comprehensive foundation for evaluating brain visual decoding methods.
arXiv.org Artificial Intelligence
Dec-2-2025
- Country:
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)
- Genre:
- Research Report (0.81)
- Industry:
- Health & Medicine
- Diagnostic Medicine (0.88)
- Health Care Technology (1.00)
- Therapeutic Area > Neurology (1.00)
- Transportation (1.00)
- Health & Medicine
- Technology:
- Information Technology > Artificial Intelligence
- Cognitive Science (1.00)
- Machine Learning (1.00)
- Natural Language > Text Processing (1.00)
- Representation & Reasoning > Object-Oriented Architecture (0.89)
- Vision (1.00)
- Information Technology > Artificial Intelligence