Rethinking Transformer for Long Contextual Histopathology Whole Slide Image Analysis

Mar-22-2026, 05:13:28 GMT–Neural Information Processing Systems

Histopathology Whole Slide Image (WSI) analysis serves as the gold standard for clinical cancer diagnosis in the daily routines of doctors. To develop computer-aided diagnosis models for histopathology WSIs, previous methods typically employ Multi-Instance Learning to enable slide-level prediction given only slide-level labels. Among these models, vanilla attention mechanisms without pairwise interactions have traditionally been employed, but they are unable to model contextual information. More recently, self-attention models have been utilized to address this issue. To alleviate the computational complexity of long sequences in large WSIs, methods like HIPT use region-slicing, and TransMIL employs Nyströmformer as an approximation of full self-attention. Both approaches suffer from suboptimal performance due to the loss of key information.
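
The contrast the abstract draws between attention pooling without pairwise interactions and full self-attention can be illustrated with a small sketch. This is not the paper's method or any cited model's implementation: the function names, the tanh scoring form, the identity projections in the self-attention, and the toy sizes (6 patches, 4 features) are all assumptions chosen for illustration.

```python
import math
import random


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def attention_pool(instances, V, w):
    """Attention pooling without pairwise interactions (assumed form).

    Each patch embedding h is scored on its own as w . tanh(V h),
    so the cost grows linearly with the number of patches, but no
    patch ever sees another patch.
    """
    scores = [dot(w, [math.tanh(dot(row, h)) for row in V]) for h in instances]
    weights = softmax(scores)
    dim = len(instances[0])
    pooled = [sum(a * h[d] for a, h in zip(weights, instances)) for d in range(dim)]
    return pooled, weights


def self_attention(instances):
    """Single-head self-attention with identity projections (a simplification).

    Every patch attends to every other patch, so an N x N score
    matrix is computed: contextual, but quadratic in N.
    """
    scale = math.sqrt(len(instances[0]))
    out = []
    for q in instances:
        weights = softmax([dot(q, k) / scale for k in instances])
        out.append([sum(a * v[d] for a, v in zip(weights, instances))
                    for d in range(len(q))])
    return out


# Toy "slide": N patch embeddings of dimension D (hypothetical sizes).
random.seed(0)
N, D, H = 6, 4, 3
patches = [[random.gauss(0, 1) for _ in range(D)] for _ in range(N)]
V = [[random.gauss(0, 1) for _ in range(D)] for _ in range(H)]
w = [random.gauss(0, 1) for _ in range(H)]

slide_embedding, patch_weights = attention_pool(patches, V, w)
contextual = self_attention(patches)
```

In this sketch the pooling path computes N independent scores, while the self-attention path computes N × N scores. For a WSI cut into tens of thousands of patches, that quadratic growth is the cost that region-slicing and approximate attention try to avoid.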

artificial intelligence, machine learning, proceedings

Industry:
- Health & Medicine (0.83)

Technology:
- Information Technology > Artificial Intelligence > Machine Learning (0.38)