ENACT-Heart -- ENsemble-based Assessment Using CNN and Transformer on Heart Sounds
arXiv.org Artificial Intelligence
This study explores the application of Vision Transformer (ViT) principles to audio analysis, focusing on heart sounds. It introduces ENACT-Heart, a novel ensemble approach that combines the complementary strengths of Convolutional Neural Networks (CNN) and ViT through a Mixture of Experts (MoE) framework and achieves a classification accuracy of 97.52%. This exceeds the accuracy of the ViT (93.88%) and CNN (95.45%) models used individually. These results demonstrate the potential of ensemble methods to improve classification performance in cardiovascular health monitoring and diagnosis.
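As a rough illustration of how a Mixture of Experts can blend two classifiers, the sketch below weights each expert's class probabilities with a softmax gate. It is not the paper's implementation: the function names, the two-class labels, the probability values, and the gate logits are all hypothetical, and in a trained system a gating network would typically produce the gate logits for each input.

```python
import math

def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def moe_combine(expert_probs, gate_logits):
    """Blend per-expert class probabilities using softmax gate weights."""
    weights = softmax(gate_logits)
    n_classes = len(expert_probs[0])
    return [
        sum(w * probs[c] for w, probs in zip(weights, expert_probs))
        for c in range(n_classes)
    ]

# Hypothetical expert outputs for one recording: [P(normal), P(abnormal)]
cnn_probs = [0.30, 0.70]
vit_probs = [0.60, 0.40]

# Hypothetical gate logits (assumed here; a gating network would learn these)
gate_logits = [1.0, 0.0]

mixed = moe_combine([cnn_probs, vit_probs], gate_logits)
# Index of the class with the highest blended probability
predicted = max(range(len(mixed)), key=mixed.__getitem__)
```

Because the gate weights sum to one, the blended vector is still a probability distribution. With equal gate logits the combination reduces to a plain average of the two experts.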
Feb-24-2025
- Country:
- North America > United States > Michigan > Wayne County > Dearborn (0.14)
- Genre:
- Research Report > New Finding (0.88)