Huang, Qiming
GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders
Du, Zhicheng, Jiang, Chenyao, Yuan, Xi, Zhai, Shiyao, Lei, Zhengyang, Ma, Shuyue, Liu, Yang, Ye, Qihui, Xiao, Chufan, Huang, Qiming, Xu, Ming, Yu, Dongmei, Qin, Peiwu
The timely identification of mental disorders in adolescents is a global public health challenge.Single factor is difficult to detect the abnormality due to its complex and subtle nature. Additionally, the generalized multimodal Computer-Aided Screening (CAS) systems with interactive robots for adolescent mental disorders are not available. Here, we design an android application with mini-games and chat recording deployed in a portable robot to screen 3,783 middle school students and construct the multimodal screening dataset, including facial images, physiological signs, voice recordings, and textual transcripts.We develop a model called GAME (Generalized Model with Attention and Multimodal EmbraceNet) with novel attention mechanism that integrates cross-modal features into the model. GAME evaluates adolescent mental conditions with high accuracy (73.34%-92.77%) and F1-Score (71.32%-91.06%).We find each modality contributes dynamically to the mental disorders screening and comorbidities among various mental disorders, indicating the feasibility of explainable model. This study provides a system capable of acquiring multimodal information and constructs a generalized multimodal integration algorithm with novel attention mechanisms for the early screening of adolescent mental disorders.
Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning
Liu, Yiman, Huang, Qiming, Han, Xiaoxiang, Liang, Tongtong, Zhang, Zhifang, Chen, Lijun, Wang, Jinfeng, Stefanidis, Angelos, Su, Jionglong, Chen, Jiangang, Li, Qingli, Zhang, Yuqi
Purpose: Congenital heart defect (CHD) is the most common birth defect. Thoracic echocardiography (TTE) can provide sufficient cardiac structure information, evaluate hemodynamics and cardiac function, and is an effective method for atrial septal defect (ASD) examination. This paper aims to study a deep learning method based on cardiac ultrasound video to assist in ASD diagnosis. Materials and methods: We select two standard views of the atrial septum (subAS) and low parasternal four-compartment view (LPS4C) as the two views to identify ASD. We enlist data from 300 children patients as part of a double-blind experiment for five-fold cross-validation to verify the performance of our model. In addition, data from 30 children patients (15 positives and 15 negatives) are collected for clinician testing and compared to our model test results (these 30 samples do not participate in model training). We propose an echocardiography video-based atrial septal defect diagnosis system. In our model, we present a block random selection, maximal agreement decision and frame sampling strategy for training and testing respectively, resNet18 and r3D networks are used to extract the frame features and aggregate them to build a rich video-level representation. Results: We validate our model using our private dataset by five-cross validation. For ASD detection, we achieve 89.33 AUC, 84.95 accuracy, 85.70 sensitivity, 81.51 specificity and 81.99 F1 score. Conclusion: The proposed model is multiple instances learning-based deep learning model for video atrial septal defect detection which effectively improves ASD detection accuracy when compared to the performances of previous networks and clinical doctors.
Futuristic Variations and Analysis in Fundus Images Corresponding to Biological Traits
Hassan, Muhammad, Zhang, Hao, Ameen, Ahmed Fateh, Zeng, Home Wu, Ma, Shuye, Liang, Wen, Shang, Dingqi, Ding, Jiaming, Zhan, Ziheng, Lam, Tsz Kwan, Xu, Ming, Huang, Qiming, Wu, Dongmei, Zhang, Can Yang, You, Zhou, Ain, Awiwu, Qin, Pei Wu
Fundus image captures rear of an eye, and which has been studied for the diseases identification, classification, segmentation, generation, and biological traits association using handcrafted, conventional, and deep learning methods. In biological traits estimation, most of the studies have been carried out for the age prediction and gender classification with convincing results. However, the current study utilizes the cutting-edge deep learning (DL) algorithms to estimate biological traits in terms of age and gender together with associating traits to retinal visuals. For the traits association, our study embeds aging as the label information into the proposed DL model to learn knowledge about the effected regions with aging. Our proposed DL models, named FAG-Net and FGC-Net, correspondingly estimate biological traits (age and gender) and generates fundus images. FAG-Net can generate multiple variants of an input fundus image given a list of ages as conditions. Our study analyzes fundus images and their corresponding association with biological traits, and predicts of possible spreading of ocular disease on fundus images given age as condition to the generative model. Our proposed models outperform the randomly selected state of-the-art DL models.
Neuro-Symbolic Learning: Principles and Applications in Ophthalmology
Hassan, Muhammad, Guan, Haifei, Melliou, Aikaterini, Wang, Yuqi, Sun, Qianhui, Zeng, Sen, Liang, Wen, Zhang, Yiwei, Zhang, Ziheng, Hu, Qiuyue, Liu, Yang, Shi, Shunkai, An, Lin, Ma, Shuyue, Gul, Ijaz, Rahee, Muhammad Akmal, You, Zhou, Zhang, Canyang, Pandey, Vijay Kumar, Han, Yuxing, Zhang, Yongbing, Xu, Ming, Huang, Qiming, Tan, Jiefu, Xing, Qi, Qin, Peiwu, Yu, Dongmei
Neural networks have been rapidly expanding in recent years, with novel strategies and applications. However, challenges such as interpretability, explainability, robustness, safety, trust, and sensibility remain unsolved in neural network technologies, despite the fact that they will unavoidably be addressed for critical applications. Attempts have been made to overcome the challenges in neural network computing by representing and embedding domain knowledge in terms of symbolic representations. Thus, the neuro-symbolic learning (NeSyL) notion emerged, which incorporates aspects of symbolic representation and bringing common sense into neural networks (NeSyL). In domains where interpretability, reasoning, and explainability are crucial, such as video and image captioning, question-answering and reasoning, health informatics, and genomics, NeSyL has shown promising outcomes. This review presents a comprehensive survey on the state-of-the-art NeSyL approaches, their principles, advances in machine and deep learning algorithms, applications such as opthalmology, and most importantly, future perspectives of this emerging field.