基于多视图脑网络监督对比学习的脑疾病诊断
Supervised Contrastive Learning with Multi-View Brain Networks for Brain Disease Diagnosis
-
摘要: 由静息态功能磁共振成像(resting-state functional Magnetic Resonance Imaging,rs-fMRI)衍生出的脑网络提供了大脑的功能整合信息,有效促进了对大脑认知和发病过程的理解。近年来,以图神经网络为代表的深度学习算法凭借可以适配脑网络固有拓扑结构的能力,从而在脑疾病辅助诊断中展现出极大的潜力。然而,现有图神经网络方法往往依赖于静态的脑网络拓扑结构,其性能将受限于输入图的质量;同时,rs-fMRI数据采集的高成本限制了数据集的规模,进而制约了深度学习技术的应用与发展。基于这些挑战,本文提出了一种多视图脑网络监督对比学习分析算法。该算法在保留传统脑网络静态拓扑结构视图的基础上,进一步引入了语义视图,其图结构随模型训练动态调整,旨在捕捉面向特定任务时脑网络蕴含的丰富语义信息;此外,设计并引入了交叉注意力模块,它可以学习多视图间的联合表示,从而提高特征的表达能力;最后,构建基于多视图融合的监督对比学习机制,通过配对样本级别的学习,不仅一定程度上实现训练数据的扩充,而且基于样本标签构建正例和负例对有助于在个体层面提取出更具有鉴别能力的脑网络特征,以进行更可靠的疾病诊断。在两个公开的脑疾病神经成像数据集上的实验证明了所提出多视图脑网络监督对比学习算法的有效性。Abstract: The brain network derived from resting-state functional Magnetic Resonance Imaging (rs-fMRI) provides valuable insights into the functional integration of the brain, enhancing our understanding of brain cognition and the underlying mechanisms of various diseases. In recent years, deep learning algorithms, particularly graph neural networks, have shown significant promise in assisting with the diagnosis of brain diseases due to their ability to adapt to the inherent topology of brain networks. However, existing graph neural network methods often rely on a static representation of brain network topology, leading to limited performance based on the quality of the graph input. Additionally, the high cost of rs-fMRI data acquisition restricts dataset size, which hampers the advancement and application of deep learning technologies. To address these challenges, this paper proposes a supervised contrastive learning algorithm designed for multi-view brain networks. This approach retains the traditional static topology view of brain networks while also incorporating a semantic view, wherein the graph structure is dynamically adjusted during model training. This modification aims to capture the rich semantic information present within the brain network for specific tasks. Furthermore, we introduce a cross-attention module that learns joint representations between multiple views, enhancing the ability to represent features effectively. Finally, we construct a supervised contrastive learning mechanism based on multi-view fusion. This paired sample-level learning approach not only expands the training dataset to some extent but also aids in extracting more discriminative features at the individual level, leading to more reliable disease diagnoses. Experiments conducted on two publicly available neuroimaging datasets for brain diseases demonstrate the effectiveness of the proposed multi-view brain network supervised contrastive learning algorithm.