CEO神经网络: 多观测语义通信中的一致性融合方法
CEO Neural Network: A Consistency-Aware Fusion Method for Semantic Multi-Observation Communication
-
摘要: 中国有句谚语,叫作“三个臭皮匠,顶个诸葛亮”,意指集思广益、协同合作往往能超越单一的智慧。这一思想在语义通信的研究中同样适用。针对现实场景中多源观测多类别信息并存、语义表达多样且易受干扰的挑战,本文面向多观测语义通信系统(Semantic Multi-Observation Communication System,SMOCS),提出一种基于语义一致性融合机制的神经网络架构,旨在实现多角度观测下语义信息的有效整合与可靠判别。在该系统中多个结构异构的预训练神经网络模拟不同观察者对同一语义事件的建模过程,各自独立生成语义判别概率,从而形成丰富多样的语义表达。在此基础上,本文引入CEO融合网络(Chief Executive Officer Fusion Network)作为语义集成模块,专注于从多源判别结果中挖掘潜在的分布一致性与互补结构。该网络不直接参与语义特征提取,而是通过深度融合机制,从多个观测模型输出中学习其内在关联与语义聚合方式,进而生成更加稳定、准确的最终判决结果。整个系统结构体现了由“多观察-协同建模-统一判决”驱动的语义通信优化策略。为了验证所提方法的有效性,本文选用MNIST手写数字识别任务作为实验平台,对多层感知机(Multilayer Perceptron,MLP)、卷积神经网络(Convolutional Neural Network,CNN)、循环神经网络(Recurrent Neural Network,RNN)、长短期记忆(Long Short-Term Memory,LSTM)网络和残差神经网络(Residual Neural Network,ResNet)等子网络进行预训练并固定权重,最终只训练CEO网络以保证信息融合阶段的纯粹性。实验结果表明,相比单一神经网络分类器,所提系统在识别准确率、鲁棒性与抗干扰能力方面均实现明显提升,尤其在面对语义扰动和信息缺失的场景中展现出更强的容错能力与表达稳定性。为了验证结构泛化性,本文还在文本语义传输任务中进行了仿真实验,展示了模型在多模态、多场景下的适用性与优越性能。Abstract: As the Chinese proverb says, “Three cobblers together can rival Zhuge Liang,” meaning that collective wisdom often surpasses individual genius. This concept is equally applicable in the research of semantic communication. To address the challenges posed by multi-source observations involving multiple categories of coexisting information as well as diverse and easily disturbed semantic expressions in real-world scenarios, this paper focuses on the Semantic Multi-Observation Communication System (SMOCS) and proposes a neural network architecture based on a semantic consistency fusion mechanism, aiming to realize effective integration and reliable discrimination of semantic information from multiple observational perspectives. The system simulates different observers modeling the same semantic event through multiple structurally heterogeneous pre-trained neural networks, each independently generating semantic discrimination probabilities, thereby forming a rich and diverse set of semantic expressions. Accordingly, this paper introduces a Chief Executive Officer (CEO) fusion network as the semantic integration module, which focuses on mining latent distribution consistency and complementary structures from multi-source discrimination results. This network does not directly participate in semantic feature extraction; rather, through a deep fusion mechanism, it learns the intrinsic correlations and semantic aggregation methods from the outputs of multiple observation models, thus producing more stable and accurate final decisions. The entire system’s structure reflects a semantic communication optimization strategy driven by “multi-observation, collaborative modeling, and unified decision-making.” To verify the efficacy of the proposed method, the MNIST handwritten digit recognition task was selected as the experimental platform, where Multilayer Perceptron (MLP), Convolutional Neural Network (CNN), Recurrent Neural Network (RNN), Long Short-Term Memory (LSTM) Network, and Residual Neural Network (ResNet) subnetworks were pre-trained with fixed weights, and only the CEO network was trained to ensure the purity of the information fusion phase. Experimental results demonstrate that, compared to single-neural-network classifiers, the proposed system achieves significant improvements in recognition accuracy, robustness, and anti-interference capability, exhibiting stronger fault tolerance and expression stability when faced with semantic perturbations and information loss. To verify the generalizability of the proposed architecture, simulation experiments were conducted on a text semantic transmission task, demonstrating the applicability and superior performance of the model across multimodal and diverse scenarios.
下载: