雷达-通信网络频谱资源动态协同算法
Spectrum Resource Dynamic Collaboration Algorithm for Radar-Communication Networks
-
摘要: 在雷达-通信一体网络中各时刻间的决策变量具有时间相关性时,以往追求某一时刻性能最优的资源分配算法不再适用。本文基于马尔可夫决策过程理论,构建面向雷达-通信一体网络的资源分配决策问题,其目标在于最小化各用频设备的长期平均发射功率。该马尔可夫决策过程问题状态-动作空间维度随用频设备数量呈指数增长,易陷入“维度诅咒”。为提升运行效率,本文提出一种分布式相对值迭代算法,通过对每个用频设备进行资源预分配处理,将原问题分解为多个可并行迭代的低维子问题,其中每个子问题可通过传统的相对值迭代法快速求解。仿真结果表明所提算法与追求单一时刻性能最优的贪婪策略比较,其性能可得到明显提升。
Abstract: In a radar-communication integrated network, conventional resource allocation algorithms which aim at maximizing performance in an instant timeslot are not applicable, in the scenario where decisions across timeslots are correlated. This paper investigates a resource allocation decision problem in a radar-communication integrated network, aiming at minimizing the long-term average transmit power consumed by user equipment. The dimensions of the action/state spaces in such a Markov decision process problem grows exponentially as the number of user equipment increases. This leads to “the curse of dimensionality”. To improve computational efficiency of resource allocation, this paper proposes a distributed relative value iteration algorithm. By pre-allocating the resource to each user equipment, the original problem can be decoupled into multiple small-scale subproblems, where each subproblem can be efficiently solved by exploiting a relative-value-iteration-based algorithm. Simulation results reveal that the proposed algorithm can yield better performance than the scheme designed under myopic policies, which merely focus on maximizing the utility in a single slot.