ID 原文 译文
45196 仿真实验验证了该方法的有效性,并分析了不同参数设置对防御策略选取的影响。 Simulation experiments verify the effectiveness of the method and analyze the influence of different parameter settings on the selection of defense strategies.
45197 针对 Sarsa 算法存在的收敛速度较慢的问题,提出一种改进的基于值函数迁移的启发式 Sarsa 算法(VFT-HSA)。 To address the slow convergence of the Sarsa algorithm, an improved heuristic Sarsa algorithm based on value function transfer (VFT-HSA) is proposed.
45198 该算法将 Sarsa 算法与值函数迁移方法相结合,引入自模拟度量方法,在相同的状态空间和动作空间下,对新任务与历史任务之间的不同状态进行相似性度量,对满足条件的历史状态进行值函数迁移,提高算法的收敛速度。 The algorithm combines the Sarsa algorithm with value function transfer and introduces a bisimulation metric: given the same state space and action space, it measures the similarity between states of the new task and states of historical tasks, and transfers the value functions of the historical states that satisfy the condition, thereby speeding up convergence.
45199 此外,该算法结合启发式探索方法,引入贝叶斯推理,结合变分推理衡量信息增益, In addition, the algorithm incorporates a heuristic exploration method, introducing Bayesian inference and using variational inference to measure information gain.
45200 并运用获取的信息增益构建内在奖赏函数作为探索因子,进而加快算法的收敛速度。 The obtained information gain is then used to construct an intrinsic reward function that serves as the exploration factor, further speeding up the convergence of the algorithm.
45201 将所提算法用于经典的 Grid World 问题,并与 Sarsa 算法、Q-Learning 算法以及收敛性能较好的 VFT-Sarsa 算法、IGP-Sarsa 算法进行比较,实验表明,所提算法具有较快的收敛速度和较好的稳定性。 The proposed algorithm is applied to the classic Grid World problem and compared with the Sarsa and Q-Learning algorithms, as well as with the VFT-Sarsa and IGP-Sarsa algorithms, which have better convergence performance. The experiments show that the proposed algorithm converges faster and is more stable.
45202 目前,基于博弈理论的网络安全研究大多采用静态博弈或多阶段动态博弈模型,不符合实际网络攻防连续对抗、实时变化的特点, At present, most game-theoretic research on network security adopts static or multi-stage dynamic game models, which do not match the continuous confrontation and real-time change that characterize actual network attack and defense.
45203 为了更加贴近攻防实际进行安全威胁预警,借鉴传染病动力学模型分析安全威胁传播过程, To bring security threat early warning closer to actual attack-defense practice, the propagation of security threats is analyzed by drawing on epidemic dynamics models.
45204 基于定性微分博弈理论构建网络攻防博弈模型,推演安全威胁动态变化趋势。 A network attack-defense game model is then constructed based on qualitative differential game theory to deduce the dynamic trend of security threats.
45205 在此基础上,提出攻防定性微分博弈求解方法,构造攻防界栅以及捕获区和躲避区; On this basis, a solution method for the attack-defense qualitative differential game is proposed, which constructs the attack-defense barrier together with the capture zone and the escape zone.