Aerospace and Mechanical Insider on MSN
Hierarchical reinforcement learning boosts air defense efficiency
Modern air defense confrontations demand rapid, precise task assignments in environments where threats evolve within seconds.
Abstract: In recent years, reinforcement learning (RL) has made great achievements in artificial intelligence. Proximal policy optimization (PPO) is a representative RL algorithm, which limits the ...
Abstract: Conventional traffic light control systems often exhibit rigid timing patterns, limited flexibility, and insufficient adaptability to changing traffic conditions. This paper addresses urban ...
连续动作空间的 PPO 算法实现 多智能体环境支持 10 种训练技巧优化 TensorBoard 训练可视化 自定义 MPE 环境(多无人机 ...
Modify the parameters in config.conf as you like.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results