Proximal Policy Optimization Algorithm

Aerospace and Mechanical Insider on MSN

Hierarchical reinforcement learning boosts air defense efficiency

Modern air defense confrontations demand rapid, precise task assignments in environments where threats evolve within seconds.

IEEE

Proximal Policy Optimization With Advantage Reuse Competition

Abstract: In recent years, reinforcement learning (RL) has made great achievements in artificial intelligence. Proximal policy optimization (PPO) is a representative RL algorithm, which limits the ...

IEEE

Hybrid CNN-LSTM and Proximal Policy Optimization Model for Traffic Light Control in a Multi-Agent Environment

Abstract: Conventional traffic light control systems often exhibit rigid timing patterns, limited flexibility, and insufficient adaptability to changing traffic conditions. This paper addresses urban ...

GitHub

yezzzzye/mult_uav_ppo_case

连续动作空间的 PPO 算法实现多智能体环境支持 10 种训练技巧优化 TensorBoard 训练可视化自定义 MPE 环境（多无人机 ...

GitHub

jcwleo/random-network-distillation-pytorch

Modify the parameters in config.conf as you like.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results