MLP Gaussian policy
A Gaussian policy whose mean and standard deviation are both outputs of a neural network (here, an MLP).
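To make the idea concrete, here is a minimal sketch of such a policy in plain Python. It is an illustration only, not the post's own implementation: the class name `MLPGaussianPolicy`, the single tanh hidden layer, the hidden size, the omitted biases, and the choice to have the network predict `log(std)` rather than `std` directly are all assumptions made for this example.

```python
import math
import random


class MLPGaussianPolicy:
    """Hypothetical minimal policy: one tanh hidden layer, two linear heads."""

    def __init__(self, obs_dim, act_dim, hidden=16, seed=0):
        self.rng = random.Random(seed)
        self.w_hidden = self._init(hidden, obs_dim)
        self.w_mean = self._init(act_dim, hidden)
        self.w_log_std = self._init(act_dim, hidden)

    def _init(self, rows, cols):
        # Small random weights; biases are omitted to keep the sketch short.
        return [[self.rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]

    @staticmethod
    def _matvec(weights, vec):
        # Plain matrix-vector product on nested lists.
        return [sum(w * x for w, x in zip(row, vec)) for row in weights]

    def forward(self, obs):
        """Return (mean, std) of the action distribution for one observation."""
        h = [math.tanh(v) for v in self._matvec(self.w_hidden, obs)]
        mean = self._matvec(self.w_mean, h)
        # Assumption: the head predicts log(std); exponentiating keeps std positive.
        std = [math.exp(v) for v in self._matvec(self.w_log_std, h)]
        return mean, std

    def sample(self, obs):
        """Draw an action a = mean + std * eps, with eps ~ N(0, 1) per dimension."""
        mean, std = self.forward(obs)
        return [m + s * self.rng.gauss(0.0, 1.0) for m, s in zip(mean, std)]

    def log_prob(self, obs, action):
        """Log-density of `action` under the diagonal Gaussian for `obs`."""
        mean, std = self.forward(obs)
        return sum(
            -0.5 * ((a - m) / s) ** 2 - math.log(s) - 0.5 * math.log(2 * math.pi)
            for a, m, s in zip(action, mean, std)
        )
```

A call such as `MLPGaussianPolicy(obs_dim=3, act_dim=2).sample([0.5, -1.0, 2.0])` returns a two-element action. In practice the weights would be trained with a policy-gradient method using `log_prob`; training is left out of this sketch.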
Author: 神奇的战士
Link: https://wangshub.github.io/posts/mlpgaussianpolicy/
This article is licensed under the Creative Commons Attribution-NonCommercial 4.0 International License.