ddpg-ink-Pendulum-v0



ddpg:

1. state and action belongs to different data, and should be input at different layers.

2. use batch normalization

3. use the same one layer for the state in actor and critic.



深度学习推荐
深度学习推荐

墨之科技,版权所有 © Copyright 2017-2027

湘ICP备14012786号     邮箱:ai@inksci.com