DDPG:
1. The state and the action are different kinds of data, so they should enter the network at different layers.
2. Use batch normalization.
3. Use the same single layer to process the state in both the actor and the critic.
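
The three tips above can be sketched as a forward pass in NumPy. This is only an illustration under stated assumptions, not a full DDPG implementation: the layer sizes, the helper names (`dense_params`, `batch_norm`, `state_layer`) and the choice to inject the action at the critic's second layer are all hypothetical, and tip 3 is read here as "same layer structure with separate weights" rather than shared weights.

```python
import numpy as np

rng = np.random.default_rng(0)

def dense_params(n_in, n_out):
    # Small random weights and zero biases for one fully connected layer.
    return rng.normal(0.0, 0.1, size=(n_in, n_out)), np.zeros(n_out)

def batch_norm(x, eps=1e-5):
    # Tip 2: normalize each feature over the batch. Only batch statistics are
    # used; learned scale/shift and running averages are omitted for brevity.
    return (x - x.mean(axis=0)) / np.sqrt(x.var(axis=0) + eps)

def state_layer(state, params):
    # Tip 3: the same single state-processing layer for actor and critic
    # (assumption: same structure, each network keeps its own weights).
    w, b = params
    return np.maximum(batch_norm(state @ w + b), 0.0)

def actor(state, p):
    h = state_layer(state, p["state"])
    w, b = p["out"]
    return np.tanh(h @ w + b)  # actions bounded in [-1, 1]

def critic(state, action, p):
    h = state_layer(state, p["state"])
    # Tip 1: the action enters one layer later than the state.
    h = np.concatenate([h, action], axis=1)
    w, b = p["hidden"]
    h = np.maximum(h @ w + b, 0.0)
    w, b = p["out"]
    return h @ w + b  # one Q-value per sample

# Hypothetical sizes chosen only for this example.
STATE_DIM, ACTION_DIM, HIDDEN, BATCH = 3, 1, 16, 8

actor_params = {
    "state": dense_params(STATE_DIM, HIDDEN),
    "out": dense_params(HIDDEN, ACTION_DIM),
}
critic_params = {
    "state": dense_params(STATE_DIM, HIDDEN),
    "hidden": dense_params(HIDDEN + ACTION_DIM, HIDDEN),
    "out": dense_params(HIDDEN, 1),
}

states = rng.normal(size=(BATCH, STATE_DIM))
actions = actor(states, actor_params)
q_values = critic(states, actions, critic_params)
```

Here `actions` and `q_values` each have one row per sample in the batch. In a real training setup the same structure would be built in a deep learning framework so that gradients, target networks, and the batch-norm running statistics are handled properly.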