Trust Region Policy Optimization

완전히 어려운 수학적 베이스의 이론적인 논문 이제 시작합니다.

https://docs.google.com/presentation/d/1-HM5f0vGbXYLxN3k85BxZiteCUsrL2VsPGS9KY18sxk/edit?usp=sharing

출처 : youtube 팡요랩

Leave a Reply

Your email address will not be published. Required fields are marked *