A STOCHASTIC TRUST-REGION FRAMEWORK FOR POLICY OPTIMIZATION
Mingming Zhao, Yongfeng Li, Zaiwen Wen
Journal of Computational Mathematics ›› 2022, Vol. 40 ›› Issue (6) : 1004-1030.
A STOCHASTIC TRUST-REGION FRAMEWORK FOR POLICY OPTIMIZATION
{{custom_ref.label}} |
{{custom_citation.content}}
{{custom_citation.annotation}}
|
/
〈 |
|
〉 |