Cited By
View all- Panda PBhatnagar SKiyavash NMooij J(2024)Finite-time analysis of three-timescale constrained actor-critic and constrained natural actor-critic algorithmsProceedings of the Fortieth Conference on Uncertainty in Artificial Intelligence10.5555/3702676.3702809(2787-2834)Online publication date: 15-Jul-2024
- Wang YWang YZhou YZou SSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Non-asymptotic analysis for single-loop (natural) actor-critic with compatible function approximationProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3694193(51771-51824)Online publication date: 21-Jul-2024
- Agrawal SA. PMaguluri SSalakhutdinov RKolter ZHeller KWeller AOliver NScarlett JBerkenkamp F(2024)Policy evaluation for variance in average reward reinforcement learningProceedings of the 41st International Conference on Machine Learning10.5555/3692070.3692091(471-502)Online publication date: 21-Jul-2024
- Show More Cited By