Jian Peng

If you made any changes in Pure these will be visible here soon.

Research Output

Filter
Paper
2019

Accelerating nonconvex learning via replica exchange Langevin diffusion

Chen, Y., Chen, J., Dong, J., Peng, J. & Wang, Z., Jan 1 2019.

Research output: Contribution to conferencePaper

Learning to play in a day: Faster deep reinforcement learning by optimality tightening

He, F. S., Liu, Y., Schwing, A. G. & Peng, J., Jan 1 2019.

Research output: Contribution to conferencePaper

Off-policy evaluation and learning from logged bandit feedback: Error reduction via surrogate policy

Xie, Y., Liu, Q., Zhou, Y., Liu, B., Wang, Z. & Peng, J., Jan 1 2019.

Research output: Contribution to conferencePaper

2018

Action-dependent control variates for policy optimization via Stein’s identity

Liu, H., Feng, Y., Mao, Y., Zhou, D., Peng, J. & Liu, Q., Jan 1 2018.

Research output: Contribution to conferencePaper

Fast and accurate text classification: Skimming, rereading and early stopping

Yu, K., Liu, Y., Schwing, A. G. & Peng, J., Jan 1 2018.

Research output: Contribution to conferencePaper

Policy optimization by genetic distillation

Gangwani, T. & Peng, J., Jan 1 2018.

Research output: Contribution to conferencePaper

2017

Stein variational policy gradient

Liu, Y., Ramachandran, P., Liu, Q. & Peng, J., Jan 1 2017.

Research output: Contribution to conferencePaper