20092019
If you made any changes in Pure, your changes will be visible here soon.

Research Output 2009 2019

  • 23 Conference contribution
  • 8 Article
  • 2 Conference article
  • 1 Paper
Filter
Paper
2019

Off-policy evaluation and learning from logged bandit feedback: Error reduction via surrogate policy

Xie, Y., Liu, Q., Zhou, Y., Liu, B., Wang, Z. & Peng, J., Jan 1 2019.

Research output: Contribution to conferencePaper

Maximum likelihood
Feedback
evaluation
learning
Recommender systems