RATE: A Reliability-Aware Tester-Based Evaluation Framework of User Simulators

Sahiti Labhishetty, Cheng Xiang Zhai

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Evaluation of user simulators is needed in order to use them for evaluating Interactive Information Retrieval (IIR) Systems. Previous work has proposed a tester-based approach to evaluate user simulators, but it has not addressed the important question about the reliability of the testers themselves, nor has it studied how to generate a single reliability score for a user simulator based on multiple testers. In this paper, we address these two limitations and propose a novel Reliability-Aware Tester-based Evaluation (RATE) framework for evaluating the reliability of both User Simulators and testers. In this framework, the reliability of Testers and that of Simulators are jointly learned through unsupervised learning using iterative propagation of reliability. We propose and evaluate two algorithms for unsupervised learning of reliabilities. Evaluation results using TREC data sets show that the proposed RATE framework is effective in measuring the reliability of simulators and testers, thus serving as a foundation for potentially establishing a new paradigm for evaluating IIR systems using user simulation.

Original languageEnglish (US)
Title of host publicationAdvances in Information Retrieval - 44th European Conference on IR Research, ECIR 2022, Proceedings
EditorsMatthias Hagen, Suzan Verberne, Craig Macdonald, Christin Seifert, Krisztian Balog, Kjetil Nørvåg, Vinay Setty
PublisherSpringer
Pages336-350
Number of pages15
ISBN (Print)9783030997359
DOIs
StatePublished - 2022
Event44th European Conference on Information Retrieval, ECIR 2022 - Stavanger, Norway
Duration: Apr 10 2022Apr 14 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13185 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference44th European Conference on Information Retrieval, ECIR 2022
Country/TerritoryNorway
CityStavanger
Period4/10/224/14/22

Keywords

  • IIR Systems
  • Reliability of User Simulator
  • Tester

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science

Fingerprint

Dive into the research topics of 'RATE: A Reliability-Aware Tester-Based Evaluation Framework of User Simulators'. Together they form a unique fingerprint.

Cite this