TY - GEN
T1 - Focused evaluation for image description with binary forced-choice tasks
AU - Hodosh, Micah
AU - Hockenmaier, Julia
N1 - This paper is based upon work supported by the National Science Foundation under Grants No. 1205627, 1405883 and 1053856. Any opinions, findings, and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the National Science Foundation.
PY - 2016
Y1 - 2016
N2 - Current evaluation metrics for image description may be too coarse. We therefore propose a series of binary forced-choice tasks that each focus on a different aspect of the captions. We evaluate a number of different off-the-shelf image description systems. Our results indicate strengths and shortcomings of both generation- and ranking-based approaches.
UR - https://www.scopus.com/pages/publications/85083489900
U2 - 10.18653/v1/W16-3203
DO - 10.18653/v1/W16-3203
M3 - Conference contribution
AN - SCOPUS:85083489900
T3 - Proceedings of the Annual Meeting of the Association for Computational Linguistics
SP - 19
EP - 28
BT - Proceedings of the 5th Workshop on Vision and Language, VL 2016 at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
A2 - Belz, Anya
A2 - Erdem, Erkut
A2 - Mikolajczyk, Krystian
A2 - Pastra, Katerina
PB - Association for Computational Linguistics (ACL)
T2 - 5th Workshop on Vision and Language, VL 2016 at the 54th Annual Meeting of the Association for Computational Linguistics, ACL 2016
Y2 - 12 August 2016
ER -