MineObserver: A Deep Learning Framework for Assessing Natural Language Descriptions of Minecraft Imagery

Jay M. Mahajan, Samuel Hum, Jeff Ginger, H. Chad Lane

Research output: Contribution to journalConference articlepeer-review

Abstract

This paper introduces a novel approach for learning natural language descriptions of scenery in Minecraft. We apply techniques from Computer Vision and Natural Language Processing to create an AI framework called MineObserver for assessing the accuracy of learner-generated descriptions of science-related images. The ultimate purpose of the system is to automatically assess the accuracy of learner observations, written in natural language, made during science learning activities that take place in Minecraft. Eventually, MineObserver will be used as part of a pedagogical agent framework for providing in-game support for learning. Preliminary results are mixed, but promising with approximately 62% of images in our test set being properly classified by our image captioning approach. Broadly, our work suggests that computer vision techniques work as expected in Minecraft and can serve as a basis for assessing learner observations.

Keywords

  • Computer Vision
  • Natural Language Processing
  • Pedagogical Agent

ASJC Scopus subject areas

  • Artificial Intelligence
  • Software

Fingerprint

Dive into the research topics of 'MineObserver: A Deep Learning Framework for Assessing Natural Language Descriptions of Minecraft Imagery'. Together they form a unique fingerprint.

Cite this