Abstract
This paper introduces a novel approach for learning natural language descriptions of scenery in Minecraft. We apply techniques from Computer Vision and Natural Language Processing to create an AI framework called MineObserver for assessing the accuracy of learner-generated descriptions of science-related images. The ultimate purpose of the system is to automatically assess the accuracy of learner observations, written in natural language, made during science learning activities that take place in Minecraft. Eventually, MineObserver will be used as part of a pedagogical agent framework for providing in-game support for learning. Preliminary results are mixed, but promising with approximately 62% of images in our test set being properly classified by our image captioning approach. Broadly, our work suggests that computer vision techniques work as expected in Minecraft and can serve as a basis for assessing learner observations.
Original language | English (US) |
---|---|
Journal | Proceedings of the International Florida Artificial Intelligence Research Society Conference, FLAIRS |
Volume | 35 |
DOIs | |
State | Published - 2022 |
Event | 35th International Florida Artificial Intelligence Research Society Conference, FLAIRS-35 2022 - Jensen Beach, United States Duration: May 15 2022 → May 18 2022 |
Keywords
- Computer Vision
- Natural Language Processing
- Pedagogical Agent
ASJC Scopus subject areas
- Artificial Intelligence
- Software