TY - GEN
T1 - CoQuest
T2 - 2024 CHI Conference on Human Factors in Computing Sytems, CHI 2024
AU - Liu, Yiren
AU - Chen, Si
AU - Cheng, Haocong
AU - Yu, Mengxia
AU - Ran, Xiao
AU - Mo, Andrew
AU - Tang, Yiliu
AU - Huang, Yun
N1 - Publisher Copyright:
© 2024 Copyright held by the owner/author(s)
PY - 2024/5/11
Y1 - 2024/5/11
N2 - Developing novel research questions (RQs) often requires extensive literature reviews, especially in interdisciplinary fields. To support RQ development through human-AI co-creation, we leveraged Large Language Models (LLMs) to build an LLM-based agent system named CoQuest. We conducted an experiment with 20 HCI researchers to examine the impact of two interaction designs: breadth-first and depth-first RQ generation. The findings revealed that participants perceived the breadth-first approach as more creative and trustworthy upon task completion. Conversely, during the task, participants considered the depth-first generated RQs as more creative. Additionally, we discovered that AI processing delays allowed users to reflect on multiple RQs simultaneously, leading to a higher quantity of generated RQs and an enhanced sense of control. Our work makes both theoretical and practical contributions by proposing and evaluating a mental model for human-AI co-creation of RQs. We also address potential ethical issues, such as biases and over-reliance on AI, advocating for using the system to improve human research creativity rather than automating scientific inquiry. The system's source is available at: https://github.com/yiren-liu/coquest.
AB - Developing novel research questions (RQs) often requires extensive literature reviews, especially in interdisciplinary fields. To support RQ development through human-AI co-creation, we leveraged Large Language Models (LLMs) to build an LLM-based agent system named CoQuest. We conducted an experiment with 20 HCI researchers to examine the impact of two interaction designs: breadth-first and depth-first RQ generation. The findings revealed that participants perceived the breadth-first approach as more creative and trustworthy upon task completion. Conversely, during the task, participants considered the depth-first generated RQs as more creative. Additionally, we discovered that AI processing delays allowed users to reflect on multiple RQs simultaneously, leading to a higher quantity of generated RQs and an enhanced sense of control. Our work makes both theoretical and practical contributions by proposing and evaluating a mental model for human-AI co-creation of RQs. We also address potential ethical issues, such as biases and over-reliance on AI, advocating for using the system to improve human research creativity rather than automating scientific inquiry. The system's source is available at: https://github.com/yiren-liu/coquest.
KW - Co-creation Systems
KW - Large Language Models
KW - Mixed-initiative Design
KW - Scientifc Discovery
UR - http://www.scopus.com/inward/record.url?scp=85194889680&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85194889680&partnerID=8YFLogxK
U2 - 10.1145/3613904.3642698
DO - 10.1145/3613904.3642698
M3 - Conference contribution
AN - SCOPUS:85194889680
T3 - Conference on Human Factors in Computing Systems - Proceedings
BT - CHI 2024 - Proceedings of the 2024 CHI Conference on Human Factors in Computing Sytems
PB - Association for Computing Machinery
Y2 - 11 May 2024 through 16 May 2024
ER -