May 11th, 2024 | RESEARCH
Generative Artificial Intelligence (AI) has witnessed unprecedented growth in text-to-image AI tools. Yet, much remains unknown about users’ prompt journey with such tools in the wild. In this paper, we posit that designing human-centered text-to-image AI tools requires a clear understanding of how individuals intuitively approach crafting prompts, and what challenges they may encounter. To address this, we conducted semi-structured interviews with 19 existing users of a text-to-image AI tool. Our findings (1) offer insights into users’ prompt journey including structures and processes for writing, evaluating, and refining prompts in text-to-image AI tools and (2) indicate that users must overcome barriers to aligning AI to their intents, and mastering prompt crafting knowledge. From the findings, we discuss the prompt journey as an individual yet a social experience and highlight opportunities for aligning text-to-image AI tools and users’ intents.
Document
Team Members
Atefeh Mahdavi Goloujeh, Author, Georgia Institute of TechnologyAnne Sullivan, Author, Georgia Institute of Technology
Brian Magerko, Author, Georgia Institute of Technology
Citation
Publication: CHI '24: Proceedings of the 2024 CHI Conference on Human Factors in Computing System
Funders
Funding Source: NSF
Funding Program: AISL
Award Number: 2214463
Related URLs
Project: Fostering AI Literacy through Embodiment and Creativity across Informal Learning Spaces
Tags
Audience: General Public
Discipline: AI | Technology
Resource Type: Conference Proceedings | Research
Environment Type: Websites | Mobile Apps | Online Media