Human creativity still surpasses “creative” generative AI, according to recent research

Human creativity still surpasses “creative” generative AI, according to recent research

  • Research shows that generative AI models perform poorly in creative image production, especially when deprived of human guidance.
  • Directly comparing the performance of AI models with that of visual artists and the general population in creative imagination tasks, people turned out to be much more creative.

Research confirms it: in image generation, AI creativity is a myth. Although current generative AI models may indeed appear autonomous creative agents, when deconstructing their imaginative process, their lack of true creative abilities becomes apparent. This is the conclusion reached by a new study, published in the scientific journal Advanced Science, led by an international team of researchers from the Cognition and Brain Plasticity group at Bellvitge Biomedical Research Institute (IDIBELL) and the Institute of Neurosciences of the University of Barcelona (UB), the Computer Vision Center (CVC), the Autonomous University of Barcelona (UAB) and the Vienna Cognitive Science Hub. The study, focused on visual creativity and imagination, was born in 2024 during a workshop organised by the Fundació Èpica – La Fura dels Baus, whose work focuses on promoting interdisciplinary collaborations between science, technology, and arts.

As a result of this workshop, the researchers articulated an innovative methodology to study creativity: they prepared a visual-creative imagination task based on abstract stimuli and compared the creative performance of an AI image generation model, with and without human guidance, with that of two groups of people: visual artists and the general population. To reproduce a visually comparable drawing style, the generative model was trained using the human participants’ creative productions, and was later tested under two conditions: with and without human guidance, by employing elaborate, concrete prompts in the former case, and more basic ones in the latter. Abstract stimuli and resulting drawings can be seen in Figure 1.

Figure 1. Examples of the Imagination Task’s abstract stimuli and resulting creative images from the four experimental categories (Visual Artists, Non-Artists, Human-Inspired GenAI and Self-Guided GenAI), along with their respective creativity ratings (1-7).

Unanimity among evaluators: human productions are more creative

A group of people and two AIs were in charge of evaluating the level of creativity of the drawings according to 5 different criteria: likeness (to what extent they liked drawing), liveliness, originality, aesthetics and curiosity. In all of them, the results showed a clear gradient: visual artists were rated as most creative, followed by the general population and the human-guided AI model, and lastly, at a strong disadvantage, the unguided AI model. “Although the AI model was trained with the creative productions of human participants, it showed poor performance in the production of creative images and, in fact, it did even worse when deprived of human guidance”, explains Dr Xim Cerdá-Company, researcher at IDIBELL and the CVC/UAB, and co-leader of the study. The scores can be seen in Figure 2.

Figure 2. Examples of drawings from each category, in rows: Visual Artists, Non-Artists, Human-Inspired AI, Self-Guided AI. The values reported correspond to the drawing’s Creativity score by humans, GPT-4o, and Guided-GPT-4o raters, respectively.

Studying creativity as a process, not just for results

For the research team, this study’s contributions to both AI and cognitive science are manifold. Firstly, it highlights the necessity of employing a diverse range of measures and models when investigating a process as complex and multi-faceted as creativity. “Currently, AI creativity is valued almost exclusively according to verbal creativity tasks, leading to biased results, and even presenting AI models as creative agents whose performance surpassed that of most humans. With a different approach, directly evaluating the imaginative process from ideation to execution, we have shown that this is not true”, Dr. Cerdá-Company continues. “Creativity must be studied as a process, not just focus on its results”, he adds. 

Secondly, by showing that as human guidance decreased, so did the creativity of the models, the study highlighted “that current image-generation models are still far from reproducing independent creative processes”, says Dr Antoni Rodríguez-Fornells, head of the Brain Cognition and Plasticity group at IDIBELL-UB, ICREA researcher, and co-leader of the study. This highlights the fundamental need for human intervention in multiple stages of the AI creative process, from training to idea generation. “The technical image-generation AI abilities cannot be evaluated solo; the creative process must be explored in its many components. In doing so, it becomes clear that AI depends directly on our intervention,” concludes Dr Rodríguez-Fornells.


Original source: Silvia Rondini, et al. Stable Diffusion Models Reveal a Persisting Human–AI Gap in Visual Creativity. Advanced Science, 2026. https://doi.org/10.1002/advs.202524142