Subscribe
The latest psychology and neuroscience discoveries.
My Account
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Psychopharmacology
  • Neuroscience
  • About
No Result
View All Result
PsyPost
PsyPost
No Result
View All Result
Home Exclusive Artificial Intelligence

Artificial intelligence outperforms the average human in a creative thinking test

by Eric W. Dolan
October 9, 2023
in Artificial Intelligence, Cognitive Science
(Photo credit: Adobe Stock)

(Photo credit: Adobe Stock)

Share on TwitterShare on Facebook
Stay on top of the latest psychology findings: Subscribe now!

A groundbreaking study reveals that artificial intelligence, particularly ChatGPT4, can surpass the average human’s ability to generate ideas in a classic creativity assessment. But while AI chatbots displayed consistently high performance, they did not outperform the most creative human participants. Instead, humans showed a wider range of creative potential, which could be linked to variations in executive functions and cognitive processes.

These new findings have been published in the journal Scientific Reports.

Traditionally, creativity was thought to be a distinctly human trait, driven by complex cognitive processes such as imagination, insight, and the ability to connect seemingly unrelated concepts. Yet, as AI technology continues to advance, it has become increasingly clear that machines are capable of producing creative outputs that rival, and sometimes even surpass, human achievements.

“I think we are living in a very particular historical moment in which how we perceive machines and machine intelligence may radically change. I believe as a scientist that there is a lot of research to be done on how people perceive machines and what are the abilities that machines can currently imitate from humans,” explained study author Simone Grassini, an associate professor at the University of Bergen.

“Until few decades ago, it would have been difficult to believe that machines could output abilities such as as creative behavior, and the field is developing so quickly that is difficult to predict what will happen in one or two years from now.”

The researchers conducted their study using a common creativity test known as the Alternate Uses Task (AUT). In this task, participants, both human and AI chatbots, were asked to generate unique and creative uses for common objects such as a rope, box, pencil, and candle.

The humans had 30 seconds to generate as many creative ideas as possible. Chatbots, on the other hand, were instructed to generate a specific number of ideas (e.g., 3 ideas) and use only 1-3 words in each response. Each chatbot was tested 11 times.

The study involved three AI chatbots: ChatGPT3, ChatGPT4, and Copy.Ai, along with a group of 256 human participants. The participants, all native English speakers, were recruited from the online platform Prolific and had an average age of 30.4 years, with a range from 19 to 40 years.

The responses from both humans and AI chatbots were analyzed using two main approaches:

  • Semantic Distance Scores: This automated method assessed the originality of responses by measuring how different they were from common or expected uses of the objects.
  • Subjective Ratings of Creativity: Six human raters, unaware of which responses were generated by AI, were asked to evaluate the creativity of the ideas on a 5-point scale.

The researchers found that the AI chatbots, particularly ChatGPT3 and ChatGPT4, consistently achieved higher semantic distance scores than humans. This means that they generated responses that were more original and less conventional compared to human participants. Human raters also rated AI chatbots, especially ChatGPT4, as more creative than human participants on average.

“According to our results, AI chatbots (such as ChatGPT) are getting quite good in outputting creative answers when asked to perform a traditional creative thinking tasks often used in psychological research,” Grassini said.

However, it’s important to note that while AI chatbots performed exceptionally well, they did not consistently outperform the best human performers. In some cases, highly creative individuals among the human participants were able to compete with AI in generating novel and imaginative responses.

“The average machine performs the Alternate Uses Tasks better than the average human. However, the best of the human participants still outperformed all the models we tested,” Grassini told PsyPost.

“This is a remarkable achievement by the AI systems. However, I believe that people should not overestimate what this may mean in the real world. The fact that a machine can perform quite well in a very specific creativity task, does not mean that the machine will perform well in complex jobs that include creativity. Whether or not these ‘skills’ of the machines are transferable to anything in the real world is still to be tested.”

“I prefer to believe that in the future AI as chatbots will help humans in their creative jobs, more than substitute them for these jobs,” Grassini said. “I think we should need to start thinking about a future in which humans and AI machines can co-exist without necessarily thinking that the machines will destroy us or will steal all our jobs.”

“However, it is worth noting that the impact of AI in the job market is quite significant, and most likely will increase in the next years. How our society will develop to integrate AI into human jobs is therefore a crucial issue of modernity, and I expect that governments and stakeholders will elaborate guidelines and rules on how machines can be used to replace or aid human work.”

ChatGPT4 was the most creative among the chatbots when subjective ratings were considered.

“A notable findings was that ChatGPT4 (the newest model tested) did not outperform the other AI models when the task was scored using an algorithm to measure semantic distance,” Grassini explained. “However, ChatGPT4 performed generally better than the other models when human raters would score the level of creativity displayed in the responses.”

“This seems to point out that the outputs of ChatGPT4 did not differ compared to the others in the ‘objective’ semantic distance between the proposed item and the creative way to use it. However, ChatGPT4 answers are more ‘appealing’ (i.e., are perceived subjectively more creative) by the human raters.”

Like any scientific study, this research has its limitations. “We only measure one type of creative behavior,” Grassini told PsyPost. “Our results may not be generalizable for creativity as a complex phenomenon.”

The researchers also noted that comparing creativity at process levels between humans and chatbots is challenging, as chatbots are essentially “black boxes” whose internal processes are not fully understood.

“The machine may not ‘display’ creativity in the factual way, but it may have learnt the best answer to that specific task in the training data,” Grassini explained. The task may have assessed the chatbots memory rather than its “ability to come up with creative uses of things. Due to the architecture of these models, it is impossible to know.”

The study, “Best humans still outperform artificial intelligence in a creative divergent thinking task“, was authored by Mika Koivisto and Simone Grassini.

TweetSendScanShareSendPin2ShareShareShareShareShare

RELATED

New psychology study: Inner reasons for seeking romance are a top predictor of finding it
Artificial Intelligence

Scientists demonstrate that “AI’s superhuman persuasiveness is already a reality”

July 18, 2025

A recent study reveals that AI is not just a capable debater but a superior one. When personalized, ChatGPT's arguments were over 64% more likely to sway opinions than a human's, a significant and potentially concerning leap in persuasive capability.

Read moreDetails
Autism severity rooted in embryonic brain growth, study suggests
Cognitive Science

Common pollutant in drinking water linked to brain damage and cognitive impairment

July 17, 2025

New research in mice reveals that prolonged exposure to "forever chemicals," or PFAS, can disrupt brain function and impair memory, even at low concentrations. The findings add to growing evidence that these common chemicals may pose significant risks to brain health.

Read moreDetails
Trump’s speeches stump AI: Study reveals ChatGPT’s struggle with metaphors
Artificial Intelligence

Trump’s speeches stump AI: Study reveals ChatGPT’s struggle with metaphors

July 15, 2025

Can an AI understand a political metaphor? Researchers pitted ChatGPT against the speeches of Donald Trump to find out. The model showed moderate success in detection but ultimately struggled with context, highlighting the current limits of automated language analysis.

Read moreDetails
Daughters who feel more attractive report stronger, more protective bonds with their fathers
Artificial Intelligence

People who use AI may pay a social price, according to new psychology research

July 14, 2025

Worried that using AI tools like ChatGPT at work makes you look lazy? New research suggests you might be right. A study finds employees who use AI are often judged more harshly, facing negative perceptions about their competence and effort.

Read moreDetails
Is ChatGPT really more creative than humans? New research provides an intriguing test
ADHD

Scientists use deep learning to uncover hidden motor signs of neurodivergence

July 10, 2025

Diagnosing autism and attention-related conditions often takes months, if not years. But new research shows that analyzing how people move their hands during simple tasks, with the help of artificial intelligence, could offer a faster, objective path to early detection.

Read moreDetails
Scientists find genetic basis for how much people enjoy music
Cognitive Science

Is humor inherited? Twin study suggests the ability to be funny may not run in the family

July 10, 2025

A first-of-its-kind study set out to discover whether being funny is something you inherit. By testing twins on their joke-making skills, researchers found that your sense of humor might have less to do with DNA than you'd think.

Read moreDetails
Even in healthy adults, high blood sugar levels are linked to impaired brain function
Memory

Neuroscientists decode how people juggle multiple items in working memory

July 8, 2025

New neuroscience research shows how the brain decides which memories deserve more attention. By tracking brain activity, scientists found that the frontal cortex helps direct limited memory resources, allowing people to remember high-priority information more precisely than less relevant details.

Read moreDetails
New study uncovers a surprising effect of cold-water immersion
Cognitive Science

New study uncovers a surprising effect of cold-water immersion

July 8, 2025

Cold-water immersion increases energy expenditure—but it may also drive people to eat more afterward. A study in Physiology & Behavior found participants consumed significantly more food following cold exposure, possibly due to internal cooling effects that continue after leaving the water.

Read moreDetails

SUBSCRIBE

Go Ad-Free! Click here to subscribe to PsyPost and support independent science journalism!

STAY CONNECTED

LATEST

Key Alzheimer’s protein found at astonishingly high levels in healthy newborns

People’s ideal leader isn’t hyper-masculine — new study shows preference for androgynous traits

Chronic pain rewires how the brain processes punishment, new research suggests

Common antidepressants and anti-anxiety drugs tied to major shifts in gut microbiome composition

New psychology study: Inner reasons for seeking romance are a top predictor of finding it

Scientists demonstrate that “AI’s superhuman persuasiveness is already a reality”

Cannabis alternative 9(R)-HHC may be as potent as THC, study in mice suggests

A single dose of lamotrigine causes subtle changes in emotional memory

         
       
  • Contact us
  • Privacy policy
  • Terms and Conditions
[Do not sell my information]

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy