PsyPost
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Neuroscience
  • About
No Result
View All Result
Join
My Account
PsyPost
No Result
View All Result
Home Exclusive Artificial Intelligence

ChatGPT-4 outperforms human psychologists in test of social intelligence, study finds

by Eric W. Dolan
April 19, 2024
Reading Time: 3 mins read
(Photo credit: Adobe Stock)

(Photo credit: Adobe Stock)

Share on TwitterShare on Facebook

A new study published in Frontiers in Psychology investigates how AI compares to human psychologists in understanding and responding to human emotions and needs during counseling. The study specifically examined large language models, such as ChatGPT-4, Google Bard, and Bing, assessing their social intelligence — a critical skill in psychotherapy.

ChatGPT-4 outperformed all participating psychologists, while Bing surpassed more than half of them. However, Google Bard’s performance was comparable only to psychologists seeking bachelor’s degrees and was significantly outstripped by doctoral students.

Large language models (LLMs) are advanced artificial intelligence systems designed to understand and generate human-like text by processing vast amounts of written data. These models are trained on diverse internet text to capture nuances in language, context, and syntax.

Through techniques known as deep learning, particularly using structures called neural networks, LLMs can perform a variety of tasks such as answering questions, translating languages, summarizing long articles, and even engaging in conversation that feels strikingly human.

While previous research has shown that LLMs can diagnose and help manage mental health conditions, there was a gap in understanding specifically how these models perform in social contexts, particularly against human psychologists who are skilled in navigating complex emotional interactions.

“The use of artificial intelligence models in counseling and psychotherapy represents a major challenge for psychologists, due to concern that it may take their place in these important tasks,” said study author Fahmi Hassan Fadhel, an associate professor of clinical psychology at Qatar University. “The superiority of artificial intelligence in the areas of perceiving and understanding people’s emotions may mean that it will perhaps be more useful than a human psychotherapist, which is a very concerning issue.”

The study included 180 male psychologists from King Khalid University in Saudi Arabia, divided based on their educational status into bachelor’s and doctoral students. The AI participants included some of the most advanced LLMs available: OpenAI’s ChatGPT-4, Google Bard, and Microsoft Bing.

Each participant, both human and AI, was asked to respond individually to 64 scenarios presented in the Social Intelligence Scale. This scale was chosen because it is well-established and offers a reliable measure of the social skills that are crucial in psychotherapy. The responses were collected and scored according to predefined criteria.

Google News Preferences Add PsyPost to your preferred sources

The items were designed to measure two primary dimensions of social intelligence: the soundness of judgment of human behavior and the ability to act wisely in social situations. The soundness of judgment involves understanding social experiences through observation of human behavior, while the ability to act pertains to analyzing social problems and choosing appropriate solutions.

The results indicated a significant variance in the performance of different AI models and human psychologists, suggesting that some AI systems have advanced to a point where they can outperform human professionals in specific aspects of social intelligence.

Among the AI models evaluated, ChatGPT-4 stood out by demonstrating the highest level of social intelligence. It scored 59 out of 64 on the Social Intelligence Scale, effectively surpassing the performance of all human psychologists in the study. The average social intelligence scores were 39.19 for bachelor’s students and 46.73 for doctoral students.

On the other hand, Bing also performed well, scoring 48 out of 64. This score indicated that Bing outperformed 90% of the bachelor’s students and was on par with 50% of the doctoral students.

In contrast, Google Bard exhibited a lower level of social intelligence in this study. It scored 40 out of 64, which positioned it roughly equivalent to the bachelor-level psychologists but significantly below doctoral students.

The findings serve as a benchmark for future development of AI systems intended for psychotherapy and counseling. Knowing that AI can match or even exceed human performance in social intelligence tasks provides a strong foundation for further integrating these technologies into mental health services.

“The study provides a quick overview of the rapid developments in artificial intelligence,” Fadhel told PsyPost. “It’s a bright outlook for the near future.”

However, the study also raises important questions about training, development, and the ethical use of AI in sensitive areas like mental health, where the ability to empathize and form therapeutic relationships is traditionally viewed as uniquely human.

“Perhaps the biggest caveats will relate to the capabilities of artificial intelligence in the future to understand and analyze human feelings and make decisions based on that,” Fadhel said. “We do not know where developments in this field are headed. To date, the controls imposed on artificial intelligence developers are still at their lowest levels, according to our knowledge.”

The study, “Artificial intelligence and social intelligence: preliminary comparison study between AI models and psychologists,” was authored by Nabil Saleh Sufyan, Fahmi H. Fadhel, Saleh Safeer Alkhathami, and Jubran Y. A. Mukhadi.

RELATED

Blue light exposure may counteract anxiety caused by chronic vibration
Addiction

AI-designed drug reduces fentanyl consumption in animal models by targeting serotonin receptors

May 12, 2026
Childhood ADHD traits linked to midlife distress, with societal exclusion playing a major role
Artificial Intelligence

ChatGPT’s free version is 26 times more likely to respond inappropriately to psychotic delusions

May 9, 2026
Mind captioning: This scientist just used AI to translate brain activity into text
Artificial Intelligence

Scientists tested AI’s moral compass, and the results reveal a key blind spot

May 8, 2026
Scientists show how common chord progressions unlock social bonding in the brain
Artificial Intelligence

Perpetrators of AI sexual abuse often view their actions as a joke, new research shows

May 7, 2026
AI outshines humans in humor: Study finds ChatGPT is as funny as The Onion
Artificial Intelligence

Conversational AI shows promise in easing symptoms of anxiety and depression

May 6, 2026
The surprising link between conspiracy mentality and deepfake detection ability
Artificial Intelligence

Deepfake videos degrade political reputations even when viewers realize they are fake

May 5, 2026
Stanford scientist discovers that AI has developed an uncanny human-like ability
Artificial Intelligence

Turning to chatbots when lonely may exacerbate feelings of loneliness, study finds

May 4, 2026
Study explores how virtual “girlfriend experiences” tap evolved relationship motivations in the digital age
Artificial Intelligence

Study explores how virtual “girlfriend experiences” tap evolved relationship motivations in the digital age

May 3, 2026

Follow PsyPost

The latest research, however you prefer to read it.

Daily newsletter

One email a day. The newest research, nothing else.

Google News

Get PsyPost stories in your Google News feed.

Add PsyPost to Google News
RSS feed

Use your favorite reader. We also syndicate to Apple News.

Copy RSS URL
Social media
Support independent science journalism

Ad-free reading, full archives, and weekly deep dives for members.

Become a member

Trending

  • Brooding identified as a major driver of bedtime procrastination, alongside physical markers of stress
  • Scientists challenge The Body Keeps the Score with a new predictive model of trauma
  • Eating at least five eggs a week is associated with a 27 percent lower risk of Alzheimer’s
  • Brain scans reveal how people with autistic traits connect differently
  • Scientists discover a hydraulic link between the abdomen and the brain

Science of Money

  • The Goldilocks zone of sales pressure: Why a little urgency helps and too much hurts
  • What women really want from “girl power” ads: Six ingredients that make femvertising work
  • The seductive allure of neuroscience: Why brain talk feels so satisfying, even when it explains nothing
  • When two heads aren’t better than one: What research reveals about human-AI teamwork in marketing
  • How your personality may shape whether you pick value or growth stocks

PsyPost is a psychology and neuroscience news website dedicated to reporting the latest research on human behavior, cognition, and society. (READ MORE...)

  • Mental Health
  • Neuroimaging
  • Personality Psychology
  • Social Psychology
  • Artificial Intelligence
  • Cognitive Science
  • Psychopharmacology
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and conditions
  • Do not sell my personal information

(c) PsyPost Media Inc

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy

(c) PsyPost Media Inc