PsyPost
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Neuroscience
  • About
No Result
View All Result
Join
My Account
PsyPost
No Result
View All Result
Home Exclusive Artificial Intelligence

Scientists shocked to find AI’s social desirability bias “exceeds typical human standards”

by Eric W. Dolan
February 5, 2025
Reading Time: 6 mins read
[Adobe Stock]

[Adobe Stock]

Share on TwitterShare on Facebook

A new study published in PNAS Nexus reveals that large language models, which are advanced artificial intelligence systems, demonstrate a tendency to present themselves in a favorable light when taking personality tests. This “social desirability bias” leads these models to score higher on traits generally seen as positive, such as extraversion and conscientiousness, and lower on traits often viewed negatively, like neuroticism.

The language systems seem to “know” when they are being tested and then try to look better than they might otherwise appear. This bias is consistent across various models, including GPT-4, Claude 3, Llama 3, and PaLM-2, with more recent and larger models showing an even stronger inclination towards socially desirable responses.

Large language models are increasingly used to simulate human behavior in research settings. They offer a potentially cost-effective and efficient way to collect data that would otherwise require human participants. Since these models are trained on vast amounts of text data generated by humans, they can often mimic human language and behavior with surprising accuracy. Understanding the potential biases of large language models is therefore important for researchers who are using or planning to use them in their studies.

Personality traits, particularly the “Big Five” (extraversion, openness to experience, conscientiousness, agreeableness, and neuroticism), are a common focus of psychological research. While the Big Five model was designed to be neutral, most people tend to favor higher scores on extraversion, openness, conscientiousness, and agreeableness, and lower scores on neuroticism.

Given the prevalence of personality research and the potential for large language models to be used in this field, the researchers sought to determine whether these models exhibit biases when completing personality tests. Specifically, they wanted to investigate whether large language models are susceptible to social desirability bias, a well-documented phenomenon in human psychology where individuals tend to answer questions in a way that portrays them positively.

“Our lab works at the intersection of psychology and AI,” said study authors Johannes Eichstaedt (an assistant professor and Shriram Faculty Fellow at the Institute for Human-Centered Artificial Intelligence) and Aadesh Salecha (a master’s student at Stanford University and a staff data scientist at the Computational Psychology and Well-Being Lab).

“We’ve been fascinated by using our understanding of human behavior (and the methods from cognitive science) and applying it to intelligent machines. As LLMs are used more and more to simulate human behavior in psychological experiments, we wanted to explore whether they reflect biases similar to those we see in humans. During our explorations with giving different psychological tests to LLMs, we came across this robust social desirability bias.”

To examine potential response biases in large language models, the researchers conducted a series of experiments using a standardized 100-item Big Five personality questionnaire. This questionnaire is based on a well-established model of personality and is widely used in psychological research. The researchers administered the questionnaire to a variety of large language models, including those developed by OpenAI, Anthropic, Google, and Meta. These models were chosen to ensure that the findings would be broadly applicable across different types of large language models.

Google News Preferences Add PsyPost to your preferred sources

The core of the study involved varying the number of questions presented to the models in each “batch.” The researchers tested batches ranging from a single question to 20 questions at a time. Each batch was presented in a new “session” to prevent the model from having access to previous questions and answers. The models were instructed to respond to each question using a 5-point scale, ranging from “Very Inaccurate” to “Very Accurate,” similar to how humans would complete the questionnaire.

The researchers also took steps to ensure the integrity of their findings. They tested the impact of randomness in the models’ responses by adjusting a setting called “temperature,” which controls the level of randomness. They created paraphrased versions of the survey questions to rule out the possibility that the models were simply recalling memorized responses from their training data.

Additionally, they randomized the order of the questions to eliminate any potential effects of question order. Finally, they tested both positively coded and reverse-coded versions of the questions (e.g., “I am the life of the party” vs. “I don’t talk a lot”) to assess the potential influence of acquiescence bias, which is the tendency to agree with statements regardless of their content.

The study’s results clearly demonstrated that large language models exhibit a social desirability bias when completing the Big Five personality test. Across all tested models, scores were skewed towards the desirable ends of the trait dimensions. For instance, as the number of questions presented in a batch increased, the models’ scores on extraversion, conscientiousness, openness, and agreeableness tended to increase, while their scores on neuroticism decreased. This pattern was particularly pronounced in the GPT-4 model.

“Our study shows that LLMs behave differently when they infer that they are being evaluated,” Eichstaedt and Salecha told PsyPost. “Specifically, when they recognize that they are being subjected to a Big Five personality test they ‘try’ to appear more likable, just like people do. This has big implications for how we use LLMs in research and real-world applications because it suggests they might subtly change their behavior based on how they interpret a situation. It also shows just how much they are trained to ‘be liked’ — which is something that can backfire in subtle ways — some models are leaning towards being sycophantic.”

The researchers found that the difference in scores between administering the survey one question at a time versus 20 questions at a time was equivalent to more than one standard deviation in human terms, which is a very large effect. In other words, the change in the model’s responses due to social desirability bias was comparable to a significant shift in a person’s personality. The bias was not limited to GPT-4; it was observed across all the large language models tested, with larger and more recent models generally exhibiting a stronger bias.

“The size of the effect was shocking. In humans, social desirability bias exists but in LLMs, we saw effect sizes that exceed typical human standards — like an average person suddenly becoming extremely extroverted (90th percentile),” the researchers said.

Further analysis suggested that the models were able to infer when they were being evaluated for personality, even with a small number of questions. The accuracy with which the models could identify that they were taking a personality test was associated with the strength of the social desirability bias. For example, GPT-4, Claude 3, and Llama 3 could correctly identify the nature of the questions with over 90% accuracy when presented with just five questions, while PaLM-2 and GPT-3.5 were less accurate.

When the models were explicitly told that they were completing a personality test, their responses were even more skewed towards social desirability, even when presented with only a single question. This finding suggests that the models are adjusting their responses based on their perception of being evaluated. While reverse-coding the questions reduced the magnitude of the bias, it did not eliminate it entirely. This indicates that the observed effects are not solely due to acquiescence bias. The researchers also confirmed that the bias persisted even when the questions were paraphrased and when the order of questions was randomized, further supporting the robustness of their findings.

The researchers acknowledge that their study primarily focused on the Big Five personality traits, which are widely represented in the training data of large language models. It is possible that the same response biases might not occur with less common or less socially evaluative psychological constructs.

Future research should explore the prevalence of social desirability bias across different types of surveys and measurement methods. Another area for further investigation is the role of training data and model development processes in the emergence of these biases. Understanding how these biases are formed and whether they can be mitigated during the training process is essential for ensuring the responsible use of large language models in research and other applications.

Despite these limitations, the study’s findings have significant implications for the use of large language models as proxies for human participants in research. The presence of social desirability bias suggests that results obtained from these models may not always accurately reflect human responses, particularly in the context of personality assessment and other socially sensitive topics.

“As we integrate AI into more parts of our lives, understanding these subtle behaviors and biases becomes crucial,” Eichstaedt and Salecha said. “There needs to be more research into understanding at which stage of the LLM development (pre-training, preference tuning, etc) these biases are being amplified and how to mitigate them without hampering the performance of these models. Whether we’re using LLMs to support research, write content, or even assist in mental health settings, we need to be aware of how these models might unconsciously mimic human flaws—and how that might affect outcomes.”

The study, “Large language models display human-like social desirability biases in Big Five personality surveys,” was authored by Aadesh Salecha, Molly E. Ireland, Shashanka Subrahmanya, João Sedoc, Lyle H Ungar, and Johannes C. Eichstaedt.

RELATED

People cannot tell AI-generated from human-written poetry and they like AI poetry more
Artificial Intelligence

A new study mapped 350,000 relationship stories and found a communication style AI struggles to copy

May 24, 2026
New study links manipulative personality traits to lower relationship intimacy expectations
Artificial Intelligence

Brain scans shed light on why women develop romantic feelings for AI companions

May 22, 2026
Live music causes brain waves to synchronize more strongly with rhythm than recorded music
ADHD Research News

A new AI tool spots hidden signs of adult ADHD months before a formal diagnosis

May 21, 2026
Modern AI is often judged to be more human than actual humans in Turing test experiments
Artificial Intelligence

AI-generated Grokipedia articles are longer, less readable, and cite fewer sources than their Wikipedia counterparts

May 21, 2026
Modern AI is often judged to be more human than actual humans in Turing test experiments
Artificial Intelligence

Modern AI is often judged to be more human than actual humans in Turing test experiments

May 21, 2026
AI-assisted venting can boost psychological well-being, study suggests
Addiction

Artificial intelligence tools answer addiction questions accurately but lack medical nuance

May 15, 2026
Scientists trained AI to talk people out of conspiracy theories — and it worked surprisingly well
Artificial Intelligence

Real-world evidence shows generative AI is making human creative output more uniform

May 14, 2026
Blue light exposure may counteract anxiety caused by chronic vibration
Addiction

AI-designed drug reduces fentanyl consumption in animal models by targeting serotonin receptors

May 12, 2026

Follow PsyPost

The latest research, however you prefer to read it.

Daily newsletter

One email a day. The newest research, nothing else.

Google News

Get PsyPost stories in your Google News feed.

Add PsyPost to Google News
RSS feed

Use your favorite reader. We also syndicate to Apple News.

Copy RSS URL
Social media
Support independent science journalism

Ad-free reading, full archives, and weekly deep dives for members.

Become a member

Trending

  • TikTok disproportionately served anti-Democratic videos during the 2024 election, study finds
  • Neuroscientists discover the brain’s memory center starts “full” and prunes itself down to optimize learning
  • New study links manipulative personality traits to lower relationship intimacy expectations
  • Depression appears to alter how young adults remember childhood trauma and adversity
  • Younger partners and sex toy use are associated with less severe symptoms of menopause

Science of Money

  • What a CEO’s tweets reveal about their paycheck
  • When optimism mutes the message: How investor mood shapes crypto’s response to economic news
  • Why nominal interest rates bite harder than textbooks suggest
  • California’s $20 fast food wage pushed restaurant prices up 3.4% across the state, new analysis finds
  • The psychology of “manifesting”: Why believers feel more successful but often aren’t

PsyPost is a psychology and neuroscience news website dedicated to reporting the latest research on human behavior, cognition, and society. (READ MORE...)

  • Mental Health
  • Neuroimaging
  • Personality Psychology
  • Social Psychology
  • Artificial Intelligence
  • Cognitive Science
  • Psychopharmacology
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and conditions
  • Do not sell my personal information

(c) PsyPost Media Inc

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy

(c) PsyPost Media Inc