Subscribe
The latest psychology and neuroscience discoveries.
My Account
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Neuroscience
  • About
No Result
View All Result
PsyPost
PsyPost
No Result
View All Result
Home Exclusive Artificial Intelligence

How well can ChatGPT-4 write APA-style psychology papers?

by Bianca Setionago
June 2, 2024
in Artificial Intelligence
(Photo credit: Adobe Stock)

(Photo credit: Adobe Stock)

Share on TwitterShare on Facebook

In a recent study published in Contemporary School Psychology, researchers have put the latest AI technology to the test in academic writing, revealing both its potential and limitations.

Artificial Intelligence (AI) has been making waves in various fields, and academia is no exception. Tools powered by AI such as Grammarly and Turnitin have become staples for students and researchers, helping to refine writing by checking for grammar and ensuring originality of written work, respectively. However the capabilities of these tools, particularly in autonomously generating coherent, reliable, and scientifically accurate content, remain under scrutiny.

Led by Adam B Lockwood and Joshua Castleberry from Kent State University, the study aimed to evaluate Generative Pre-trained Transformer 4 (GPT-4), a popular advanced AI language model developed by OpenAI, in writing American Psychological Association (APA)-style psychology papers.

While recent advancements in technology have enabled these sophisticated language models to produce what resembles human-written information, the researchers were curious to assess performance of GPT-4 in three areas: substantiation of claims, factual accuracy, and referencing.

Lockwood and Castleberry entered the following prompt into GPT-4, “Write a 2500-word manuscript on the ethical dilemmas of using ChatGPT to write for psychological and educational reports. Address how APA and NASP guidelines, as well as HIPAA and FERPA laws pertain to these ethical dilemmas. Provide recommendations for overcoming these limitations. Provide citations and references in APA formatting.”

GPT-4 provided a 1814-word document, but after removal of the title, abstract, keywords, headings, and references, a 1043-word paper remained which comprised 45 sentences.

Out of 42 sentences should have been supported by an in-text citation, only 17 (40.5%) were correctly substantiated. The remaining 25 sentences did not have a citation (40%), possessed a citation that did not exist (40%), or were supported by a citation that was irrelevant to the claim being made in the paper (20%).

To check scientific accuracy of the 25 unsubstantiated claims, the researchers were fully able to confirm the accuracy of 14 using other sources, and partially confirm accuracy of 3 more sentences (i.e. the other sources did not explicitly state the claim, but it could be inferred). Thus in total, 31 (73.8%) of sentences were verified.

Google News Preferences Add PsyPost to your preferred sources

Finally, 16 references were provided at the end of the paper – 12 referenced real websites; errors were found on 5 of these (1 listed incorrect authors, 1 failed to provide a Digital Object Identifier (DOI) and 3 provided incorrect links). With the remaining 4 references, 1 was to the wrong article and the 3 remaining links were broken.

Lockwood and Castleberry concluded, “While GPT-4 demonstrated some capability in generating factually accurate information and producing APA-style citations, there were notable limitations. The substantial number of unsubstantiated claims and the presence of errors in citations and referencing indicate the need for further refinement and that we cannot blindly rely on GPT-4 to write papers.”

Some limitations should be noted. The study’s focus on a single paper may not be representative of GPT-4’s overall performance, and the use of specific prompts may have biased GPT-4’s output, suggesting that further research is needed to fully understand its capabilities.

The study, “Examining the Capabilities of GPT-4 to Write an APA-Style School Psychology Paper,” was authored by Adam B. Lockwood and Joshua Castleberry.

Previous Post

New neuroscience research shows the lasting impact of poverty on language processing

Next Post

Scientists are using VR to study cocaine cravings

RELATED

Scientists just uncovered a major limitation in how AI models understand truth and belief
Artificial Intelligence

The bystander effect applies to virtual agents, new psychology research shows

March 12, 2026
Scientists identify a fat-derived hormone that drives the mood benefits of exercise
Artificial Intelligence

Therapists test an AI dating simulator to help chronically single men practice romantic skills

March 9, 2026
Researchers identify two psychological traits that predict conspiracy theory belief
Artificial Intelligence

Brain-controlled assistive robots work best when they share the workload with users

March 8, 2026
Why most people fail to spot AI-generated faces, while super-recognizers have a subtle advantage
Artificial Intelligence

Why most people fail to spot AI-generated faces, while super-recognizers have a subtle advantage

February 28, 2026
People with social anxiety more likely to become overdependent on conversational artificial intelligence agents
Artificial Intelligence

AI therapy is rated higher for empathy until people learn a machine wrote the text

February 26, 2026
New research: AI models tend to reflect the political ideologies of their creators
Artificial Intelligence

New research: AI models tend to reflect the political ideologies of their creators

February 26, 2026
Stress disrupts gut and brain barriers by reducing key microbial metabolites, study finds
Artificial Intelligence

AI and mental health: New research links use of ChatGPT to worsened psychiatric symptoms

February 24, 2026
Stanford scientist discovers that AI has developed an uncanny human-like ability
Artificial Intelligence

How personality and culture relate to our perceptions of artificial intelligence

February 23, 2026

STAY CONNECTED

LATEST

Your personality and upbringing predict if you will lean toward science or faith

Veterans are no more likely than the general public to support political violence

People with social anxiety are less likely to experience a post-sex emotional glow

The extreme male brain theory of autism applies more strongly to females

A newly discovered brain cluster acts as an on and off switch for sex differences

Researchers identify personality traits that predict alcohol relapse after treatment

New study links the fatigue of depression to overworked cellular power plants

New study reveals risk factors for suicidal thoughts in people with gambling problems

PsyPost is a psychology and neuroscience news website dedicated to reporting the latest research on human behavior, cognition, and society. (READ MORE...)

  • Mental Health
  • Neuroimaging
  • Personality Psychology
  • Social Psychology
  • Artificial Intelligence
  • Cognitive Science
  • Psychopharmacology
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and conditions
  • Do not sell my personal information

(c) PsyPost Media Inc

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy

(c) PsyPost Media Inc