Subscribe
The latest psychology and neuroscience discoveries.
My Account
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Neuroscience
  • About
No Result
View All Result
PsyPost
PsyPost
No Result
View All Result
Home Exclusive Artificial Intelligence

Large language models outperform experts in predicting neuroscience discoveries

by Mane Kara-Yakoubian
January 16, 2025
in Artificial Intelligence, Neuroimaging
(Photo credit: Adobe Stock)

(Photo credit: Adobe Stock)

Share on TwitterShare on Facebook

Large language models surpass human experts in predicting neuroscience results, according to a study published in Nature Human Behaviour.

Scientific research is increasingly challenging due to the immense growth in published literature. Integrating noisy and voluminous findings to predict outcomes often exceeds human capacity. This investigation was motivated by the growing role of artificial intelligence in tasks such as protein folding and drug discovery, raising the question of whether LLMs could similarly enhance fields like neuroscience.

Xiaoliang Luo and colleagues developed BrainBench, a benchmark designed to test whether LLMs could predict the results of neuroscience studies more accurately than human experts. BrainBench included 200 test cases based on neuroscience research abstracts. Each test case consisted of two versions of the same abstract: one was the original, and the other had a modified result that changed the study’s conclusion but kept the rest of the abstract coherent. Participants—both LLMs and human experts—were tasked with identifying which version was correct.

The study involved 171 human participants, all neuroscience experts with an average of 10 years of experience, including doctoral students, postdoctoral researchers, and academic staff. On the computational side, general-purpose LLMs were tested alongside BrainGPT, a specialized model fine-tuned with over 1.3 billion tokens from neuroscience literature. BrainBench covered five major subfields of neuroscience (behavioral/cognitive, cellular/molecular, systems/circuits, neurobiology of disease, and development/plasticity/repair), to ensure a comprehensive assessment.

To evaluate LLMs, the researchers used a metric called “perplexity,” which measures how well the models predict text sequences, while human accuracy was measured based on correct answers. The researchers also ensured the test items were not present in the LLMs’ training data, eliminating concerns about memorization.

LLMs significantly outperformed human experts in predicting neuroscience study outcomes. On average, LLMs achieved 81.4% accuracy, compared to 63.4% for human participants. BrainGPT, the model fine-tuned with neuroscience knowledge, performed even better, improving accuracy by 3% over general-purpose LLMs. This specialized training allowed BrainGPT to excel across all five neuroscience subfields included in the benchmark.

One key advantage of the LLMs was their ability to integrate information from the entire abstract, including the background and methods, rather than relying on isolated details. When tested with only the results section, their accuracy dropped, demonstrating the importance of contextual understanding. Human experts, by contrast, struggled to achieve the same level of integration. Additionally, both humans and LLMs showed higher accuracy when they were confident in their predictions, but LLMs displayed better alignment between confidence and correctness.

Importantly, the study confirmed that LLMs’ success was not due to memorization but rather their ability to recognize patterns in neuroscience research, highlighting their potential to assist in scientific discovery.

Google News Preferences Add PsyPost to your preferred sources

The authors acknowledge that BrainBench, while innovative, is labor-intensive to create. Moreover, there is a risk that reliance on LLM predictions could discourage researchers from pursuing studies that contradict AI predictions, potentially stifling innovation.

The study, “Large language models surpass human experts in predicting neuroscience results,” was authored by Xiaoliang Luo, Akilles Rechardt, Guangzhi Sun, Kevin K. Nejad, Felipe Yáñez, and colleagues.

Previous Post

Genetics, not shared environments, drives parent-child similarities in cognitive ability

Next Post

Transgender athletes’ rights was opposed by those who viewed female athletes as undeserving, study finds

RELATED

What we know about a person changes how our brain processes their face
Neuroimaging

More time spent on social media is linked to a thinner cerebral cortex in young adolescents

April 15, 2026
People ascribe intentions and emotions to both human- and AI-made art, but still report stronger emotions for artworks made by humans
Artificial Intelligence

New research links personality traits to confidence in recognizing artificial intelligence deception

April 13, 2026
Breathwork shows promise in reducing stress, anxiety and depression, according to a new meta-analysis
Meditation

Advanced meditation techniques linked to younger brain age during sleep

April 13, 2026
Psychology researchers identify a “burnout to extremism” pipeline
Narcissism

Narcissistic traits are linked to a brain area governing emotional control

April 12, 2026
Albumin and cognitive decline: Common urine test may help predict dementia risk
Neuroimaging

Reduced gray matter and altered brain connectivity are linked to problematic smartphone use

April 12, 2026
Scientists just found a novel way to uncover AI biases — and the results are unexpected
Artificial Intelligence

Artificial intelligence makes consumers more impatient

April 11, 2026
Cognitive dissonance helps explain why Trump supporters remain loyal, new research suggests
Meditation

Scientists discover intriguing brainwave patterns linked to rhythmic sound meditation

April 11, 2026
Hemp-derived cannabigerol shows promise in reducing anxiety — and maybe even improving memory
Cannabis

Scientists uncover the neurological mechanisms behind cannabis-induced “munchies”

April 10, 2026

STAY CONNECTED

RSS Psychology of Selling

  • Why personalized ads sometimes backfire: A research review explains when tailoring messages works and when it doesn’t
  • The common advice to avoid high customer expectations may not be backed by evidence
  • Personality-matched persuasion works better, but mismatched messages can backfire
  • When happy customers and happy employees don’t add up: How investor signals have shifted in the social media age
  • Correcting fake news about brands does not backfire, five-study experiment finds

LATEST

Children with obesity face a steep decline in adult economic mobility

Finnish cold-water swimmers reveal how frigid dips cure the modern rush

Children with ADHD report applying less effort on cognitive tasks compared to their peers

Can psychedelics help trauma survivors reconnect intimately?

Cannabinoid use is linked to both pro- and anti-inflammatory effects, massive review finds

New psychology study links relationship insecurity to the pursuit of wealth and status

Republican lawmakers lead the trend of using insults to chase media attention instead of policy wins

Scientists wired up volunteers’ genitals and had them watch animals hump to test a long-held theory

PsyPost is a psychology and neuroscience news website dedicated to reporting the latest research on human behavior, cognition, and society. (READ MORE...)

  • Mental Health
  • Neuroimaging
  • Personality Psychology
  • Social Psychology
  • Artificial Intelligence
  • Cognitive Science
  • Psychopharmacology
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and conditions
  • Do not sell my personal information

(c) PsyPost Media Inc

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy

(c) PsyPost Media Inc