PsyPost
  • Mental Health
  • Social Psychology
  • Cognitive Science
  • Neuroscience
  • About
No Result
View All Result
Join
My Account
PsyPost
No Result
View All Result
Home Exclusive Cognitive Science

How machine learning can help with voice disorders

by Massachusetts Institute of Technology
August 29, 2016
Reading Time: 3 mins read
Photo credit: kengmerry/Fotolia

Photo credit: kengmerry/Fotolia

Share on TwitterShare on Facebook

There’s no human instinct more basic than speech, and yet, for many people, talking can be taxing. 1 in 14 working-age Americans suffer from voice disorders that are often associated with abnormal vocal behaviors – some of which can cause damage to vocal cord tissue and lead to the formation of nodules or polyps that interfere with normal speech production.

Unfortunately, many behaviorally-based voice disorders are not well understood. In particular, patients with muscle tension dysphonia (MTD) often experience deteriorating voice quality and vocal fatigue (“tired voice”) in the absence of any clear vocal cord damage or other medical problems, which makes the condition both hard to diagnose and hard to treat.

But a team from MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) and Massachusetts General Hospital (MGH) believes that better understanding of conditions like MTD is possible through machine learning.

Using accelerometer data collected from a wearable device developed by researchers at the MGH Voice Center, researchers demonstrated that they can detect differences between subjects with MTD and matched controls. The same methods also showed that, after receiving voice therapy, MTD subjects exhibited behavior that was more similar to that of the controls.

“We believe this approach could help detect disorders that are exacerbated by vocal misuse, and help to empirically measure the impact of voice therapy,” says MIT PhD student Marzyeh Ghassemi, who is first author on a related paper that she presented at last week’s Machine Learning in Health Care (MLHC) conference in Los Angeles. “Our long-term goal is for such a system to be used to alert patients when they are using their voices in ways that could lead to problems.”

The paper’s co-authors include MIT professor John Guttag; Zeeshan Syed, CEO of the machine-learning start-up Health[at]Scale; and Drs. Robert Hillman, Daryush Mehta and Jarrad H. Van Stan of Massachusetts General Hospital.

How it works

Existing approaches to applying machine learning to physiological signals often involve supervised learning, in which researchers painstakingly label data and provide desired outputs. Besides being time-consuming, such methods currently can’t actually help classify utterances as normal or abnormal, because there is currently not a good understanding of the correlations between accelerometer data and voice misuse.

Google News Preferences Add PsyPost to your preferred sources

Because the CSAIL team did not know when vocal misuse was occurring, they opted to use unsupervised learning, where data is unlabeled at the instance level.

“People with vocal disorders aren’t always misusing their voices, and people without disorders also occasionally misuse their voices,” says Ghassemi. “The difficult task here was to build a learning algorithm that can determine what sort of vocal cord movements are prominent in subjects with a disorder.”

The study was broken into two groups: patients that had been diagnosed with voice disorders, and a control group of individuals without disorders. Each group went about their daily activities while wearing accelerometers on their necks that captured the motions of their vocal folds.

Researchers then looked at the two groups’ data, analyzing more than 110 million “glottal pulses” that each represent one opening and closing of the vocal folds. By comparing clusters of pulses, the team could detect significant differences between patients and controls.

The team also found that after voice therapy the distribution of patients’ glottal pulses were more similar to those of the controls. According to Guttag, this is the first such study to use machine learning to provide objective evidence of the positive effects of voice therapy.

“When a patient comes in for therapy, you might only be able to analyze their voice for 20 or 30 minutes to see what they’re doing incorrectly and have them practice better techniques,” says Dr. Susan Thibeault, a professor at the department of surgery at the University of Wisconsin School of Medicine and Public Health who was not involved in the research. “As soon as they leave, we don’t really know how well they’re doing, and so it’s exciting to think that we could eventually give patients wearable devices that use round-the-clock data to provide more immediate feedback.”

Looking ahead

One long-term goal of the work is to be able to use the data not just to improve the lives of those with voice disorders, but to potentially help diagnose specific disorders.

The team also hopes to further explore the underlying reason why certain kinds of vocal pulses are more common in patients than in controls.

“Ultimately we hope this work will lead to smartphone-based biofeedback,” says Hillman. “That sort of technology can help with the most challenging aspect of voice therapy: getting patients to actually employ the healthier vocal behaviors that they learned in therapy in their everyday lives.”

RELATED

Pupil response can reveal the depths of depression
Cognitive Science

New research shows the brain relies on whole faces, not just eyes, to decode emotions

June 1, 2026
Sharing false political information is associated with heightened schizotypy
Cognitive Science

How partisan loyalty affects our ability to spot false claims

May 31, 2026
Researchers identify a peculiar tendency among insecure narcissists
Cognitive Science

New study suggests the brain applies different standards of beauty to paintings and architecture

May 31, 2026
Live music causes brain waves to synchronize more strongly with rhythm than recorded music
Cognitive Science

How learning to read alters the brain’s approach to spoken language

May 29, 2026
Social class narcissism linked to anti-psychiatry conspiracy theories
Cognitive Science

The psychology of paradoxical thinking: Extreme arguments in favor of a controversial topic can reduce overall support

May 28, 2026
New study reveals key psychological traits linked to generativity in older adults
Cognitive Science

The cognitive difference between amateur and expert chess players

May 26, 2026
Psychologists developed a 20-minute tool to help people reframe their depression as a source of strength
Cognitive Science

General intelligence and a strong work ethic are the best predictors of college grades

May 25, 2026
What 50 years of data say about the happiness of single parents
Cognitive Science

Does the smell of pine make you smarter?

May 24, 2026

Follow PsyPost

The latest research, however you prefer to read it.

Daily newsletter

One email a day. The newest research, nothing else.

Google News

Get PsyPost stories in your Google News feed.

Add PsyPost to Google News
RSS feed

Use your favorite reader. We also syndicate to Apple News.

Copy RSS URL
Social media
Support independent science journalism

Ad-free reading, full archives, and weekly deep dives for members.

Become a member

Trending

  • More than half of adults with ADHD in clinical settings have a co-occurring personality disorder
  • New study links parental indulgence to psychopathic and narcissistic traits in adulthood
  • How learning to read alters the brain’s approach to spoken language
  • The psychology of paradoxical thinking: Extreme arguments in favor of a controversial topic can reduce overall support
  • Men’s sexual desire peaks around age 40, large new study finds

Science of Money

  • Class isn’t dead: Your job title still predicts your wealth in Europe, a five-country study finds
  • Packing products tightly on shelves makes shoppers grab more flavors
  • When your job feels scriptable: How routine work and AI anxiety drain employee energy
  • Childhood obesity and the American Dream: New research links early weight to lower lifetime mobility
  • The brain chemical behind your money moves: How dopamine shapes financial choices

PsyPost is a psychology and neuroscience news website dedicated to reporting the latest research on human behavior, cognition, and society. (READ MORE...)

  • Mental Health
  • Neuroimaging
  • Personality Psychology
  • Social Psychology
  • Artificial Intelligence
  • Cognitive Science
  • Psychopharmacology
  • Contact us
  • Disclaimer
  • Privacy policy
  • Terms and conditions
  • Do not sell my personal information

(c) PsyPost Media Inc

Welcome Back!

Login to your account below

Forgotten Password?

Retrieve your password

Please enter your username or email address to reset your password.

Log In

Add New Playlist

Subscribe
  • My Account
  • Cognitive Science Research
  • Mental Health Research
  • Social Psychology Research
  • Drug Research
  • Relationship Research
  • About PsyPost
  • Contact
  • Privacy Policy

(c) PsyPost Media Inc