Search CORE

32 research outputs found

Recommended from our members

Cortical encoding and decoding models of speech production

Author: Chartier Josh
Publication venue: eScholarship, University of California
Publication date: 01/01/2019
Field of study

To speak is to dynamically orchestrate the movements of the articulators (jaw, tongue, lips, and larynx), which in turn generate speech sounds. It is an amazing mental and motor feat that is controlled by the brain and is fundamental for communication. Technology that could translate brain signals into speech would be transformative for people who are unable to communicate as a result of neurological impairments. This work first investigates how articulator movements that underlie natural speech production are represented in the brain. Building upon this, this work also presents a neural decoder that can synthesize audible speech from brain signals. Data to support these results were from direct cortical recordings of the human sensorimotor cortex while participants spoke natural sentences. Neural activity at individual electrodes encoded a diversity of articulatory kinematic trajectories (AKTs), each revealing coordinated articulator movements towards specific vocal tract shapes. The neural decoder was designed to leverage the kinematic trajectories encoded in the sensorimotor cortex which enhanced performance even with limited data. In closed vocabulary tests, listeners could readily identify and transcribe speech synthesized from cortical activity. These findings advance the clinical viability of using speech neuroprosthetic technology to restore spoken communication

eScholarship - University of California

Characterization and Decoding of Speech Representations From the Electrocorticogram

Author: Chakrabarti Shreya
Publication venue: ODU Digital Commons
Publication date: 01/07/2015
Field of study

Millions of people worldwide suffer from various neuromuscular disorders such as amyotrophic lateral sclerosis (ALS), brainstem stroke, muscular dystrophy, cerebral palsy, and others, which adversely affect the neural control of muscles or the muscles themselves. The patients who are the most severely affected lose all voluntary muscle control and are completely locked-in, i.e., they are unable to communicate with the outside world in any manner. In the direction of developing neuro-rehabilitation techniques for these patients, several studies have used brain signals related to mental imagery and attention in order to control an external device, a technology known as a brain-computer interface (BCI). Some recent studies have also attempted to decode various aspects of spoken language, imagined language, or perceived speech directly from brain signals. In order to extend research in this direction, this dissertation aims to characterize and decode various speech representations popularly used in speech recognition systems directly from brain activity, specifically the electrocorticogram (ECoG). The speech representations studied in this dissertation range from simple features such as the speech power and the fundamental frequency (pitch), to complex representations such as the linear prediction coding and mel frequency cepstral coefficients. These decoded speech representations may eventually be used to enhance existing speech recognition systems or to reconstruct intended or imagined speech directly from brain activity. This research will ultimately pave the way for an ECoG-based neural speech prosthesis, which will offer a more natural communication channel for individuals who have lost the ability to speak normally

Recommended from our members

A speech envelope landmark for syllable encoding in human superior temporal gyrus.

Author: Chang Edward F
Oganian Yulia
Publication venue: eScholarship, University of California
Publication date: 01/11/2019
Field of study

The most salient acoustic features in speech are the modulations in its intensity, captured by the amplitude envelope. Perceptually, the envelope is necessary for speech comprehension. Yet, the neural computations that represent the envelope and their linguistic implications are heavily debated. We used high-density intracranial recordings, while participants listened to speech, to determine how the envelope is represented in human speech cortical areas on the superior temporal gyrus (STG). We found that a well-defined zone in middle STG detects acoustic onset edges (local maxima in the envelope rate of change). Acoustic analyses demonstrated that timing of acoustic onset edges cues syllabic nucleus onsets, while their slope cues syllabic stress. Synthesized amplitude-modulated tone stimuli showed that steeper slopes elicited greater responses, confirming cortical encoding of amplitude change, not absolute amplitude. Overall, STG encoding of the timing and magnitude of acoustic onset edges underlies the perception of speech temporal structure

eScholarship - University of California

Characterization of Language Cortex Activity During Speech Production and Perception

Author: Baker Hassan
Publication venue: ODU Digital Commons
Publication date: 01/01/2018
Field of study

Millions of people around the world suffer from severe neuromuscular disorders such as spinal cord injury, cerebral palsy, amyotrophic lateral sclerosis (ALS), and others. Many of these individuals cannot perform daily tasks without assistance and depend on caregivers, which adversely impacts their quality of life. A Brain-Computer Interface (BCI) is technology that aims to give these people the ability to interact with their environment and communicate with the outside world. Many recent studies have attempted to decode spoken and imagined speech directly from brain signals toward the development of a natural-speech BCI. However, the current progress has not reached practical application. An approach to improve the performance of this technology is to better understand the underlying speech processes in the brain for further optimization of existing models. In order to extend research in this direction, this thesis aims to characterize and decode the auditory and articulatory features from the motor cortex using the electrocorticogram (ECoG). Consonants were chosen as auditory representations, and both places of articulation and manners of articulation were chosen as articulatory representations. The auditory and articulatory representations were decoded at different time lags with respect to the speech onset to determine optimal temporal decoding parameters. In addition, this work explores the role of the temporal lobe during speech production directly from ECoG signals. A novel decoding model using temporal lobe activity was developed to predict a spectral representation of the speech envelope during speech production. This new knowledge may be used to enhance existing speech-based BCI systems, which will offer a more natural communication modality. In addition, the work contributes to the field of speech neurophysiology by providing a better understanding of speech processes in the brain

Neurolinguistics Research Advancing Development of a Direct-Speech Brain-Computer Interface

Author: Alderson-Day
Alderson-Day
Aleman
Baddeley
Badre
Basho
Bates
Benetello
Berwick
Bin
Blakely
Bocquelet
Bouchard
Bouton
Brocklehurst
Brumberg
Brumberg
Chakrabarti
Chaudhary
Chi
Corley
DaSalla
Dell
Dell
Deng
Derix
Derix
Di Liberto
Dolcos
Donchin
D’albis
D’Zmura
Edwards
Emerson
Fargier
Fernyhough
Filik
Frazier
Friederici
Friederici
Friederici
Geva
Geva
Gilbert
González-Castañeda
Goto
Guenther
Hashim
Hauk
Henseler
Herff
Herff
Herff
Herff
Hickok
Hickok
Hickok
Hirshorn
Hochberg
Huang
Hubbard
Hurlburt
Hurlburt
Ibayashi
Ikeda
Iljina
Indefrey
Indefrey
Indefrey
Jackendoff
Jones
Kanas
Kellis
Kielar
Lambon Ralph
Leuthardt
Levelt
Levelt
Levelt
Levelt
Livezey
Llorens
Lotte
Love
MacKay
Mainy
Martin
Martin
Martínez-Manrique
Marvel
McGuire
McGuire
Miyake
Mohr
Morin
Moses
Mugler
Mugler
Nguyen
Oken
Oppenheim
Oppenheim
Palmer
Pandarinath
Paulesu
Paulesu
Pei
Pei
Perrone-Bertolotti
Pickering
Piñango
Price
Pulvermüller
Pulvermüller
Ramsey
Rezazadeh Sereshkeh
Riès
Ruescher
Scott
Sedivy
Segalowitz
Shergill
Shergill
Shuster
Stark
Szekely
Tabar
Tabar
Tian
Torres-García
Vigneau
Wang
Wang
Wheeldon
Whitney
Wildgruber
Yoshimura
Zaccarella
Zaccarella
Zatorre
Zhang
Publication venue: 'Elsevier BV'
Publication date: 01/10/2018
Field of study

A direct-speech brain-computer interface (DS-BCI) acquires neural signals corresponding to imagined speech, then processes and decodes these signals to produce a linguistic output in the form of phonemes, words, or sentences. Recent research has shown the potential of neurolinguistics to enhance decoding approaches to imagined speech with the inclusion of semantics and phonology in experimental procedures. As neurolinguistics research findings are beginning to be incorporated within the scope of DS-BCI research, it is our view that a thorough understanding of imagined speech, and its relationship with overt speech, must be considered an integral feature of research in this field. With a focus on imagined speech, we provide a review of the most important neurolinguistics research informing the field of DS-BCI and suggest how this research may be utilized to improve current experimental protocols and decoding techniques. Our review of the literature supports a cross-disciplinary approach to DS-BCI research, in which neurolinguistics concepts and methods are utilized to aid development of a naturalistic mode of communication. : Cognitive Neuroscience; Computer Science; Hardware Interface Subject Areas: Cognitive Neuroscience, Computer Science, Hardware Interfac

Directory of Open Access Journals

Ulster University's Research Portal

Recommended from our members

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

Author: Chang Edward F
Leonard Matthew K
Makin Joseph G
Moses David A
Publication venue: eScholarship, University of California
Publication date: 01/07/2019
Field of study

Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate

eScholarship - University of California

The cortical processing of speech sounds in the temporal lobe

Author: Chang E.
Sjerps M.
Publication venue
Publication date: 01/10/2019
Field of study