10 research outputs found

    Brain-to-text: Decoding spoken phrases from phone representations in the brain

    It has long been speculated whether communication between humans and machines based on natural speech-related cortical activity is possible. Over the past decade, studies have suggested that it is feasible to recognize isolated aspects of speech from neural signals, such as auditory features, phones, or one of a few isolated words. However, until now it remained an unsolved challenge to decode continuously spoken speech from the neural substrate associated with speech and language processing. Here, we show for the first time that continuously spoken speech can be decoded into the expressed words from intracranial electrocorticographic (ECoG) recordings. Specifically, we implemented a system, which we call Brain-To-Text, that models single phones, employs techniques from automatic speech recognition (ASR), and thereby transforms brain activity while speaking into the corresponding textual representation. Our results demonstrate that our system can achieve word error rates as low as 25% and phone error rates below 50%. Additionally, our approach contributes to the current understanding of the neural basis of continuous speech production by identifying those cortical regions that hold substantial information about individual phones. In conclusion, the Brain-To-Text system described in this paper represents an important step toward human-machine communication based on imagined speech.
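
    As a rough, self-contained illustration of the ASR-style step the abstract describes, the sketch below runs Viterbi decoding over per-frame phone log-likelihoods to recover the best phone sequence. The array shapes, the uniform transition model, and all names are assumptions for illustration, not details of the actual Brain-To-Text system.

        # Viterbi decoding over per-frame phone log-likelihoods (illustrative).
        import numpy as np

        def viterbi(log_lik, log_trans):
            """log_lik: (T, P) frame-wise phone log-likelihoods,
            log_trans: (P, P) log transition scores between phones."""
            T, P = log_lik.shape
            score = log_lik[0].copy()
            back = np.zeros((T, P), dtype=int)
            for t in range(1, T):
                cand = score[:, None] + log_trans      # cand[i, j]: prev i -> next j
                back[t] = cand.argmax(axis=0)
                score = cand.max(axis=0) + log_lik[t]
            path = [int(score.argmax())]
            for t in range(T - 1, 0, -1):              # trace back the best path
                path.append(int(back[t][path[-1]]))
            return path[::-1]

        # Toy usage: 20 frames, 5 phones, uniform transitions.
        rng = np.random.default_rng(0)
        print(viterbi(np.log(rng.dirichlet(np.ones(5), size=20)),
                      np.log(np.full((5, 5), 0.2))))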

    Error Correction based on Error Signatures applied to automatic speech recognition

    Speech Processes for Brain-Computer Interfaces

    Speech interfaces have become widely used and are integrated in many applications and devices. However, speech interfaces require the user to produce intelligible speech, which might be hindered by loud environments, concern about bothering bystanders, or a general inability to produce speech due to disabilities. Decoding a user's imagined speech instead of actual speech would solve this problem. Such a Brain-Computer Interface (BCI) based on imagined speech would enable fast and natural communication without the need to actually speak out loud, and could provide a voice to otherwise mute people. This dissertation investigates BCIs based on speech processes using functional Near Infrared Spectroscopy (fNIRS) and Electrocorticography (ECoG), two brain activity imaging modalities on opposing ends of an invasiveness scale. Brain activity data have a low signal-to-noise ratio and complex spatio-temporal and spectral coherence. To analyze these data, techniques from the areas of machine learning, neuroscience, and Automatic Speech Recognition are combined in this dissertation to facilitate robust classification of detailed speech processes while simultaneously illustrating the underlying neural processes. fNIRS is an imaging modality based on cerebral blood flow. It only requires affordable hardware and can be set up within minutes in a day-to-day environment, making it ideally suited for convenient user interfaces. However, the hemodynamic processes measured by fNIRS are slow in nature, and the technology therefore offers poor temporal resolution. We investigate speech in fNIRS and demonstrate classification of speech processes for BCIs based on fNIRS. ECoG provides ideal signal properties by invasively measuring electrical potentials artifact-free directly on the brain surface. High spatial resolution and temporal resolution down to millisecond sampling provide localized information with accurate enough timing to capture the fast processes underlying speech production. This dissertation presents the Brain-to-Text system, which harnesses automatic speech recognition technology to decode a textual representation of continuous speech from ECoG. This could allow users to compose messages or issue commands through a BCI. While the decoding of a textual representation is unparalleled for device control and typing, direct communication is even more natural if the full expressive power of speech, including emphasis and prosody, can be provided. For this purpose, a second system is presented, which directly synthesizes neural signals into audible speech and could enable conversation with friends and family through a BCI. Up to now, both the Brain-to-Text and the synthesis system operate on audibly produced speech. To bridge the gap to the final frontier of neural prostheses based on imagined speech processes, we investigate the differences between audibly produced and imagined speech and present first results towards BCIs based on imagined speech processes. This dissertation demonstrates the use of speech processes as a paradigm for BCIs for the first time. Speech processes offer a fast and natural interaction paradigm which will help patients and healthy users alike to communicate efficiently with computers and with friends and family through BCIs.
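
    To make the fNIRS classification setting concrete, here is a minimal sketch in the spirit of such experiments: the slow hemodynamic response of each trial is summarized by a window average per channel and classified with a linear model. The synthetic data, window choice, and classifier are assumptions for illustration, not the dissertation's actual pipeline.

        # Window-averaged hemodynamic features + linear classification (illustrative).
        import numpy as np
        from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
        from sklearn.model_selection import cross_val_score

        rng = np.random.default_rng(0)
        trials = rng.normal(size=(80, 20, 100))    # 80 trials, 20 channels, 100 samples
        labels = rng.integers(0, 2, size=80)       # e.g. speech vs. rest

        features = trials[:, :, 40:].mean(axis=2)  # mean level after an onset delay
        clf = LinearDiscriminantAnalysis()
        print(cross_val_score(clf, features, labels, cv=5).mean())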

    Automatic Recognition of Concurrent and Coupled Human Motion Sequences

    We developed methods and algorithms for all parts of a motion recognition system, i.e., Feature Extraction, Motion Segmentation and Labeling, Motion Primitive and Context Modeling, as well as Decoding. We collected several datasets to compare our proposed methods with the state of the art in human motion recognition. The main contributions of this thesis are a structured functional motion decomposition and a flexible and scalable motion recognition system suitable for a humanoid robot.
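
    As an illustration of the feature-extraction stage such a system needs, the sketch below computes simple sliding-window statistics over raw motion channels; the window length, hop size, and choice of statistics are assumptions, not taken from the thesis.

        # Sliding-window mean/std features over multichannel motion data (illustrative).
        import numpy as np

        def window_features(signal, win=32, hop=16):
            """signal: (T, C) motion channels -> (N, 2*C) mean/std per window."""
            frames = []
            for start in range(0, len(signal) - win + 1, hop):
                w = signal[start:start + win]
                frames.append(np.concatenate([w.mean(axis=0), w.std(axis=0)]))
            return np.array(frames)

        print(window_features(np.random.default_rng(0).normal(size=(200, 6))).shape)  # (11, 12)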

    Workshops of the Sixth International Brain–Computer Interface Meeting: brain–computer interfaces past, present, and future

    Brain–computer interfaces (BCI) (also referred to as brain–machine interfaces; BMI) are, by definition, an interface between the human brain and a technological application. Brain activity for interpretation by the BCI can be acquired with either invasive or non-invasive methods. The key point is that the signals that are interpreted come directly from the brain, bypassing sensorimotor output channels that may or may not have impaired function. This paper provides a concise glimpse of the breadth of BCI research and development topics covered by the workshops of the 6th International Brain–Computer Interface Meeting.

    BioKIT - Real-time decoder for biosignal processing

    We introduce BioKIT, a new Hidden Markov Model based toolkit to preprocess, model, and interpret biosignals such as speech, motion, muscle, and brain activities. The focus of this toolkit is to enable researchers from various communities to pursue their experiments and integrate real-time biosignal interpretation into their applications. BioKIT boasts a flexible two-layer structure with a modular C++ core that interfaces with a Python scripting layer to facilitate development of new applications. BioKIT employs sequence-level parallelization and memory sharing across threads. Additionally, a fully integrated error blaming component facilitates in-depth analysis. A generic terminology keeps the barrier to entry for researchers from multiple fields to a minimum. We describe our online-capable dynamic decoder and report on initial experiments on three different tasks. The presented speech recognition experiments employ Kaldi [1] trained deep neural networks, with the results set in relation to the real-time factor needed to obtain them.
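
    The real-time factor mentioned above is the processing time divided by the duration of the processed signal; a decoder with a real-time factor below 1.0 keeps up with the incoming signal. A minimal timing sketch, with a hypothetical decoder callable standing in for any real API:

        # Real-time factor: processing time / signal duration (illustrative).
        import time

        def real_time_factor(decode_fn, signal, duration_s):
            start = time.perf_counter()
            decode_fn(signal)                      # hypothetical decoder call
            return (time.perf_counter() - start) / duration_s

        rtf = real_time_factor(lambda s: sum(s), list(range(16000)), duration_s=1.0)
        print(f"RTF: {rtf:.4f}")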

    Advancing Pattern Recognition Techniques for Brain-Computer Interfaces: Optimizing Discriminability, Compactness, and Robustness

    In this dissertation, we formulate three central target criteria for the systematic advancement of pattern recognition in modern brain-computer interfaces (BCIs). Building on these, a pattern recognition framework for BCIs is developed that unites the three criteria through a new optimization algorithm. Furthermore, we demonstrate the successful application of our approach to two novel BCI paradigms for which no established pattern recognition methodology exists yet.

    Modeling and Recognition of Three-Dimensional Handwriting Using Inertial Sensors

    This dissertation presents Airwriting, a technology that enables hands-free, always-available, and easy-to-learn text entry for wearable computing systems by writing in the air. Motion is captured with body-worn inertial sensors. In addition, the inertial-sensor-based recognition of traditional pen-written handwriting is addressed, and gesture-based text entry is combined with gesture control.
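
    As a small illustration of working with body-worn inertial data, the sketch below separates the slowly varying gravity component from the writing motion with a moving average; the filter and window length are assumptions, not details from the dissertation.

        # Remove the slowly varying gravity component from accelerometer data (illustrative).
        import numpy as np

        def remove_gravity(acc, win=50):
            """acc: (T, 3) accelerometer samples -> motion component, same shape."""
            kernel = np.ones(win) / win
            gravity = np.column_stack(
                [np.convolve(acc[:, i], kernel, mode="same") for i in range(3)])
            return acc - gravity

        acc = np.random.default_rng(0).normal(size=(500, 3)) + np.array([0.0, 0.0, 9.81])
        print(remove_gravity(acc).mean(axis=0))    # gravity offset largely removed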

    Adaptive Cognitive Interaction Systems

    Adaptive cognitive interaction systems observe and model the state of their user and adapt the system behavior accordingly. Such a system consists of three components: the empirical cognitive model, the computational cognitive model, and the adaptive interaction manager. This thesis makes numerous contributions to the development of these components as well as to their combination. The results are validated in numerous user studies.
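
    A schematic sketch of the loop described above: an empirical model estimates the user's state from measurements, and an interaction manager adapts the system behavior. The toy workload estimate and threshold are placeholders, not values from the thesis.

        # Estimate user state, then adapt system behavior (illustrative placeholders).
        def empirical_model(sensor_sample):
            return sum(sensor_sample) / len(sensor_sample)   # toy workload estimate

        def interaction_manager(workload, high=0.7):
            return "reduce information density" if workload > high else "default dialog"

        for sample in ([0.2, 0.3, 0.4], [0.8, 0.9, 0.7]):
            print(interaction_manager(empirical_model(sample)))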

    Rapid Generation of Pronunciation Dictionaries for new Domains and Languages

    This dissertation presents innovative strategies and methods for the rapid generation of pronunciation dictionaries for new domains and languages. Depending on the conditions, solutions are proposed and developed, ranging from the straightforward scenario in which the target language is present in written form on the Internet and the mapping between speech and written language is close, to the difficult scenario in which no written form of the target language exists.
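
    To make the "close mapping" end of that spectrum concrete, here is a toy rule-based grapheme-to-phoneme converter: when the orthography maps almost one-to-one onto phones, a small rule table already yields usable pronunciations. The rules below are invented for illustration and are not the dissertation's method.

        # Longest-match rule-based grapheme-to-phoneme conversion (illustrative rules).
        RULES = {"sch": "S", "ch": "x", "ei": "aI", "a": "a", "b": "b", "d": "d",
                 "e": "e", "i": "i", "l": "l", "n": "n", "r": "r", "s": "s", "t": "t"}

        def g2p(word):
            phones, i = [], 0
            while i < len(word):
                for n in (3, 2, 1):                # try the longest rule first
                    if word[i:i + n] in RULES:
                        phones.append(RULES[word[i:i + n]])
                        i += n
                        break
                else:
                    i += 1                         # skip unmapped graphemes
            return phones

        print(g2p("schnell"))                      # ['S', 'n', 'e', 'l', 'l']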