Brain Computer Interfaces for Silent Speech

Yousef Rezaei Tabar; Ugur Halici

doi:10.1017/S1062798716000569

Brain Computer Interfaces for Silent Speech

Published online by Cambridge University Press: 22 December 2016

Yousef Rezaei Tabar and

Ugur Halici

Show author details

Yousef Rezaei Tabar: Affiliation:
Biomedical Engineering, Middle East Technical University, Ankara, Turkey. E-mail: rezaeetabar@gmail.com
Ugur Halici: Affiliation:
Biomedical Engineering, Neuroscience and Neurotechnology, Electrical and Electronics Engineering, Middle East Technical University, Ankara, Turkey. E-mail: halici@metu.edu.tr

Article contents

Abstract
Introduction
Measuring Brain Activity
Brain Activities Used in EEG-based BCI
EEG Signal Processing
BCI Applications for Silent Speech
Conclusions
References

Rights & Permissions

Abstract

Brain Computer Interface (BCI) systems provide control of external devices by using only brain activity. In recent years, there has been a great interest in developing BCI systems for different applications. These systems are capable of solving daily life problems for both healthy and disabled people. One of the most important applications of BCI is to provide communication for disabled people that are totally paralysed. In this paper, different parts of a BCI system and different methods used in each part are reviewed. Neuroimaging devices, with an emphasis on EEG (electroencephalography), are presented and brain activities as well as signal processing methods used in EEG-based BCIs are explained in detail. Current methods and paradigms in BCI based speech communication are considered.

Type: In Honour of Erol Gelenbe
Information: European Review , Volume 25 , Issue 2 , May 2017 , pp. 208 - 230

DOI: https://doi.org/10.1017/S1062798716000569 [Opens in a new window]
Copyright: © Academia Europaea 2016

1. Introduction

The human brain controls the body by passing signals through a peripheral nervous system. This process is started with the human’s intent and continues through peripheral nerves until the destination body part is reached. Recent advances in electrophysiological recording technology offer alternative ways to bypass the peripheral nervous system and control a device directly by the brain. Such a system that is responsible from translating brain activity to device control command is called a Brain Computer Interface (BCI) .Reference Graimann, Allison and Pfurtscheller ¹

A BCI measures the brain activity patterns produced by the user’s intent and uses it for applications such as communication or control. This can be very useful for patients with motor disabilities. However the application of BCI is not limited to people with disabilities. BCI can be used in a variety of applications, from communication tools for Locked-In State (CLIS) patients to video gaming for healthy people.

An overview of a BCI system is given in Figure 1. The device to be controlled may be a wheelchair, a neuroprothesis, a computer, a game console or any other device. In a BCI system, the user represents his/her intention by a mental activity. The resulting brain signals are transmitted to a computer and processed to generate a control signal for the device to be manipulated. The control signal is used to change the state of the device controlled and a feedback about the new state of the device is provided to the user. The loop continues as the user changes his or her mental activity according to the new state of the device.

Figure 1 Overview of a BCI system.

Any BCI system interacts with the user by using different types of feedback signals. Using these feedbacks provides the adaptation of the user to the system and also the system to the user. Subjects learn to regulate their brain activities by using the online feedback signals sent by the BCI system. The information collected may also be used to train the BCI system through machine learning algorithms.

There are different control paradigms that define how the user interacts with the BCI system. In asynchronous control, users can interact with a BCI any time without worrying about timing. However, in a synchronous control system there are specific time intervals that the user should respond to only in these periods. This is the easiest and probably the most common paradigm in BCI applications.

People with motor disabilities can use BCI to control their environment. Controlling the TV, lights, or room temperature can improve the quality of life for these people.Reference Sellers, Vaughan and Wolpaw ² Locomotion is another BCI application that helps people with physical impairments to control their wheelchairs autonomously.Reference Rebsamen, Guan, Zhang, Wang, Teo, Ang and Burdet ³ Improvements in BCI technology have opened a new way to extend BCI use by non-disabled people. BCI provides a new interaction modality to play video games or use computers. In some recent studies, simple video games, such as Pacman, are being controlled by motor imagery.Reference Krepki, Blankertz, Curio and Muller ⁴

Speech communication, which is also called silent speech, is one of the main applications of BCI for people who have communication disabilities. There have been a lot of studies in the field. In these studies, a variety of brain activities have been used to select the target letter from an on-screen display. One of the most popular paradigms for BCI control in communication applications is to use P300 event-related brain potentials. These signals are used in many speech communication studies in BCI.Reference Farwell and Donchin ⁵ ^– Reference Li, Nam, Shadden and Johnson ⁹ Steady State Visual Evoked Potentials (SSVEP) are other types of control signals that have been used frequently for speech communication.Reference Cheng, Gao, Gao and Xu ¹⁰ ^– Reference Segers, Combaz, Manyakov, Chumerin, Vanderperren, Van Huffel and Van Hulle ¹³ Motor imagery signals are also popular in speech communication.Reference Obermaier, Muller and Pfurtscheller ¹⁴ ^– Reference Blankertz, Müller, Krusienski, Schalk, Wolpaw, Schlögl, Pfurtscheller, Millan, Schröder and Birbaumer ¹⁶

The aim of this paper is to review BCI systems with an emphasis on speech communication applications, This application is chosen since there are several studies in the literature and it is most appropriate for showing how different approaches can be used for the same purpose in BCI. In the next section, how to measure brain activity in general is explained. In Section 3, different modalities for EEG signal acquisition are explained since EEG is the most convenient and widely used approach in BCI systems. In Section 4, how to process EEG signals is explained along with information about the toolboxes, software libraries and datasets available for BCI applications. Then, in Section 5, the existing BCI systems for silent speech are summarized and how to measure the performances of such systems is also explained. Finally, conclusions are provided in Section 6.

2. Measuring Brain Activity

Brain activity produces electrophysiological and haemodynamic activities. There are different sensors that can detect different types of activities in the brain. Signal acquisition methods can be categorized in to two main groups: invasive and non-invasive techniques. Table 1, which is an extended version of the table provided in Ref. Reference Nicolas-Alonso and Gomez-Gil17, summarizes different signal acquisition methods.

Table 1 Properties of different signal acquisition methods.

Invasive methods record the brain signals using sensors implanted inside the body. Micro-electrode arrays (MeA) are highly invasive since they are implanted inside the brain.Reference Suner, Fellows, Vargas-Irwin, Nakata and Donoghue ¹⁸ Electrocorticographic (ECoG) activity recording is another invasive approach in which the sensors are placed not inside but on the surface of the brain.Reference Freeman, Holmes, Burke and Vanhatalo ¹⁹ Despite the accurate signal recording ability of the invasive methods, surgery risks and implant-related problems make these methods less preferable for BCI applications. However, there are some studies that used EcoGReference Levine, Huggins, BeMent, Kushwaha, Schuh, Passaro, Rohde and Ross ²⁰ and MeAReference Kennedy, Kirby, Moore, King and Mallory ²¹ for BCI applications.

Non-invasive techniques involve all the methods that record brain activity from outside of the body boundaries. These methods can measure two groups of signals: signals from haemodynamic (blood oxygenation levels) activities Reference Gelenbe, Feng and Krishnan ¹⁰⁵ and signals from electrophysiological (neuronal) activities.Reference Wolpaw, Loeb, Allison, Donchin, do Nascimento, Heetderks, Nijboer, Shain and Turner ²² The first group of signals can be detected with functional magnetic resonance imaging (fMRI) or near-infrared spectroscopy (NIRS) methods. In fMRI, blood oxygenation level-dependent (BOLD) signals associated with cortical activation are being measured. Different oxygen levels of the blood can also be measured by NIRS, which is a portable device with a higher temporal resolution but lower spatial resolution compared with fMRI.Reference Bauernfeind, Leeb, Wriessnegger and Pfurtscheller ²³ There are few studies that use fMRI for BCI applications.Reference Ward and Mazaheri ²⁴ This is because of the difficulty of real time measurement of the brain activity. On the other hand, fNIRS has been used in several BCI studies in recent years, although it has lower spatial resolution.Reference Coyle, Ward and Markham ²⁵ ^, Reference Power, Kushki and Chau ²⁶

Magnetoencephalography (MEG) and electroencephalography (EEG) methods are two basic modalities for measuring brain electrophysiological activities. MEG measures the brain activity with high resolution by measuring the magnetic fields induced by the neuron’s electric current. However, MEG equipment is large and expensive, which makes it a poor choice for BCI applications.Reference Nicolas-Alonso and Gomez-Gil ¹⁷ Some studies have used MEG for BCI applications, though.Reference Lal, Schröder, Hill, Preissl, Hinterberger, Mellinger, Bogdan, Rosenstiel, Hofmann, Birbaumer and Schölkopf ²⁷ ^– Reference Jinyin, Sudre, Xin, Wei, Weber and Bagic ²⁹

EEG also records brain activity by measuring the electrical fields produced by firing neurons. EEG signals have comparatively low spatial resolution but, high temporal resolution with cheap and easy to use equipment.Reference Nicolas-Alonso and Gomez-Gil ¹⁷ These features make the EEG a proper choice for BCI applications. EEG is used as the signal acquisition method in plenty of BCI studies and therefore is explained in detail in the following section.

3. Brain Activities Used in EEG-based BCI

In EEG, sensor electrodes are placed over the head to measure the brain activity. The number of electrodes can vary from 1 to more than 100. To accurately place the electrodes over the head and measure the activities in different parts of the brain, the International 10–20 System is being used. In this system, the distances between the electrodes are 10% or 20% of the front–back or right–left distance of the skull. Each region has a letter corresponding to the brain lobe (F frontal, T temporal, C central, P parietal, and O occipital) and a number specifying the hemisphere location. The electrode placement according to the International 10–20 System is shown in Figure 2.

Figure 2 Electrode locations in the international 10–20 System.

The brain, as a result of conscious or unconscious mechanisms, may generate different brain activity signals. The function of most of these signals is not understood. However, the physiological phenomena of some of these signals are understood and are being used in BCI applications. These signals are P300 evoked potentials, Steady State Visual Evoked Potentials (SSVEP), Slow Cortical Potentials (SCPs) and Sensory-Motor Rhythms and Motor Imagery.

3.1. P300

P300-evoked potentials are positive peaks in the EEG because of infrequent task-related stimuli. These potentials appear in the EEG signal, approximately 300 ms after the stimulus. To evoke P300, the user is given a series of random stimuli. Whenever the target (infrequent) stimulus is observed, P300 appears in the EEG.Reference Farwell and Donchin ⁵ P300 signals are used widely in BCI applications from controlling cursersReference Citi, Poli, Cinel and Sepulveda ³⁰ and robotsReference Bell, Shenoy, Chalodhorn and Rao ³¹ to speech communication.Reference Farwell and Donchin ⁵ ^– Reference Li, Nam, Shadden and Johnson ⁹

3.2. SSVEP

SSVEP signals are oscillations observable at the occipital lobe, because of visual stimulation. The frequencies of the oscillations are the same as the frequencies of the stimulation.Reference Herrmann ³² When the subject focuses on a stimulus, the amplitude in the corresponding frequency bands is increased. SSVEP signals are used mostly in speech communication studies.Reference Cheng, Gao, Gao and Xu ¹⁰ ^– Reference Segers, Combaz, Manyakov, Chumerin, Vanderperren, Van Huffel and Van Hulle ¹³

3.3. SCP

SCP appears as a slow voltage shift in the EEG in the frequency range 1–2 Hz. A decrease in cortical excitability causes negative SCPs and an increase in cortical excitability causes positive SCPs. It is shown in Ref. Reference Hinterberger, Schmidt, Neumann, Mellinger, Blankertz, Curio and Birbaumer33 that users can be trained to control their SCPs by using visual or auditory feedback signals. SCP signals are used to provide communication for ALS patients.Reference Iversen, Ghanayim, Kübler, Neumann, Birbaumer and Kaiser ³⁴

3.4. Sensory-Motor Rhythms and Motor Imagery

According to brain state, different oscillations happen in brain activity. These oscillations are categorized into four different groups based on their frequency band in EEG: delta (1–4 Hz), theta (4–8 Hz), mu (8–13 Hz), beta (13–25 Hz), and gamma (25–40 Hz). Sensory-Motor Rhythms (SMR) refers to oscillatory activities observed in somatosensory and motor areas. The activations in different parts of the body are mapped to different regions in the sensorimotor cortex of the brain. An activity in a particular part of the body causes a decrease in SMR activity in the related brain area. This decrease is called event-related desynchronization (ERD).Reference Pfurtscheller and da Silva ³⁵ Correspondingly, event-related synchronization (ERS) is the increase in SMR activity during the relaxation period after the body movements. These ERD and ERS activities also happen when the subject is imagining the body movement and not actually moving the body. ERS/ERD oscillations can be observed in EEG in beta and mu frequency bands.

The term ‘motor imagery’ refers to moving a body part in imagination without actually moving it. As discussed above, this imagination causes ERD activities in the brain that can be observed in EEG. However, ERD/ERS patterns of all body parts cannot be discriminated in EEG. The produced patterns should be large enough to be distinguished from the background EEG. Currently, there are four types of motor imagery actions that can be detected via EEG. These actions are the movements of the left hand, right hand, feet and tongue. These four motor imagery signals can be used to control BCI after attending sufficient training sections.Reference Schlögl, Lee, Bischof and Pfurtscheller ³⁶

MI related signals are usually recorded by using C3, C4 and Cz electrodes in EEG. Activity invoked by imagining the movement of right hand can be observed mostly in electrode location C3. Left hand movement imagery can be observed mostly in location C4. Movement imageries of left and right feet are not distinguishable since the corresponding motor rhythm origination areas take part in a sulcus (groove in the cerebral cortex). Therefore, the measured potentials on the scalp are spatially close. They both invoke activity mostly over the Cz area.Reference Pfurtscheller, Brunner, Schlogl and da Silva ³⁷ Motor imagery is used in wide range of BCI applications to send the desired command. In Ref. Reference Fabiani, McFarland, Wolpaw and Pfurtscheller38 motor imagery is used for curser movement. It is also used for controlling a wheelchairReference Long, Li, Wang, Yu, Pan and Li ³⁹ and a robot arm.Reference Horki, Solis-Escalante, Neuper and Müller-Putz ⁴⁰ Speech communication is another popular application of motor imagery.Reference Obermaier, Muller and Pfurtscheller ¹⁴ ^- Reference Blankertz, Müller, Krusienski, Schalk, Wolpaw, Schlögl, Pfurtscheller, Millan, Schröder and Birbaumer ¹⁶

4. EEG Signal Processing

Once the brain activity patterns are measured, the next step is to process these signals in order to translate them to the appropriate control commands. This stage has three steps: preprocessing, feature extraction and classification.

4.1. Preprocessing

The goal of the preprocessing step is to improve the quality of the desired patterns in EEG and enhance the signal-to-noise ratio (SNR). There are three main steps in EEG signal preprocessing: referencing, temporal filtering and signal enhancement. Preprocessing also involves the removal of undesired EEG artefacts.

Referencing

The choice of referencing in EEG-based BCI applications can change the results dramatically. There are three main referencing strategies in EEG.

Common reference: In this approach an electrode far from the other electrodes is selected as a reference. This method is widely used in BCI applications.

Average reference: In this method, the average of the activity of all electrodes is subtracted from the measurements.

Current source density (CSD): It is ‘the rate of change of current flowing into and through the scalp’.Reference Al-ani and Trad ⁴¹ This quantity can be derived from EEG data, and it may be interpreted as the potential difference between an electrode and a weighted average of their surrounding electrodes.

Temporal Filtering

Informative brain signals for BCIs are found in the frequencies below 30 Hz. Therefore, all other content with higher frequencies can be removed using a low pass filter. Specific frequency bands may also be selected using band-pass filters.

Signal Enhancement

Because of the volume conduction, potentials from a large area affect the measured potential in one electrode. To estimate the contribution of each electrode, a linear transformation may be applied to the EEG signal. Methods such as Common Average Reference (CAR) and Laplacian filter preserve the original values of electrodes. Some other methods, such as Principal Component Analysis (PCA)Reference Jolliffe ⁴² and Independent Component Analysis (ICA),Reference Comon ⁴³ try to find independent sources without a direct reference to original channels. Some of these methods are explained in further sections.

EEG Artefacts

The EEG signal includes undesired potentials that corrupt the brain signals. These signals are called artefacts and should be cleaned before the processing step. Artefacts may originate from outside the human body (non-physiological) or inside human body (physiological). The first type of artefacts may originate due to recording equipment. There are some activities inside the human body that may also cause artefacts. Ocular artefacts, caused by eye blinking and pupil movement, and muscular artefacts, caused by movement of body parts, are two main groups of physiological artefacts.

Artefacts can be handled by using three different strategies: avoiding, rejecting and removing. Artefacts may be avoided by asking the subjects to avoid moving and eye blinking. Artefacts can also be identified and rejected by an expert in offline applications. An artefact removal approach attempts to detect and remove the artefacts automatically during the signal processing step. Because of the online application of BCI, this approach is the preferred method for BCI studies. In the literature there are several methods for artefact removal, such as linear filtering, linear combination and regression, and Principle component analysis (PCA). Some of these methods are explained in the further sections.

4.2. Feature Extraction

The goal of the signal processing stage of a BCI system is to separate brain patterns related to a subject’s intention from the other patterns. Therefore, we deal with a pattern recognition problem where different patterns should be classified according to their features. Selecting suitable features is a challenging issue. The values recorded from one electrode may contain overlapped signals from different sources. In this section, we briefly discuss most common feature extraction methods for BCI applications.

Time and Frequency Domain Features

Time domain features can be used when event related potentials are present in the signal. The relevant information can be separated based on the EEG signal amplitude by using methods such as band-pass filtering, windowing and down-sampling. Frequency domain features are derived from oscillations in the EEG signal. These features are mostly used in BCI systems based on SSVEP and motor imagery tasks. Different types of timeReference An, Kuang, Guo, Zhao and He ⁴⁴ and frequencyReference Ince, Arica and Tewfik ⁴⁵ ^, Reference Kaiser, Bauernfeind, Kreilinger, Kaufmann, Kübler, Neuper and Müller-Putz ⁴⁶ domain features have been used in BCI studies. In Ref. 47, a fourth-order Butterworth band-pass filter is used to select the frequency bands 6–30 Hz, including mu and beta bands that correspond to limb movements. Then, different frequency bins and time segments are selected as features. Event related desynchronization (ERD) and event-related synchronization (ERS) can also be used as features. ERD and ERS are defined as the percentage of power decrease (ERD) or power increase (ERS) in a defined frequency band in relation to the reference interval with second duration before the verification of an event.Reference Kaiser, Bauernfeind, Kreilinger, Kaufmann, Kübler, Neuper and Müller-Putz ⁴⁶ The band powers can be used as features in the classification algorithms.

Principal Component Analysis (PCA)

Principal Component Analysis (PCA) is an orthogonal linear transformation method that transforms the data to a new basis according to variance of the data. The axes of the new coordinate system, which are called the principal components, are ordered with decreasing variance and the components having high variance are used to represent the data. PCA is commonly used for reducing dimensionality of the data set since correlated variables are also eliminated while projecting data to the lower dimensional space.Reference Jolliffe ⁴² PCA is proven to reduce noise and improve the classification accuracy. This method has been used in several EEG BCI applications. PCA is used to reduce the dimension of the feature space before classificationReference Ince, Arica and Tewfik ⁴⁵ ^, Reference Lin and Hsieh ⁴⁹ ^– Reference Talukdar, Sakib, Pathan and Fattah ⁵¹ and also to remove the EEG artefacts and reduce noise.Reference Boye, Kristiansen, Billinger, do Nascimento and Farina ⁴⁸

Independent Component Analysis (ICA)

ICA is a statistical method that assumes the recorded value of the EEG signal is a combination of independent sources coming from different cognitive activities inside the brain. No further previous information is used about the signals. The recorded EEG signal is expressed by a linear or nonlinear function of the independent sources.Reference Te-Won, Lewicki, Girolami and Sejnowski ⁵² The number of independent components is usually assumed to be fewer than or equal to the number of EEG channels. Like PCA, ICA uses information from channels to identify patterns in brain activity related to different mental tasks. ICA is usually used to remove artefacts from the EEG signal before the classification.Reference Gao, Yang, Lin, Wang and Zheng ⁵³ However, it can also be used as a classification method.Reference Erfanian and Erfani ⁵⁴

Common Spatial Pattern (CSP)

CSP tries to map EEG channels into a subspace where the differences between channels are maximized and the similarities are reduced. The variances of the signals filtered by CSP can be directly used as features for classification.Reference Ramoser, Muller-Gerking and Pfurtscheller ⁵⁵ CSP is designed to solve two-class problems but can be extended to deal with multi-class problems too. This method has been used in many BCI applications, especially for motor imagery tasks.Reference Grosse-Wentrup and Buss ⁵⁶ ^, Reference Ang, Chin, Zhang and Guan ⁵⁷

Genetic Algorithm (GA)

GA is originally an optimization method, which may be used for selecting efficient features.Reference Holland ⁵⁸ In BCI studies, GA has been used to extract the optimal set of features automatically. In this method, first a random population of chromosomes is constructed. Each chromosome has binary value for each feature. Then, in each iteration/generation a portion of chromosomes with best fitting values are selected for the next generation. These chromosomes are then modified by cross-over and mutation operations. In cross-over, two chromosomes are mixed to make new chromosomes. In mutation, random changes happen at chromosomes. Fitness is defined as classification accuracy for each chromosome. When the termination condition is reached, the best chromosome is selected as the feature set for classification. In BCI area, GA is used to select features from the power spectral density (PSD) of each EEG channel during the motor imagery taskReference Corralejo, Hornero and Alvarez ⁵⁹ and to select features for P300 classification.Reference Seno, Matteucci and Mainardi ⁶⁰

AdaBoost

AdaBoost is a machine-learning algorithm first introduced for adaptive boosting.Reference Freund and Schapire ⁶¹ The main idea is to combine weak classifiers to construct a new strong classifier. The features are selected by using the discriminative properties of the target and non-target classes. AdaBoost performs dimension reduction by selecting a subset of features according to the information provided in training data and eliminating the unselected features. In BCI studies, AdaBoost is used for feature selection and also for classification purposes.Reference Yıldırım and Halici ⁵⁰ ^, Reference Boostani and Moradi ⁶²

4.3. Classification

The classification step aims to determine the subject’s intention by using the features provided in the previous stage. These features are used to construct boundaries between classes in the training stage of the classifier and then they are used to discover the intention in the recognition stage. Some of the most popular classification methods used in BCI studies are discussed in the following.

K-Nearest Neighbour Classifier (k-NNC)

In this classifier, the test sample is classified into a class based on the distance between the features of the test sample and samples of different classes. K nearest neighbours (with less distance) are selected from trained samples and the test sample is assigned to the class with more neighbors.Reference Fix and Hodges ⁶³ K-NNC is proven to be efficient when the dimension of the feature vector is low and is not very popular in BCI research.Reference Nicolas-Alonso and Gomez-Gil ¹⁷

Linear Discriminant Analysis (LDA)

LDA is a simple classifier with acceptable accuracy and low computational requirements.Reference Fukunaga ⁶⁴ LDA is designed for classification of two classes but can be extended for multi-classes. For a two-class problem, LDA tries to define a hyperplane in the feature space that distinguishes the classes. This hyperplane is defined by a linear discrimination function. LDA has some drawbacks, such as failing in the presence of strong noise and not being stable. LDA can also be used for dimension reduction for feature extraction before classification. There are some improved algorithms based on LDA, like Fisher LDA (FLDA) and Bayesian LDA (BLDA).Reference Hoffmann, Vesin, Ebrahimi and Diserens ⁶⁵ Because of the ability of online computation, this method has been applied in many BCI studies.Reference Scherer, Müller, Neuper, Graimann and Pfurtscheller ¹⁵ ^, Reference Nicolas-Alonso and Gomez-Gil ¹⁷ ^, Reference Ince, Arica and Tewfik ⁴⁵ ^, Reference Kaiser, Bauernfeind, Kreilinger, Kaufmann, Kübler, Neuper and Müller-Putz ⁴⁶ ^, Reference Garrett, Peterson, Anderson and Thaut ⁶⁶

Support Vector Machine (SVM)

The main idea in SVM is to select the hyperplanes separating the classes in a way that the distance from the nearest training points of different classes is maximized.Reference Cortes and Vapnik ⁶⁷ ^, Reference Burges ⁶⁸ SVM was proposed originally for classification of two classes but it can be extended to multi-classes. It provides simple, robust and fast classification without needing a large training set. This method has been used in many BCI applications, especially to classify P300 evoked potentials.Reference Nicolas-Alonso and Gomez-Gil ¹⁷ ^, Reference Schlögl, Lee, Bischof and Pfurtscheller ³⁶ ^, Reference Yıldırım and Halici ⁵⁰ ^, Reference Garrett, Peterson, Anderson and Thaut ⁶⁶ ^, Reference Blankertz, Curio and Muller ⁶⁹ ^, Reference Rakotomamonjy and Guigue ⁷⁰

Bayesian Statistical Classifier

Bayesian classifier assigns an observed vector x to a class y by maximizing the so-called a posteriori probability P(y|x). For a feature vector x, a posteriori probability is defined by Bayesian rule as P(y|x)=P(y)P(x|y)/P(x), where P(y) is the prior probability of class y and P(x|y) is the likelihood of x given class y.Reference Jensen ⁷¹ The likelihood function is usually assumed to have Gaussian form. The parameters of the Gaussian model are being estimated to achieve maximum likelihood or maximum a posteriori (MAP). The Expectation Maximization (EM) algorithm is usually used to predict these parameters.Reference Moon ⁷²

Although Bayesian classifiers are not very popular in BCI applications, they have been used in some motor imagery and P300 studies.Reference Nicolas-Alonso and Gomez-Gil ¹⁷ ^, Reference Corralejo, Hornero and Alvarez ⁵⁹

Hidden Markov Models (HMM)

An HMM is a stochastic process that has unobserved (hidden) states that can only be observed through another set of stochastic processes that produce the sequence of observed symbols.Reference Rabiner and Juang ⁷³ Hidden Markov Models are well known for their application in temporal pattern recognition such as speech recognition, and they have been used in some BCI.Reference Obermaier, Guger, Neuper and Pfurtscheller ⁷⁴ ^, Reference Zhong and Gosh ⁷⁵

Artificial Neural Network (ANN)

ANNs are non-linear classifiers that have been used in a wide variety of pattern recognition applications. The multilayer perceptron (MLP) is a popular ANN structureReference Rumelhart, Hinton and Williams ⁷⁶ but several other models are also used.Reference Gelenbe and Fourneau ¹⁰⁶ The backpropagation algorithm is the most widely used algorithm for training MLP. In the backpropagation algorithm, a labelled training set is fed to the network and the difference between the output produced by the network and the desired output is computed. Then optimization methods such as gradient descent are used to minimize this difference by changing network weights. The trained network can then be used for classification of the new samples. There are a variety of other NN structures.Reference Gelenbe, Mao and Li ¹⁰⁷ ^, Reference Gelenbe and Timotheou ¹⁰⁸

Neural Networks are used in many BCI applications to classify two or more tasks.Reference Masic and Pfurtscheller ⁷⁷ ^- Reference Hamedi, Salleh, Noor and Mohammad-Rezazadeh ⁸³ They have also been used in the preprocessing step of EEG studies to improve the classification accuracy.Reference Nicolas-Alonso and Gomez-Gil ¹⁷

Deep Neural Networks

Deep neural networks is a recent approach in neural networks, allowing the network to extract much more complex features of the input by using several hidden layers. Each layer has a nonlinear activation function. In this way, deep networks can represent more functions in a compact form. Due to the complexity of the deep networks, training is a difficult task. The algorithms used to train the deep neural networks are called Deep Learning. One approach is to pre-train a deep network work by training each layer in turn. This approach is utilized in stacked autoencoder networks. In a stacked autoencoder, multiple layers of autoencoders are connected to each other consecutively.Reference Gelenbe and Yin ¹¹² The parameters of each layer are learned separately, and the activation units of the layer are computed. Then, the computed neuron outputs are used as raw input for the next layer. Mapping from the last hidden layer to the output can be performed by classification methods such as logistic regression. To improve the results, a fine-tuning by backpropagation can be applied to tune the change of all layers at the same time. Convolutional Neural Network,Reference LeCun, Bottou, Bengio and Haffner ⁸⁴ Stacked auto encoders,Reference Bengio, Lamblin, Popovici and Larochelle ⁸⁵ and Deep Boltzmann MachineReference Hinton ⁸⁶ are the most widely used deep networks for various applications.

Deep neural networks have been used in some recent BCI studies. Convolutional neural networks are used for classification of P300 in Ref. 87. Stacked Auto Encoder and Deep Boltzmann Machine have been used for classification of EEG motor imagery signals.Reference An, Kuang, Guo, Zhao and He ⁴⁴ ^, Reference Junhua and Cichocki ⁸⁸ Convolutional neural networks and Stacked Auto Encoder are used together in Ref. 89 to classify motor imagery signals.

4.4. Tools, Libraries and Datasets

EEGLABReference Delorme and Makeig ⁹⁰ is a Matlab toolbox that may be used to analyse the EEG data and different brain patterns. BCILABReference Kothe and Makeig ⁹¹ is another Matlab toolbox for designing and testing brain computer interface experiments. BioSigReference Vidaurre, Sander and Schlögl ⁹² is an open source software library that provides signal processing algorithms for biomedical applications. To design an experiment with visual or auditory feedbacks and also in connection with the EEG device, PsychtoolboxReference Brainard ⁹³ may be used in Matlab.

There are plenty of online datasets including BCI signals. The most popular datasets are BCI competition datasets. BCI competition 2003Reference Blankertz ⁹⁴ includes several datasets with SCP, P300 and motor imagery signals. BCI Competition IIIReference Blankertz ⁹⁵ includes different P300 and motor imagery datasets with different paradigms. BCI competition IVReference Blankertz ⁹⁶ also has different motor imagery datasets. There are also other online available datasets, such as OpenVIBE dataset, that provide BCI signals.Reference Renard, Lotte, Gibert, Congedo, Maby, Delannoy and Lécuyer ⁹⁷

5. BCI Applications for Silent Speech

Silent Speech applications, which are BCI systems developed for speech communication, do not use voice, but only brain signals. These systems can be categorized in three main groups according to the brain response they use: event-related potentials (ERP), steady state evoked potential (SSVEP), and motor imagery (MI).

5.1. BCIs Based on P300

The best-known representative of this group is the P300 speller. The first speller based on P300 was proposed in Ref. Reference Farwell and Donchin5 and different modifications of it have been studied afterwards.Reference Townsend, LaPallo, Boulay, Krusienski, Frye, Hauser, Schwartz, Vaughan, Wolpaw and Sellers ⁶ ^– Reference Li, Nam, Shadden and Johnson ⁹ ^, Reference Blankertz ⁹⁴ ^, Reference Blankertz, Dornhege, Krauledat, Schroder, Williamson, Murray-Smith and Müller ⁹⁸

In such applications, a matrix of characters is displayed to the subject. The rows and columns of the matrix are intensified sequentially and the subject attends to the target character. A sample character matrix used in P300 spelling paradigm is shown in Figure 3.

Figure 3 P300 Spelling Paradigm character matrix that is displayed to user.Reference Blankertz ⁹⁴

The attention of the subject to an intensified character evokes an enhanced P300 component. A classifier can be trained to detect the target character by using the combination of intensified rows and columns. For signal processing, a time window is usually applied to select the EEG samples related to P300 evoked potentials. Then, different samples are selected from each channel and used as feature vectors for training and testing. In the literature, different classification methods such as SVM, neural networks and Bayesian linear discriminant analysis are used for classification. Different spelling paradigms based on P300 are used in speech communication studies.Reference Townsend, LaPallo, Boulay, Krusienski, Frye, Hauser, Schwartz, Vaughan, Wolpaw and Sellers ⁶ ^– Reference Li, Nam, Shadden and Johnson ⁹

Even though there has been a lot of research in the P300 speller area, the most recent systems are still not applicable for clinical use. The proposed systems lack robustness across the users and the users cannot control the system easily.Reference Cecotti ⁹⁹

5.2. BCIs Based on SSVEP

In this paradigm, flickering lights at different frequencies are used as the stimuli. For each flickering frequency band, Steady-State Visual Evoked Potential (SSVEP) oscillations happen in the visual cortex of the brain with the same frequency band and higher harmonics. By using this fact, it is possible to detect if the subject is looking at the display part with frequency f or 2f, 3f, etc. Several graphical interfaces have been proposed for this purpose. Figure 4 shows a simple form of SSVEP speller. In this example, symbol w is selected in three stages. Each stage is composed of four boxes in the display with different flickering frequencies.Reference Mora-Cortes, Manyakov, Chumerin and Van Hulle ¹⁰⁰ Different forms of SSVEP-based spellers are introduced for speech communication.Reference Cheng, Gao, Gao and Xu ¹⁰ ^– Reference Segers, Combaz, Manyakov, Chumerin, Vanderperren, Van Huffel and Van Hulle ¹³

Figure 4 Character sets based on SSVEP.Reference Mora-Cortes, Manyakov, Chumerin and Van Hulle ¹⁰⁰

Since the SSVEP is embedded in other ongoing brain activity and also noise, the recording interval should be long. Another limitation is that only flickering frequencies within a particular frequency range evoke a reasonable SSVEP response.Reference Jia, Gao, Hong and Gao ¹⁰¹ Further studies are needed to provide a SSVEP-based speller for commercial uses.

5.3. BCIs Based on MI

As described before, moving a body part or imagining it produces neural activity in the motor cortex of the brain that can be detected by EEG. Only a limited number of movements can be detected by using this method. So, a strategy should be used to combine these acts and produce characters.

Different spelling interfaces have been proposed in the literature for MI-based communication.

A speller is presented in Ref. Reference Blankertz, Dornhege, Krauledat, Schroder, Williamson, Murray-Smith and Müller98 by using only two commands: left hand and both feet. In this study, 30 different characters are divided into six hexagons around a circle (Figure 5). By left hand command, the arrow rotates in a clockwise manner showing the selected box, and by feet command the box is selected. A character can be selected in two stages.

Figure 5 MI based speller with six hexagons chosen by two MI tasks.Reference Blankertz, Dornhege, Krauledat, Schroder, Williamson, Murray-Smith and Müller ⁹⁸

Another speller system based on MI systemReference D’albis, Blatt, Tedesco, Sbattella and Matteucci ¹⁰² is shown in Figure 6. This system is composed of four boxes. Twenty-six English characters and a space symbol are grouped in three boxes. The fourth box is used for undo command. The subject selects one of the boxes by imagining the movement of the corresponding body part. They have used left hand, right hand, both hands and both feet movement for command. The desired symbol can be selected in three stages.

Figure 6 MI-based speller with four boxes each chosen by one of four MI tasks.Reference D’albis, Blatt, Tedesco, Sbattella and Matteucci ¹⁰²

5.4. Other Studies Use Similar Interfaces for Selecting Characters

Motor Imagery commands can be used in another manner to produce desired characters. Each character can be coded into a combination of motor imagery acts. In this way there is no need for a graphical interface. To our knowledge, there are only two studies considering this approach in the literature. Both of these studiesReference Palaniappan, Paramesran, Nishida and Saiwaki ¹⁰³ ^, Reference Nicolaou and Georgiou ¹⁰⁴ use motor imagery EEG signals from an EEG dataset recordedReference Keirn and Aunon ¹⁰⁹ with different MI signals recorded separately. In other work, these signals are combined to synthesize new words. An actual experiment for spelling and performance analysis is not performed in these studies.

5.5. Measuring Speller Performance

It is difficult to measure the performance of different BCI speller systems and compare them in a meaningful way. BCI spellers use different spelling paradigms that make them very different from each other. One traditional way to measure performance is to compute typing accuracy. However, this doesn’t provide any information about spelling speed, which is also an important issue, and an information transfer rate (ITR) metric has been proposed to measure the performance of BCI speller applications.Reference Wolpaw, Birbaumer, McFarland, Pfurtscheller and Vaughan ¹¹⁰ ITR is the amount of information communicated per unit time. It takes into account the accuracy, the number of possible selectable commands that the interface supports, and the time required for communicating one command. However, this metric has some drawbacks, such as considering backspace command as a correct transformation of information. In addition, in spellers that use word compilation strategies, this metric can’t provide a fair performance measurement. Another strategy is to use character per minute measure beside bit per minute in ITR.

It has also been proposed to use output character per minute (OCM) measure for spelling performance measurement.Reference Ryan, Frye, Townsend, Berry, Mesa, Gates and Sellers ¹¹¹ OCM is defined as the ratio of the total number of characters in the final text to the total time spent spelling it. This metric can be used to compare different BCI spellers with different paradigms and even different language models.

6. Conclusions

This article has discussed different parts of a Brain Computer Interface (BCI) system from signal acquisition to signal processing. How to measure brain activity in general, and especially how different modalities for EEG signal acquisition can be used, have been explained. Various EEG signal processing techniques used in BCI application for preprocessing, feature extraction and classification have been presented and information has been provided about the toolboxes, software libraries and datasets. Various speech communication systems based on neural activity are explained in detail. Current speech communication studies are discussed and different spelling paradigms and methods are explained.

Speech communication systems can provide a huge benefit for people with severe disabilities. Current spellers mostly use P300, SSVEP and motor imagery paradigms to provide communication. Signal processing and machine learning algorithms for BCI signals have been improved extensively in recent years. The classification performances of these methods are near acceptable. However, designing a spelling paradigm and graphical interface suitable for the daily life use of people with disabilities is still a challenge. Current studies provide slow communication rates that make them less preferable for common utilization. Improvements in signal processing algorithms as well as designing easy to use and fast spellers are needed to make a BCI-based speller in the future. New portable signal acquisition methods can also help a lot to make a usable spelling device.

Acknowledgements

This study is partially supported under projects BAP-03-01-2015-001, BAP-07-02-2015-005, BAP-03-2016-003 and BAP-03-01-2016-002, Middle East Technical University, Ankara.

Yousef Rezaei Tabar received a BS degree in Electrical Engineering from Urmia University, Urmia, Iran, in 2005. He is currently working toward a PhD degree in Biomedical Engineering at Middle East Technical University, Ankara, Turkey. His current research interests include biomedical signal processing, deep learning and brain–computer interfaces.

Ugur Halici is a faculty member of the Department of Electrical and Electronics Engineering, Middle East Technical University, Ankara, Turkey and chairperson of the METU-Hacettepe University joint Neuroscience and Neurotechnology PhD programme that she helped establish in 2014. She has published two internationally co-edited books Intelligent Biometric Techniques in Fingerprint and Face Recognition with OCR Press 1999 and Innovations in ART Neural Networks with Springer Verlag in 2000, two books in Turkish, and over 100 journal/conference papers. Her research interests cover Computer Vision, Machine Learning, 3D modelling, Pattern Recognition, Intelligent Systems and Computational Neuroscience.

References

1. Graimann, B., Allison, B. and Pfurtscheller, G. (2010) Brain–computer interfaces: a gentle introduction. Brain-Computer Interfaces (Berlin Heidelberg: Springer), pp. 1–27.Google Scholar

2. Sellers, E.W., Vaughan, T.M. and Wolpaw, J.R. (2010) A brain-computer interface for long-term independent home use. Amyotrophic Lateral Sclerosis, 11(5), pp. 449–455.CrossRef Google Scholar PubMed

3. Rebsamen, B., Guan, C., Zhang, H., Wang, C., Teo, C., Ang, M.H. Jr and Burdet, E. (2010) A brain controlled wheelchair to navigate in familiar environments. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 18(6), pp. 590–598.CrossRef Google Scholar PubMed

4. Krepki, R., Blankertz, B., Curio, G. and Muller, K.-R. (2007) The Berlin Brain-Computer Interface (BBCI) towards a new communication channel for online control in gaming applications. Multimedia Tools and Applications, 33, pp. 73–90.CrossRef Google Scholar

5. Farwell, L.A. and Donchin, E. (1988) Talking off the top of your head: toward a mental prosthesis utilizing event-related brain potentials. Electroencephalography and Clinical Neurophysiology, 70(6), pp. 510–523.CrossRef Google Scholar

6. Townsend, G., LaPallo, B.K., Boulay, C.B., Krusienski, D.J., Frye, G.E., Hauser, C.K., Schwartz, N.E., Vaughan, T.M., Wolpaw, J.R. and Sellers, E.W. (2010) A novel P300-based brain-computer interface stimulus presentation paradigm: Moving beyond rows and columns. Clinical Neurophysiology, 121, pp. 1109–1120.CrossRef Google Scholar PubMed

7. Ahi, S.T., Kambara, H. and Koike, Y. (2011) A dictionary-driven P300 speller with a modified interface. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 19, pp. 6–14.CrossRef Google Scholar PubMed

8. Takano, K., Komatsu, T., Hata, N., Nakajima, Y. and Kansaku, K. (2009) Visual stimuli for the P300 brain–computer interface: a comparison of white/gray and green/blue flicker matrices. Clinical Neurophysiology, 120, pp. 1562–1566.CrossRef Google Scholar PubMed

9. Li, Y., Nam, C.S., Shadden, B.B. and Johnson, S.L. (2011) A P300-based brain–computer interface: effects of interface type and screen size. International Journal of Human–Computer Interactface, 27(1), pp. 52–68.CrossRef Google Scholar

10. Cheng, M., Gao, X., Gao, S. and Xu, D. (2002) Design and implementation of a brain-computer interface with high transfer rates. IEEE Transactions on Biomedical Engineering, 49(10), pp. 1181–1186.CrossRef Google Scholar PubMed

11. Trejo, L.J., Rosipal, R. and Matthews, B. (2006) Brain-computer interfaces for 1-D and 2-D cursor control: designs using volitional control of the EEG spectrum or steady-state visual evoked potentials. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 14(2), pp. 225–229.CrossRef Google Scholar PubMed

12. Allison, B.Z., McFarland, D.J., Schalk, G., Zheng, S.D., Jackson, M.M. and Wolpaw, J.R. (2008) Towards an independent brain - computer interface using steady state visual evoked potentials. Clinical Neurophysiology :Official Journal of the International Federation of Clinical Neurophysiology, 119(2), pp. 399–408.CrossRef Google Scholar PubMed

13. Segers, H., Combaz, A., Manyakov, N.V., Chumerin, N., Vanderperren, K., Van Huffel, S. and Van Hulle, M.M. (2011) Steady State Visual Evoked Potential (SSVEP)-based brain spelling system with synchronous and asynchronous typing modes, In 15th Nordic-Baltic Conference on Biomedical Engineering and Medical Physics (NBC 2011), Aalborg, Denmark, 14–17, pp. 164–167.Google Scholar

14. Obermaier, B., Muller, G.R. and Pfurtscheller, G. (2003) Virtual keyboard controlled by spontaneous EEG activity. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 11, pp. 422–426.CrossRef Google Scholar PubMed

15. Scherer, R., Müller, G.R., Neuper, C., Graimann, B. and Pfurtscheller, G. (2004) An asynchronously controlled EEG-based virtual keyboard: improvement of the spelling rate. IEEE Transactions on Biomedical Engineering, 51(6), pp. 979–984.CrossRef Google Scholar PubMed

16. Blankertz, B., Müller, K.R., Krusienski, D.J., Schalk, G., Wolpaw, J.R., Schlögl, A., Pfurtscheller, G., Millan, J.R., Schröder, M. and Birbaumer, N. (2006) The BCI competition III: validating alternative approaches to actual BCI problems. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 14(2), pp. 153–159.CrossRef Google Scholar PubMed

17. Nicolas-Alonso, L.F. and Gomez-Gil, J. (2012) Brain computer interfaces, a review. Sensors, 12(2), pp. 1211–1279.CrossRef Google Scholar PubMed

18. Suner, S., Fellows, M.R., Vargas-Irwin, C., Nakata, G.K. and Donoghue, J.P. (2005) Reliability of signals from a chronically implanted, silicon-based electrode array in non-human primate primary motor cortex. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 13, pp. 524–541.CrossRef Google Scholar PubMed

19. Freeman, W.J., Holmes, M.D., Burke, B.C. and Vanhatalo, S. (2003) Spatial spectra of scalp EEG and EMG from awake humans. Clinical Neurophysiology, 114, pp. 1053–1068.CrossRef Google Scholar PubMed

20. Levine, S.P., Huggins, J.E., BeMent, S.L., Kushwaha, R.K., Schuh, L.A., Passaro, E.A., Rohde, M.M. and Ross, D.A. (1999) Identification of electrocorticogram patterns as the basis for a direct brain interface. Journal of Clinical Neurophysiology, 16, pp. 439–447.CrossRef Google Scholar PubMed

21. Kennedy, P.R., Kirby, M.T., Moore, M.M., King, B. and Mallory, A. (2004) Computer control using human intracortical local field potentials. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 12, pp. 339–344.CrossRef Google Scholar PubMed

22. Wolpaw, J.R., Loeb, G.E., Allison, B.Z., Donchin, E., do Nascimento, O.F., Heetderks, W.J., Nijboer, F., Shain, W.G. and Turner, J.N., BCI Meeting (2005) workshop on signals and recording methods. IEEE Transactions on Neural Systems and Rehabilitation and Engineering, 14, pp. 138–141.CrossRef Google Scholar

23. Bauernfeind, G., Leeb, R., Wriessnegger, S.C. and Pfurtscheller, G. (2008) Development, set-up and first results for a one-channel near-infrared spectroscopy system. Biomedizinische Technik, 53, pp. 36–43.CrossRef Google Scholar PubMed

24. Ward, B.D. and Mazaheri, Y. (2008) Information transfer rate in fMRI experiments measured using mutual information theory. Journal of Neuroscience Methods, 167, pp. 22–30.CrossRef Google Scholar PubMed

25. Coyle, S.M., Ward, T.E. and Markham, C.M. (2007) Brain-computer interface using a simplified functional near-infrared spectroscopy system. Journal of Neural Engineering, 4(3), p. 219.CrossRef Google Scholar

26. Power, S.D., Kushki, A. and Chau, T. (2011) Towards a system-paced near-infrared spectroscopy brain-computer interface: differentiating prefrontal activity due to mental arithmetic and mental singing from the no-control state. Journal of Neural Engineering, 8, 066004.CrossRef Google Scholar PubMed

27. Lal, T.N., Schröder, M., Hill, N.J., Preissl, H., Hinterberger, T., Mellinger, J., Bogdan, M., Rosenstiel, W., Hofmann, T., Birbaumer, N. and Schölkopf, B. (2005) A Brain Computer Interface with Online Feedback Based on Magnetoencephalography. In Proceedings of the 22nd International Conference on Machine Learning (ICML’ 05), Bonn, Germany, pp. 7–11, 465–472.Google Scholar

28. Mellinger, J., Schalk, G., Braun, C., Preissl, H., Rosenstiel, W., Birbaumer, N. and Kübler, A. (2007) An MEG-based brain-computer interface (BCI). Neuroimage, 36, pp. 581–593.CrossRef Google Scholar PubMed

29. Jinyin, Z., Sudre, G., Xin, L., Wei, W., Weber, D.J. and Bagic, A. (2011) Clustering linear discriminant analysis for MEG-Based brain computer interfaces. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 19, pp. 221–231.Google Scholar

30. Citi, L., Poli, R., Cinel, C. and Sepulveda, F. (2008) P300-based BCI mouse with genetically-optimized analogue control. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 16.CrossRef Google Scholar PubMed

31. Bell, C.J., Shenoy, P., Chalodhorn, R. and Rao, R.P.N. (2008) Control of a humanoid robot by a noninvasive brain-computer interface in humans. Journal of Neural Engineering, 5, pp. 214–220.CrossRef Google Scholar PubMed

32. Herrmann, C.S. (2001) Human EEG responses to 1–100 Hz flicker: resonance phenomena in visual cortex and their potential correlation to cognitive phenomena. Experimental Brain Research, 137(3), pp. 346–353.CrossRef Google Scholar PubMed

33. Hinterberger, T., Schmidt, S., Neumann, N., Mellinger, J., Blankertz, B., Curio, G. and Birbaumer, N. (2004) Brain-computer communication and slow cortical potentials. IEEE Transactions on Biomedical Engineering, 51, pp. 1011–1018.CrossRef Google Scholar PubMed

34. Iversen, I.H., Ghanayim, N., Kübler, A., Neumann, N., Birbaumer, N. and Kaiser, J. (2008) A brain-computer interface tool to assess cognitive functions in completely paralyzed patients with amyotrophic lateral sclerosis. Clinical Neurophysiology, 119, pp. 2214–2223.CrossRef Google Scholar PubMed

35. Pfurtscheller, G. and da Silva, F.H.L. (1999) Event-related EEG/MEG synchronization and desynchronization: basic principles. Clinical Neurophysiology, 110(11), pp. 1842–1857.CrossRef Google Scholar PubMed

36. Schlögl, A., Lee, F., Bischof, H. and Pfurtscheller, G. (2005) Characterization of four-class motor imagery EEG data for the BCI-competition 2005. Journal of Neural Engineering, 2, pp. L14–L22.CrossRef Google Scholar PubMed

37. Pfurtscheller, G., Brunner, C., Schlogl, A. and da Silva, F.H.L. (2006) Mu rhythm (de)synchronization and EEG single-trial classification of different motor imagery tasks. NeuroImage, 31, pp. 153–159.CrossRef Google Scholar PubMed

38. Fabiani, G.E., McFarland, D.J., Wolpaw, J.R. and Pfurtscheller, G. (2004) Conversion of EEG activity into cursor movement by a brain-computer interface (BCI). IEEE Transactions on Neural Systems and Rehabilitation Engineering, 12, pp. 331–338.CrossRef Google Scholar PubMed

39. Long, J.Y., Li, Y.Q., Wang, H.T., Yu, T.Y., Pan, J.H. and Li, F. (2012) A hybrid brain computer interface to control the direction and speed of a simulated or real wheelchair. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 20(5), pp. 720–729.CrossRef Google Scholar PubMed

40. Horki, P., Solis-Escalante, T., Neuper, C. and Müller-Putz, G. (2011) Combined motor imagery and SSVEP based BCI control of a 2 DoF artificial upper limb. Medical and Biological Engineering and Computing, 49(5), pp. 567–577.CrossRef Google Scholar PubMed

41. Al-ani, T. and Trad, D. (2010) signal processing and classification approaches for brain-computer interface. Intelligent and Biosensors. V.S. Somerset, (Ed.), (InTech), pp. 25–66.Google Scholar

42. Jolliffe, I. (2002) Principal Component Analysis (New York: Springer-Verlag), DOI: 10.1007/b98835.Google Scholar

43. Comon, P. (1994) Independent component analysis: a new concept? Signal Processing, 36(3), pp. 287–314.CrossRef Google Scholar

44. An, X., Kuang, D., Guo, X., Zhao, Y. and He, L. (2014) A deep learning method for classification of EEG data based on motor imagery. Intelligent Computing in Bioinformatics, pp. 203–210.CrossRef Google Scholar

45. Ince, N.F., Arica, S. and Tewfik, A. (2006) Classification of single trial motor imagery EEG recordings with subject adapted non-dyadic arbitrary time–frequency tilings. Journal of Neural Engineering, 3, 3.CrossRef Google Scholar PubMed

46. Kaiser, V., Bauernfeind, G., Kreilinger, A., Kaufmann, T., Kübler, A., Neuper, C. and Müller-Putz, G.R. (2014) Cortical effects of user training in a motor imagery based brain computer interface measured by fNIRS and EEG. NeuroImage, 85(1), pp. 432–444.CrossRef Google Scholar

47. Hwang, H.J., Kwon, K. and Im, C.H. (2009) Neurofeedback-based motor imagery training for brain-computer interface (BCI). Journal of Neuroscience Methods, 179(1), pp. 150–156.CrossRef Google Scholar PubMed

48. Boye, A.T., Kristiansen, U.Q., Billinger, M., do Nascimento, O.F. and Farina, D. (2008) Identification of movement-related cortical potentials with optimized spatial filtering and principal component analysis. Biomedical Signal Process . Control, 3, pp. 300–304.Google Scholar

49. Lin, C.J. and Hsieh, M.H. (2009) Classification of mental task from EEG data using neural networks based on particle swarm optimization. Neurocomputing, 72, pp. 1121–1130.CrossRef Google Scholar

50. Yıldırım, A. and Halici, U. (2013) Analysis of dimension reduction by PCA and AdaBoost on spelling paradigm EEG data Sixth International Conference on Biomedical Engineering and Informatics.CrossRef Google Scholar

51. Talukdar, M.T., Sakib, S.K., Pathan, N.S. and Fattah, S.A. (2014) Motor imagery EEG signal classification scheme based on autoregressive reflection coefficients. Informatics, Electronics & Vision (ICIEV), International Conference on. IEEE.CrossRef Google Scholar

52. Te-Won, L., Lewicki, M.S., Girolami, M. and Sejnowski, T.J. (1999) Blind source separation of more sources than mixtures using overcomplete representations. IEEE Signal Processing Letters, 6, pp. 87–90.CrossRef Google Scholar

53. Gao, J., Yang, Y., Lin, P., Wang, P. and Zheng, C. (2010) Automatic removal of eye-movement and blink artifacts from EEG signals. Brain Topography, 23, pp. 105–114.CrossRef Google Scholar PubMed

54. Erfanian, A. and Erfani, A. (2004) ICA-based classification scheme for EEG-based brain-computer interface: the role of mental practice and concentration skills. In Engineering in Medicine and Biology Society, IEMBS'04. 26th Annual International Conference of the IEEE, 1, pp. 235–238.CrossRef Google Scholar

55. Ramoser, H., Muller-Gerking, J. and Pfurtscheller, G. (2000) Optimal spatial filtering of single trial EEG during imagined hand movement. IEEE Transactions on Rehabilitation Engineering, 8(4), pp. 441–446.CrossRef Google Scholar PubMed

56. Grosse-Wentrup, M. and Buss, M. (2008) Multiclass common spatial patterns and information theoretic feature extraction. IEEE Transactions on Biomedical Engineering, 55(8), pp. 1991–2000.CrossRef Google Scholar PubMed

57. Ang, K.K., Chin, Z.Y., Zhang, H. and Guan, C. (2008) Filter bank common spatial pattern (FBCSP) in brain-computer interface. In Neural Networks, IJCNN. IEEE World Congress on Computational Intelligence, pp. 2390-2397.Google Scholar

58. Holland, J.H. (1975) Adaption in Natural and Artificial Systems (Cambridge, MA: MIT Press).Google Scholar

59. Corralejo, R., Hornero, R. and Alvarez, D. (2011) Feature selection using a genetic algorithm in a motor imagery-based Brain Computer Interface. Engineering in Medicine and Biology Society, EMBC, 2011 Annual International Conference of the IEEE.CrossRef Google Scholar

60. Seno, D.B., Matteucci, M. and Mainardi, L. (2008) A genetic algorithm for automatic feature extraction in P300 detection. In Proceedings of the IEEE International Joint Conference on Neural Networks (IJCNN’08), Hong Kong, China, pp. 3145–3152.Google Scholar

61. Freund, R.E. and Schapire, Y. (1997) A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1), pp. 119–139.CrossRef Google Scholar

62. Boostani, R. and Moradi, M.H. (2004) A new approach in the BCI research based on fractal dimension as feature and Adaboost as classifier. Journal of Neural Engineering, 1(4), p. 212.CrossRef Google Scholar PubMed

63. Fix, E. and Hodges, J.L. (1951) Discriminatory analysis-nonparametric discrimination: consistency properties. Technical Report 4. USAF School of Aviation Medicine, Randolph Field, TX.CrossRef Google Scholar

64. Fukunaga, K. (1972) Introduction to Statistical Pattern Recognition (Oxford, UK: Clarendon).Google Scholar

65. Hoffmann, U., Vesin, J.M., Ebrahimi, T. and Diserens, K. (2008) An efficient P300-based brain-computer interface for disabled subjects. Journal of Neuroscience Methods, 167, pp. 115–125.CrossRef Google Scholar PubMed

66. Garrett, D., Peterson, D.A., Anderson, C.W. and Thaut, M.H. (2003) Comparison of linear, nonlinear, and feature selection methods for EEG signal classification. IEEE Transactions of Neural Systems and Rehabilitation Engineering, 11, pp. 141–144.CrossRef Google Scholar PubMed

67. Cortes, C. and Vapnik, V. (1995) Support-vector networks. Machine Learning, 20, 273–297.CrossRef Google Scholar

68. Burges, C.J.C. (1998) A tutorial on support vector machines for pattern recognition. Data Mining and Knowledge Discovery, 2, pp. 121–167.CrossRef Google Scholar

69. Blankertz, B., Curio, G. and Muller, K.R. (2002) Classifying single trial EEG: towards brain computer interfacing. Advances in Neural Information Processing Systems, 14, pp. 157–164.Google Scholar

70. Rakotomamonjy, A. and Guigue, V. (2008) BCI 2008, competition III: data set II—Ensemble of SVMs for BCI p300 speller. IEEE Transactions on Biomedical Engineering, 55(3), pp. 1147–1154.CrossRef Google Scholar

71. Jensen, F.V. (2001) Bayesian Networks and Decision Graphs (Berlin: Springer).CrossRef Google Scholar

72. Moon, T.K. (1996) The expectation-maximization algorithm. Signal Processing Magazine, IEEE, 13(6), pp. 47–60.CrossRef Google Scholar

73. Rabiner, L.R. and Juang, B.H. (1986) An introduction to hidden Markov models. IEEE ASSP Magazine, pp. 4–16.CrossRef Google Scholar

74. Obermaier, B., Guger, C., Neuper, C. and Pfurtscheller, G. (2001) Hidden Markov models for online classification of single trial EEG data. Pattern Recognition Letters, 22(12), pp. 1299–1309.CrossRef Google Scholar

75. Zhong, S. and Gosh, J. (2002) HMMs and coupled HMMs for multi-channel EEG classification. Proceedings of the IEEE International Joint Conference on. Neural Networks, 2, pp. 1154–1159.Google Scholar

76. Rumelhart, D.E., Hinton, G.E. and Williams, R.J. (1986) Learning internal representations by error propagation. Parallel Distributed Processing, 1, pp. 151–193.Google Scholar

77. Masic, N. and Pfurtscheller, G. (1993) Neural network based classification of single-trial EEG data. Artificial Intelligence in Medicine, 5(6), pp. 503–513.CrossRef Google Scholar PubMed

78. Anderson, C.W., Devulapalli, S.V. and Stolz, E.A. (1995) Determining mental state from EEG signals using parallel implementations of neural networks. Proceedings of the IEEE Workshop on Neural Networks for Signal in Processing, pp. 475–483.Google Scholar

79. Felzer, T. and Freisieben, B. (2003) Analyzing EEG signals using the probability estimating guarded neural classifier. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 11(4), pp. 361–371.CrossRef Google Scholar PubMed

80. Cecotti, H. and Graser, A. (2008) Time delay neural network with Fourier transform for multiple channel detection of steady-state visual evoked potential for brain-computer interfaces. Proceedings of the European Signal Processing Conference.Google Scholar

81. Haselsteiner, E. and Pfurtscheller, G. (2000) Using time dependent neural networks for EEG classification. IEEE Transactions on Rehabilitation Engineering, 8(4), pp. 457–463.CrossRef Google Scholar PubMed

82. Masic, N., Pfurtscheller, G. and Flotzinger, D. (2008) Neural network-based predictions of hand movements using simulated and real EEG data. Neurocomputing, 7(3), pp. 259–274.CrossRef Google Scholar

83. Hamedi, M., Salleh, S.H., Noor, A.M. and Mohammad-Rezazadeh, I. (2014) Neural network-based three-class motor imagery classification using time-domain features for BCI applications. Region 10 Symposium.CrossRef Google Scholar

84. LeCun, Y., Bottou, L., Bengio, Y. and Haffner, P. (1998) Gradient-based learning applied to document recognition. Proceedings of IEEE, 86(11), pp. 2278–2324.CrossRef Google Scholar

85. Bengio, Y., Lamblin, P., Popovici, D. and Larochelle, H. (2007) Greedy layer-wise training of deep networks. Advances in Neural Information Processing Systems 19 (NIPS’06), pp. 153–160.Google Scholar

86. Hinton, G.E. (2002) Training products of experts by minimizing contrastive divergence. Neural Computation, 14(8), pp. 1711–1800.CrossRef Google Scholar PubMed

87. Cecotti, H. and Axel, G. (2011) Convolutional neural networks for P300 detection with application to brain-computer interfaces. IEEE Transactions on Pattern Analysis and Machine Intelligence, 33(3), pp. 433–445.CrossRef Google Scholar PubMed

88. Junhua, L. and Cichocki, A. (2014) Deep learning of multifractal attributes from motor imagery induced EEG. Neural Information Processing (Springer International Publishing).Google Scholar

89. Rezaeitabar, Y. and Halici, U. (2016) A novel deep learning approach for classification of EEG motor imagery signals. Journal of Neural Engineering, in press.CrossRef Google Scholar

90. Delorme, A. and Makeig, S. (2004) EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics. Journal of Neuroscience Methods, 134, pp. 9–21.CrossRef Google Scholar PubMed

91. Kothe, C.A. and Makeig, S. (2013) BCILAB: a platform for brain–computer interface development. Journal of Neural Engineering, 10(5), 056014.CrossRef Google Scholar PubMed

92. Vidaurre, C., Sander, T.H. and Schlögl, A. (2011) BioSig: the free and open source software library for biomedical signal processing. Computational Intelligence and Neuroscience, 935364. doi: 10.1155/2011/935364. pmid:21437227.CrossRef Google Scholar

93. Brainard, D.H. (1997) The psychophysics toolbox. Spatial Vision, 10, pp. 433–436.CrossRef Google Scholar PubMed

94. Blankertz, B. (2003) BCI Competition II–P300 speller dataset webpage. Online: http://www.bbci.de/competition/ii/,http://www.bbci.de/competition/ii/albany_desc/albany_desc_ii.html.Google Scholar

95. Blankertz, B. BCI Competition III– P300 speller dataset webpage. Online: http://www.bbci.de/competition/iii/, Documentation: http://www.bbci.de/competition/iii/desc_II.pdf, 2005, Retrieved 20/11/2010.Google Scholar

96. Blankertz, B. (2008) BCI Competition IV, Fraunhofer FIRST (IDA), http://ida. first.fraunhofer.de/projects/bci/competition_iv.Google Scholar

97. Renard, Y., Lotte, F., Gibert, G., Congedo, M., Maby, E., Delannoy, V. and Lécuyer, A. (2010) OpenViBE: an open-source software platform to design, test, and use brain-computer interfaces in real and virtual environments. Presence: Teleoperators and Virtual Environments, 19(1), pp. 35–53.CrossRef Google Scholar

98. Blankertz, B., Dornhege, G., Krauledat, M., Schroder, M., Williamson, J., Murray-Smith, R. and Müller, K.-R. (2006) The Berlin brain-computer interface presents the novel mental typewriter Hex-o-Spell. In Proceedings of the Third International Brain Computer Interface Workshop and Training Course, Graz, Austria, pp. 108–109.Google Scholar

99. Cecotti, H. (2011) Spelling with non-invasive Brain–Computer Interfaces – current and future trends. Journal of Physiology-Paris, 105(1–3), pp. 106–114.CrossRef Google Scholar PubMed

100. Mora-Cortes, A., Manyakov, N.V., Chumerin, N. and Van Hulle, M.M. (2014) Language model applications to spelling with Brain-Computer Interfaces. Sensors (Basel), 14(4), pp. 5967–5993.CrossRef Google Scholar PubMed

101. Jia, C., Gao, X., Hong, B. and Gao, S. (2011) Frequency and phase mixed coding in SSVEP-based brain-computer interface. IEEE Transactions on Biomedical Engineering, 58, pp. 200–206.Google Scholar PubMed

102. D’albis, T., Blatt, R., Tedesco, R., Sbattella, L. and Matteucci, M. (2012) A predictive speller controlled by a brain-computer interface based on motor imagery. ACM Transactions on Computer–Human Interactions, 19, pp. 1–25.CrossRef Google Scholar

103. Palaniappan, R., Paramesran, R., Nishida, S. and Saiwaki, N. (2002) A new brain-computer interface design using fuzzy ARTMAP. IEEE Transactions on Neural Systems and Rehabilitation Engineering, 10(3), pp. 140–148.CrossRef Google Scholar PubMed

104. Nicolaou, N. and Georgiou, J. (2008) Towards a Morse code-based non-invasive thought-to-speech converter. In BIOSTEC (Selected Papers), pp. 123–135.Google Scholar

105. Gelenbe, E., Feng, Y. and Krishnan, K.R.R. (1996) Neural network methods for volumetric magnetic resonance imaging of the human brain. Proceedings of the IEEE, 84(10), pp. 1488–1496.CrossRef Google Scholar

106. Gelenbe, E. and Fourneau, J.M. (1999) Random neural networks with multiple classes of signals. Neural Computation, 11(4), pp. 953–963.CrossRef Google Scholar PubMed

107. Gelenbe, E., Mao, Z.-H. and Li, Y.-D. (1999) Function approximation with spiked random networks. IEEE Transactions on Neural Networks, 10(1), pp. 3–9.CrossRef Google Scholar PubMed

108. Gelenbe, E. and Timotheou, S. (2008) Random neural networks with synchronized interactions. Neural Computation, 20(9), pp. 2308–2324.CrossRef Google Scholar PubMed

109. Keirn, Z.A. and Aunon, J.I. (1990) A new mode of communication between man and his surroundings. IEEE Transactions on Biomedical Engineering, 37, 1209–1214.CrossRef Google Scholar PubMed

110. Wolpaw, J.R., Birbaumer, N., McFarland, D.J., Pfurtscheller, G. and Vaughan, T.M. (2002) Brain computer interfaces for communication and control. Clinical Neurophysiology, 113, pp. 767–791.CrossRef Google Scholar PubMed

111. Ryan, D.B., Frye, G.E., Townsend, G., Berry, D.R., Mesa, G.S., Gates, N.A. and Sellers, E.W. (2010) Predictive spelling with a P300-based brain-computer interface: increasing the rate of communication. International Journal of Human–Computer Interactions, 27, pp. 69–84.CrossRef Google Scholar

112. Gelenbe, E. and Yin, Y. (2016) Deep learning with random neural networks. IJCNN 2016, IEEE World Congress on Computational Intelligence, Vancouver, BC, July 2016.CrossRef Google Scholar

Figure 1 Overview of a BCI system.

Table 1 Properties of different signal acquisition methods.

Figure 2 Electrode locations in the international 10–20 System.

Figure 3 P300 Spelling Paradigm character matrix that is displayed to user.94

Figure 4 Character sets based on SSVEP.100

Figure 5 MI based speller with six hexagons chosen by two MI tasks.98

Figure 6 MI-based speller with four boxes each chosen by one of four MI tasks.102

Article contents

Brain Computer Interfaces for Silent Speech

Abstract

1. Introduction

2. Measuring Brain Activity

3. Brain Activities Used in EEG-based BCI

3.1. P300

3.2. SSVEP

3.3. SCP

3.4. Sensory-Motor Rhythms and Motor Imagery

4. EEG Signal Processing

4.1. Preprocessing

Referencing

Temporal Filtering

Signal Enhancement

EEG Artefacts

4.2. Feature Extraction

Time and Frequency Domain Features

Principal Component Analysis (PCA)

Independent Component Analysis (ICA)

Common Spatial Pattern (CSP)

Genetic Algorithm (GA)

AdaBoost

4.3. Classification

K-Nearest Neighbour Classifier (k-NNC)

Linear Discriminant Analysis (LDA)

Support Vector Machine (SVM)

Bayesian Statistical Classifier

Hidden Markov Models (HMM)

Artificial Neural Network (ANN)

Deep Neural Networks

4.4. Tools, Libraries and Datasets

5. BCI Applications for Silent Speech

5.1. BCIs Based on P300

5.2. BCIs Based on SSVEP

5.3. BCIs Based on MI

5.4. Other Studies Use Similar Interfaces for Selecting Characters

5.5. Measuring Speller Performance

6. Conclusions

Acknowledgements

References

Save article to Kindle

Save article to Dropbox

Save article to Google Drive

Reply to: Submit a response

Your details

You have entered the maximum number of contributors

Conflicting interests