nips nips2007 nips2007-173 knowledge-graph by maker-knowledge-mining

173 nips-2007-Second Order Bilinear Discriminant Analysis for single trial EEG analysis


Source: pdf

Author: Christoforos Christoforou, Paul Sajda, Lucas C. Parra

Abstract: Traditional analysis methods for single-trial classification of electroencephalography (EEG) focus on two types of paradigms: phase locked methods, in which the amplitude of the signal is used as the feature for classification, e.g. event related potentials; and second order methods, in which the feature of interest is the power of the signal, e.g. event related (de)synchronization. The procedure for deciding which paradigm to use is ad hoc and is typically driven by knowledge of the underlying neurophysiology. Here we propose a principled method, based on a bilinear model, in which the algorithm simultaneously learns the best first and second order spatial and temporal features for classification of EEG. The method is demonstrated on simulated data as well as on EEG taken from a benchmark data used to test classification algorithms for brain computer interfaces. 1 1.1

Reference: text


Summary: the most important sentenses genereted by tfidf model

sentIndex sentText sentNum sentScore

1 edu Abstract Traditional analysis methods for single-trial classification of electroencephalography (EEG) focus on two types of paradigms: phase locked methods, in which the amplitude of the signal is used as the feature for classification, e. [sent-7, score-0.484]

2 event related potentials; and second order methods, in which the feature of interest is the power of the signal, e. [sent-9, score-0.115]

3 The procedure for deciding which paradigm to use is ad hoc and is typically driven by knowledge of the underlying neurophysiology. [sent-12, score-0.043]

4 Here we propose a principled method, based on a bilinear model, in which the algorithm simultaneously learns the best first and second order spatial and temporal features for classification of EEG. [sent-13, score-0.507]

5 The method is demonstrated on simulated data as well as on EEG taken from a benchmark data used to test classification algorithms for brain computer interfaces. [sent-14, score-0.237]

6 1 Introduction Utility of discriminant analysis in EEG Brain computer interface (BCI) algorithms [1][2][3][4] aim to decode brain activity, on a singletrial basis, in order to provide a direct control pathway between a user’s intentions and a computer. [sent-16, score-0.218]

7 Single-trial discriminant analysis has also been used as a research tool to study the neural correlates of behavior. [sent-19, score-0.155]

8 By extracting activity that differs maximally between two experimental conditions, the typically low signal-noise ratio of EEG can be overcome. [sent-20, score-0.154]

9 The resulting discriminant components can be used to identify the spatial origin and time course of stimulus/response specific activity, while the improved SNR can be leveraged to correlate variability of neural activity across trials to behavioral variability and behavioral performance [7, 5]. [sent-21, score-0.582]

10 In essence, discriminant analysis adds to the existing set of multi-variate statistical tools commonly used in neuroscience research (ANOVA, Hoteling T 2 , Wilks’ Λ test). [sent-22, score-0.155]

11 2 Linear and quadratic approaches In EEG the signal-to-noise ratio of individual channels is low, often at -20dB or less. [sent-24, score-0.232]

12 To overcome this limitation, all analysis methods perform some form of averaging, either across repeated trials, across time, or across electrodes. [sent-25, score-0.117]

13 Traditional EEG analysis averages signals across many repeated trials for individual electrodes. [sent-26, score-0.115]

14 A conventional method is to average the measured potentials following stimulus presentation, thereby canceling uncorrelated noise that is not reproducible from one trial to the next. [sent-27, score-0.202]

15 This averaged activity, called an event related potential (ERP), captures activity that is time-locked to the stimulus presentation but cancels evoked oscillatory activity that is not locked in phase to the timing of the stimulus. [sent-28, score-0.986]

16 Alternatively, many studies compute the oscillatory activity in specific frequency bands by filtering and squaring the signal prior to averaging. [sent-29, score-0.479]

17 Thus, changes in oscillatory activity are termed event related synchronization or desynchronization (ERS/ERD). [sent-30, score-0.308]

18 First order methods include temporal filtering + thresholding [2], hierarchical linear classifiers [5] and bilinear discriminant analysis [8, 9]. [sent-32, score-0.5]

19 Second order methods include the logistic regression with a quadratic term [11] and the well known common spatial patterns method (CSP) [10] and its variants: common spatio-spectral patterns (CSSP)[12], and common sparse spectral spatial patterns (CSSSP)[13] . [sent-33, score-0.412]

20 Choosing what kind of features to use traditionally has been an ad hoc process motivated by knowledge of the underlying neurophysiology and task. [sent-34, score-0.085]

21 Instead it would be desirable for the analysis method to extract the relevant neurophysiological activity de novo with minimal prior expectations. [sent-36, score-0.205]

22 In this paper we present a new framework that combines both the first order features and the second order features in the analysis of EEG. [sent-37, score-0.084]

23 We use a bilinear formulation which can simultaneously extract spatial linear components as well as temporal (filtered) features. [sent-38, score-0.554]

24 1 Second order bilinear discriminant analysis Problem setting Given a set of sample points D = {Xn , yn }N , X ∈ RD × T , y ∈ {−1, 1} , where Xn corresponds n=1 to the EEG signal of D channels and T sample points and yn indicate the class that corresponds to one of two conditions (e. [sent-40, score-0.907]

25 right or left hand imaginary movement, stimulus versus control conditions, etc. [sent-42, score-0.049]

26 ), the task is then to predict the class label y for an unobserved trial X. [sent-43, score-0.094]

27 2 Second order bilinear model Define a function, f (X; θ) = C Trace(UT XV) + (1 − C) Trace(ΛAT (XB)(XB)T A) (1) where θ = {U ∈ RD × R , V ∈ RT × R , A ∈ RD × K B ∈ RT × T } are the parameters of the model, Λ ∈ diag({−1, 1}) a given diagonal matrix with elements {−1, 1} and C ∈ [0, 1]. [sent-45, score-0.265]

28 1 Interpretation of the model The first term of the equation (1) can be interpreted as a spatio-temporal projection of the signal, under the bilinear model, and captures the first order statistics of the signal. [sent-48, score-0.346]

29 Specifically, the columns ur of U represent R linear projections in space (rows of X). [sent-49, score-0.178]

30 Similarly, each of the R columns of vk in matrix V represent linear projections in time (columns of X). [sent-50, score-0.106]

31 By re-writing the term as: Trace(UT XV) = Trace(VUT X) = Trace(WT X) (3) T where we defined W = UV , it is easy to see that the bilinear projection is a linear combination of elements of X with the rank − R constrained on W. [sent-51, score-0.275]

32 This expression is linear in X and thus captures directly the amplitude of the signal directly. [sent-52, score-0.298]

33 In particular, the polarity of the signal (positive evoked response versus negative evoked response) will contribute significantly to discrimination if it is consistent across trials. [sent-53, score-0.465]

34 This term, therefore, captures phase locked event related potentials in the EEG signal. [sent-54, score-0.451]

35 The second term of equation (1), is a projection of the power of the filtered signal, which captures the second order statistics of the signal. [sent-55, score-0.172]

36 As before, each column of matrix A and B, represent components that project the data in space and time respectively. [sent-56, score-0.078]

37 Depending on the structure one enforces in matrix B different interpretations of the model can be archived. [sent-57, score-0.04]

38 In the general case where no structure on B is assumed, the model captures a linear combination of the elements of a rank − T second order matrix approximation of the signal Σ = XB(XB)T . [sent-58, score-0.252]

39 In the case where Toeplitz structure is enforced on B, then B defines a temporal filter on the signal and the model captures the linear combination of the power of the second order matrix of the filtered signal. [sent-59, score-0.464]

40 For example if B is fixed to a Toeplitz matrix with coefficients corresponding to a 8Hz-12Hz band pass filter, then the second term is able to extract differences in the alpha-band which is known to be modulated during motor related tasks. [sent-60, score-0.141]

41 Further, by learning B from the data, we may be able to identify new frequency bands that have so far not been identified in novel experimental paradigms. [sent-61, score-0.094]

42 The spatial weights A together with the Trace operation ensure that the power is measured, not in individual electrodes, but in some component space that may reflect activity distributed across several electrodes. [sent-62, score-0.44]

43 3 Logistic regression We use a logistic Rregression (LR) formalism as it is particularly convenient when imposing additional statistical properties on the matrices U, V, A, B such as smoothness or sparseness. [sent-65, score-0.119]

44 In addition, in our experience, LR performs well in strongly overlapping high-dimensional datasets and is insensitive to outliers, the later being of particular concern when including quadratic features. [sent-66, score-0.042]

45 4 (9) n=1 (11) Fourier Basis for B If matrix B is constrained to have a circular toepliz structure then it can be represented as B = F−1 DF, where F−1 denotes the inverse Fourier matrix, and D is a diagonal complex-valued matrix of Fourier coefficients. [sent-72, score-0.08]

46 1 Results Simulated data In order to validate our method and its ability to capture both linear and second order features, we generated simulated data that contained both types of features; namely ERP type of features and ERS/ERD type of features. [sent-76, score-0.12]

47 The simulated signals were generated with a signal to noise ratio of −20dB which is a typical noise level for EEG. [sent-77, score-0.219]

48 A total of 28 channels, 500 ms long signals and at a sampling frequency of 100Hz where generated, resulting in a matrix of X of 28 by 50 elements, for each trial. [sent-78, score-0.093]

49 1 30 0 50 100 150 200 250 300 time(m/s) channels A component 350 400 450 500 350 400 450 500 B component 1. [sent-88, score-0.342]

50 2 0 50 100 150 channels 200 250 300 time (m/s) Figure 1: Spatial and temporal component extracted on simulated data for the linear term (top) and quadratic term (bottom). [sent-98, score-0.606]

51 For channels 10-18, a peak represented by a half cycle sinusoid was added at approximately 400 ms, which T simulates an ERP type feature. [sent-101, score-0.289]

52 The linear component U (in this case only a column vector) has non-zero coefficients for channels 10 to 18 only, showing that the method correctly identified the ERP activity. [sent-103, score-0.266]

53 Furthermore, the associated temporal component V has a temporal profile that matches the time course of the simulated evoked response. [sent-104, score-0.489]

54 Similarly, the second order components A have non-zero weights for only channels 1-9 showing that the method also identified the spatial distribution of the non-phase locked activity. [sent-105, score-0.552]

55 The temporal filter B was trained in the frequency domain and the resulting filter is shown here in the time domain. [sent-106, score-0.173]

56 It exhibits a dominant 10Hz component, which is indeed the frequency of the non-phase locked activity. [sent-107, score-0.257]

57 2 BCI competition dataset To evaluate the performance of the proposed method on real data we applied the algorithm to an EEG data set that was made available through The BCI Competition 2003 ([14], Data Set IV). [sent-109, score-0.082]

58 EEG was recorded on 28 channels for a single subject performing self-paced key typing, that is, pressing the corresponding keys with the index and little fingers in a self-chosen order and timing (i. [sent-110, score-0.242]

59 A total of 416 epochs were recorded, each of length 500ms. [sent-115, score-0.057]

60 For the competition, the first 316 epochs were to be used for classifier training, while the remaining 100 epochs were to be used as a test set. [sent-116, score-0.114]

61 For this experiment, the matrix B was fixed to a Toeplitz structure that encodes a 10Hz33Hz bandpass filter and only the parameters U, V, A and w0 were trained. [sent-119, score-0.04]

62 The number of columns of U and V were set to 1, where two columns were used for A. [sent-120, score-0.132]

63 The temporal filter was selected based on prior knowledge of the relevant frequency band. [sent-121, score-0.173]

64 This demonstrates the flexibility of our approach to either incorporate prior knowledge when available or extract it from 5 U component V component 0. [sent-122, score-0.203]

65 1 First Column of A 0 100 200 300 time (m/s) 400 500 Second Column of A Figure 2: Spatial and temporal component (top), and two spatial components for second order features (bottom) learned on the benchmark dataset data otherwise. [sent-126, score-0.492]

66 Regularization parameters where chosen via a five fold cross validation procedure (details can be found in [8]). [sent-127, score-0.042]

67 The resulting components for this dataset are shown in Figure 2. [sent-128, score-0.038]

68 Benchmark performance was measured on the test set which had not been used during either training or cross validation. [sent-129, score-0.042]

69 The number of misclassified trials in the test set was 13 which places our method on a new first place given the results of the competition which can be found online http://ida. [sent-130, score-0.158]

70 The receiveroperator characteristic (ROC) curve for cross validation and for the independent testset are shown in Figure 3. [sent-136, score-0.09]

71 2 also shows the contribution of the linear and quadratic terms for every trial for the two types of key-presses. [sent-138, score-0.136]

72 4 Conclusion In this paper we have presented a framework for uncovering spatial as well as temporal features in EEG that combine the two predominant paradigms used in EEG analysis: event related potentials and oscillatory power. [sent-141, score-0.543]

73 These represent phase locked activity (where polarity of the activity matters), and non-phase locked activity (where only the power of the signal is relevant). [sent-142, score-1.171]

74 We used the probabilistic formalism of logistic regression that readily incorporates prior probabilities to regularize the increased number of parameters. [sent-143, score-0.119]

75 We have evaluated the proposed method on both simulated data, and a real BCI benchmark dataset, achieving state-of-the-art classification performance. [sent-144, score-0.174]

76 For example, different sets of basis functions (other than a Fourier basis) can be enforced on the temporal decomposition of the data through the matrix B (e. [sent-146, score-0.201]

77 8 1 Figure 3: ROC curve with area under the curve 0. [sent-177, score-0.096]

78 96 for the cross validation on the benchmark dataset (left). [sent-178, score-0.138]

79 93, on the independent test set, for the benchmark dataset. [sent-180, score-0.096]

80 There were a total of 13 errors on unseen data, which is less than any of the results previously reported, placing this method in first place in the benchmark ranking. [sent-181, score-0.096]

81 It is clear that the two types of features contain independent information that can help improve the classification performance. [sent-183, score-0.042]

82 Boosting bit rates and error detection for the classification of fast-paced motor commands based on single-trial eeg analysis. [sent-232, score-0.413]

83 Spatiotemporal linear decoding of brain state: Application to performance augmentation in high-throughput tasks. [sent-247, score-0.063]

84 Neural representation of task difficulty and decision making during perceptual categorization: A timing diagram. [sent-251, score-0.052]

85 Optimal spatial filtering of single trial EEG during u imagined hand movement. [sent-268, score-0.214]

86 Spatio-spectral filters for improving the classification of single trial eeg. [sent-285, score-0.094]

87 Combined optimization of spatial and temporal filters for improving brain-computer interfacing. [sent-296, score-0.24]

88 The bci competition 2003: progress and perspectives in detection and discrimination of eeg single trials. [sent-317, score-0.662]


similar papers computed by tfidf model

tfidf for this paper:

wordName wordTfidf (topN-words)

[('eeg', 0.413), ('bilinear', 0.225), ('locked', 0.204), ('channels', 0.19), ('erp', 0.161), ('discriminant', 0.155), ('activity', 0.154), ('signal', 0.141), ('xn', 0.132), ('bci', 0.128), ('blankertz', 0.128), ('spatial', 0.12), ('temporal', 0.12), ('ur', 0.112), ('curio', 0.112), ('lucas', 0.112), ('wo', 0.102), ('yn', 0.098), ('christoforos', 0.096), ('christoforou', 0.096), ('benchmark', 0.096), ('evoked', 0.095), ('parra', 0.095), ('trial', 0.094), ('fourier', 0.092), ('xb', 0.09), ('oscillatory', 0.09), ('amplitude', 0.086), ('ar', 0.082), ('competition', 0.082), ('logistic', 0.08), ('simulated', 0.078), ('trials', 0.076), ('component', 0.076), ('vr', 0.072), ('captures', 0.071), ('trace', 0.07), ('columns', 0.066), ('lter', 0.065), ('dyrholm', 0.064), ('mads', 0.064), ('marios', 0.064), ('philiastides', 0.064), ('toeplitz', 0.064), ('event', 0.064), ('brain', 0.063), ('classi', 0.062), ('ut', 0.059), ('potentials', 0.059), ('avenue', 0.059), ('biomedical', 0.057), ('epochs', 0.057), ('losch', 0.056), ('ryota', 0.056), ('sajda', 0.056), ('tomioka', 0.056), ('kazuyuki', 0.056), ('polarity', 0.056), ('rt', 0.056), ('paul', 0.055), ('phase', 0.053), ('ak', 0.053), ('frequency', 0.053), ('timing', 0.052), ('ltered', 0.052), ('gerson', 0.051), ('birbaumer', 0.051), ('dornhege', 0.051), ('pfurtscheller', 0.051), ('wolpaw', 0.051), ('sinusoid', 0.051), ('power', 0.051), ('extract', 0.051), ('rd', 0.05), ('term', 0.05), ('city', 0.049), ('stimulus', 0.049), ('curve', 0.048), ('hinterberger', 0.048), ('paradigms', 0.048), ('simulates', 0.048), ('bt', 0.047), ('xv', 0.045), ('lr', 0.045), ('roc', 0.045), ('engineering', 0.044), ('hoc', 0.043), ('log', 0.043), ('quadratic', 0.042), ('features', 0.042), ('cross', 0.042), ('bands', 0.041), ('enforced', 0.041), ('matrix', 0.04), ('discrimination', 0.039), ('adam', 0.039), ('formalism', 0.039), ('across', 0.039), ('components', 0.038), ('df', 0.038)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 1.000001 173 nips-2007-Second Order Bilinear Discriminant Analysis for single trial EEG analysis

Author: Christoforos Christoforou, Paul Sajda, Lucas C. Parra

Abstract: Traditional analysis methods for single-trial classification of electroencephalography (EEG) focus on two types of paradigms: phase locked methods, in which the amplitude of the signal is used as the feature for classification, e.g. event related potentials; and second order methods, in which the feature of interest is the power of the signal, e.g. event related (de)synchronization. The procedure for deciding which paradigm to use is ad hoc and is typically driven by knowledge of the underlying neurophysiology. Here we propose a principled method, based on a bilinear model, in which the algorithm simultaneously learns the best first and second order spatial and temporal features for classification of EEG. The method is demonstrated on simulated data as well as on EEG taken from a benchmark data used to test classification algorithms for brain computer interfaces. 1 1.1

2 0.30144376 106 nips-2007-Invariant Common Spatial Patterns: Alleviating Nonstationarities in Brain-Computer Interfacing

Author: Benjamin Blankertz, Motoaki Kawanabe, Ryota Tomioka, Friederike Hohlefeld, Klaus-Robert Müller, Vadim V. Nikulin

Abstract: Brain-Computer Interfaces can suffer from a large variance of the subject conditions within and across sessions. For example vigilance fluctuations in the individual, variable task involvement, workload etc. alter the characteristics of EEG signals and thus challenge a stable BCI operation. In the present work we aim to define features based on a variant of the common spatial patterns (CSP) algorithm that are constructed invariant with respect to such nonstationarities. We enforce invariance properties by adding terms to the denominator of a Rayleigh coefficient representation of CSP such as disturbance covariance matrices from fluctuations in visual processing. In this manner physiological prior knowledge can be used to shape the classification engine for BCI. As a proof of concept we present a BCI classifier that is robust to changes in the level of parietal α -activity. In other words, the EEG decoding still works when there are lapses in vigilance.

3 0.21867406 74 nips-2007-EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection

Author: Pierre Ferrez, José Millán

Abstract: Brain-computer interfaces (BCIs), as any other interaction modality based on physiological signals and body channels (e.g., muscular activity, speech and gestures), are prone to errors in the recognition of subject’s intent. An elegant approach to improve the accuracy of BCIs consists in a verification procedure directly based on the presence of error-related potentials (ErrP) in the EEG recorded right after the occurrence of an error. Six healthy volunteer subjects with no prior BCI experience participated in a new human-robot interaction experiment where they were asked to mentally move a cursor towards a target that can be reached within a few steps using motor imagination. This experiment confirms the previously reported presence of a new kind of ErrP. These “Interaction ErrP” exhibit a first sharp negative peak followed by a positive peak and a second broader negative peak (∼290, ∼350 and ∼470 ms after the feedback, respectively). But in order to exploit these ErrP we need to detect them in each single trial using a short window following the feedback associated to the response of the classifier embedded in the BCI. We have achieved an average recognition rate of correct and erroneous single trials of 81.8% and 76.2%, respectively. Furthermore, we have achieved an average recognition rate of the subject’s intent while trying to mentally drive the cursor of 73.1%. These results show that it’s possible to simultaneously extract useful information for mental control to operate a brain-actuated device as well as cognitive states such as error potentials to improve the quality of the braincomputer interaction. Finally, using a well-known inverse model (sLORETA), we show that the main focus of activity at the occurrence of the ErrP are, as expected, in the pre-supplementary motor area and in the anterior cingulate cortex. 1

4 0.11112355 127 nips-2007-Measuring Neural Synchrony by Message Passing

Author: Justin Dauwels, François Vialatte, Tomasz Rutkowski, Andrzej S. Cichocki

Abstract: A novel approach to measure the interdependence of two time series is proposed, referred to as “stochastic event synchrony” (SES); it quantifies the alignment of two point processes by means of the following parameters: time delay, variance of the timing jitter, fraction of “spurious” events, and average similarity of events. SES may be applied to generic one-dimensional and multi-dimensional point processes, however, the paper mainly focusses on point processes in time-frequency domain. The average event similarity is in that case described by two parameters: the average frequency offset between events in the time-frequency plane, and the variance of the frequency offset (“frequency jitter”); SES then consists of five parameters in total. Those parameters quantify the synchrony of oscillatory events, and hence, they provide an alternative to existing synchrony measures that quantify amplitude or phase synchrony. The pairwise alignment of point processes is cast as a statistical inference problem, which is solved by applying the maxproduct algorithm on a graphical model. The SES parameters are determined from the resulting pairwise alignment by maximum a posteriori (MAP) estimation. The proposed interdependence measure is applied to the problem of detecting anomalies in EEG synchrony of Mild Cognitive Impairment (MCI) patients; the results indicate that SES significantly improves the sensitivity of EEG in detecting MCI.

5 0.09359847 103 nips-2007-Inferring Elapsed Time from Stochastic Neural Processes

Author: Misha Ahrens, Maneesh Sahani

Abstract: Many perceptual processes and neural computations, such as speech recognition, motor control and learning, depend on the ability to measure and mark the passage of time. However, the processes that make such temporal judgements possible are unknown. A number of different hypothetical mechanisms have been advanced, all of which depend on the known, temporally predictable evolution of a neural or psychological state, possibly through oscillations or the gradual decay of a memory trace. Alternatively, judgements of elapsed time might be based on observations of temporally structured, but stochastic processes. Such processes need not be specific to the sense of time; typical neural and sensory processes contain at least some statistical structure across a range of time scales. Here, we investigate the statistical properties of an estimator of elapsed time which is based on a simple family of stochastic process. 1

6 0.092440739 37 nips-2007-Blind channel identification for speech dereverberation using l1-norm sparse learning

7 0.08975286 77 nips-2007-Efficient Inference for Distributions on Permutations

8 0.080885321 44 nips-2007-Catching Up Faster in Bayesian Model Selection and Model Averaging

9 0.077878572 154 nips-2007-Predicting Brain States from fMRI Data: Incremental Functional Principal Component Regression

10 0.076894656 43 nips-2007-Catching Change-points with Lasso

11 0.076248623 212 nips-2007-Using Deep Belief Nets to Learn Covariance Kernels for Gaussian Processes

12 0.076208994 59 nips-2007-Continuous Time Particle Filtering for fMRI

13 0.07450752 164 nips-2007-Receptive Fields without Spike-Triggering

14 0.071940318 177 nips-2007-Simplified Rules and Theoretical Analysis for Information Bottleneck Optimization and PCA with Spiking Neurons

15 0.070918269 146 nips-2007-On higher-order perceptron algorithms

16 0.070449315 111 nips-2007-Learning Horizontal Connections in a Sparse Coding Model of Natural Images

17 0.069239132 202 nips-2007-The discriminant center-surround hypothesis for bottom-up saliency

18 0.068788938 17 nips-2007-A neural network implementing optimal state estimation based on dynamic spike train decoding

19 0.067238919 149 nips-2007-Optimal ROC Curve for a Combination of Classifiers

20 0.067154661 203 nips-2007-The rat as particle filter


similar papers computed by lsi model

lsi for this paper:

topicId topicWeight

[(0, -0.229), (1, 0.076), (2, 0.078), (3, 0.038), (4, 0.01), (5, 0.191), (6, 0.052), (7, 0.088), (8, -0.139), (9, -0.122), (10, 0.022), (11, -0.144), (12, -0.049), (13, -0.035), (14, 0.224), (15, -0.037), (16, 0.009), (17, -0.257), (18, -0.168), (19, -0.046), (20, -0.095), (21, -0.1), (22, 0.045), (23, 0.172), (24, 0.129), (25, -0.089), (26, 0.122), (27, -0.126), (28, 0.056), (29, 0.056), (30, 0.021), (31, 0.127), (32, 0.014), (33, -0.016), (34, -0.021), (35, -0.03), (36, 0.03), (37, -0.03), (38, 0.016), (39, -0.001), (40, -0.006), (41, -0.042), (42, 0.016), (43, 0.064), (44, 0.017), (45, 0.033), (46, 0.018), (47, 0.042), (48, 0.012), (49, 0.019)]

similar papers list:

simIndex simValue paperId paperTitle

same-paper 1 0.94415921 173 nips-2007-Second Order Bilinear Discriminant Analysis for single trial EEG analysis

Author: Christoforos Christoforou, Paul Sajda, Lucas C. Parra

Abstract: Traditional analysis methods for single-trial classification of electroencephalography (EEG) focus on two types of paradigms: phase locked methods, in which the amplitude of the signal is used as the feature for classification, e.g. event related potentials; and second order methods, in which the feature of interest is the power of the signal, e.g. event related (de)synchronization. The procedure for deciding which paradigm to use is ad hoc and is typically driven by knowledge of the underlying neurophysiology. Here we propose a principled method, based on a bilinear model, in which the algorithm simultaneously learns the best first and second order spatial and temporal features for classification of EEG. The method is demonstrated on simulated data as well as on EEG taken from a benchmark data used to test classification algorithms for brain computer interfaces. 1 1.1

2 0.91696924 106 nips-2007-Invariant Common Spatial Patterns: Alleviating Nonstationarities in Brain-Computer Interfacing

Author: Benjamin Blankertz, Motoaki Kawanabe, Ryota Tomioka, Friederike Hohlefeld, Klaus-Robert Müller, Vadim V. Nikulin

Abstract: Brain-Computer Interfaces can suffer from a large variance of the subject conditions within and across sessions. For example vigilance fluctuations in the individual, variable task involvement, workload etc. alter the characteristics of EEG signals and thus challenge a stable BCI operation. In the present work we aim to define features based on a variant of the common spatial patterns (CSP) algorithm that are constructed invariant with respect to such nonstationarities. We enforce invariance properties by adding terms to the denominator of a Rayleigh coefficient representation of CSP such as disturbance covariance matrices from fluctuations in visual processing. In this manner physiological prior knowledge can be used to shape the classification engine for BCI. As a proof of concept we present a BCI classifier that is robust to changes in the level of parietal α -activity. In other words, the EEG decoding still works when there are lapses in vigilance.

3 0.83923161 74 nips-2007-EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection

Author: Pierre Ferrez, José Millán

Abstract: Brain-computer interfaces (BCIs), as any other interaction modality based on physiological signals and body channels (e.g., muscular activity, speech and gestures), are prone to errors in the recognition of subject’s intent. An elegant approach to improve the accuracy of BCIs consists in a verification procedure directly based on the presence of error-related potentials (ErrP) in the EEG recorded right after the occurrence of an error. Six healthy volunteer subjects with no prior BCI experience participated in a new human-robot interaction experiment where they were asked to mentally move a cursor towards a target that can be reached within a few steps using motor imagination. This experiment confirms the previously reported presence of a new kind of ErrP. These “Interaction ErrP” exhibit a first sharp negative peak followed by a positive peak and a second broader negative peak (∼290, ∼350 and ∼470 ms after the feedback, respectively). But in order to exploit these ErrP we need to detect them in each single trial using a short window following the feedback associated to the response of the classifier embedded in the BCI. We have achieved an average recognition rate of correct and erroneous single trials of 81.8% and 76.2%, respectively. Furthermore, we have achieved an average recognition rate of the subject’s intent while trying to mentally drive the cursor of 73.1%. These results show that it’s possible to simultaneously extract useful information for mental control to operate a brain-actuated device as well as cognitive states such as error potentials to improve the quality of the braincomputer interaction. Finally, using a well-known inverse model (sLORETA), we show that the main focus of activity at the occurrence of the ErrP are, as expected, in the pre-supplementary motor area and in the anterior cingulate cortex. 1

4 0.53367877 37 nips-2007-Blind channel identification for speech dereverberation using l1-norm sparse learning

Author: Yuanqing Lin, Jingdong Chen, Youngmoo Kim, Daniel D. Lee

Abstract: Speech dereverberation remains an open problem after more than three decades of research. The most challenging step in speech dereverberation is blind channel identification (BCI). Although many BCI approaches have been developed, their performance is still far from satisfactory for practical applications. The main difficulty in BCI lies in finding an appropriate acoustic model, which not only can effectively resolve solution degeneracies due to the lack of knowledge of the source, but also robustly models real acoustic environments. This paper proposes a sparse acoustic room impulse response (RIR) model for BCI, that is, an acoustic RIR can be modeled by a sparse FIR filter. Under this model, we show how to formulate the BCI of a single-input multiple-output (SIMO) system into a l1 norm regularized least squares (LS) problem, which is convex and can be solved efficiently with guaranteed global convergence. The sparseness of solutions is controlled by l1 -norm regularization parameters. We propose a sparse learning scheme that infers the optimal l1 -norm regularization parameters directly from microphone observations under a Bayesian framework. Our results show that the proposed approach is effective and robust, and it yields source estimates in real acoustic environments with high fidelity to anechoic chamber measurements.

5 0.48827711 24 nips-2007-An Analysis of Inference with the Universum

Author: Olivier Chapelle, Alekh Agarwal, Fabian H. Sinz, Bernhard Schölkopf

Abstract: We study a pattern classification algorithm which has recently been proposed by Vapnik and coworkers. It builds on a new inductive principle which assumes that in addition to positive and negative data, a third class of data is available, termed the Universum. We assay the behavior of the algorithm by establishing links with Fisher discriminant analysis and oriented PCA, as well as with an SVM in a projected subspace (or, equivalently, with a data-dependent reduced kernel). We also provide experimental results. 1

6 0.41981655 127 nips-2007-Measuring Neural Synchrony by Message Passing

7 0.38083169 44 nips-2007-Catching Up Faster in Bayesian Model Selection and Model Averaging

8 0.38048893 43 nips-2007-Catching Change-points with Lasso

9 0.35397965 130 nips-2007-Modeling Natural Sounds with Modulation Cascade Processes

10 0.32478917 26 nips-2007-An online Hebbian learning rule that performs Independent Component Analysis

11 0.32005152 25 nips-2007-An in-silico Neural Model of Dynamic Routing through Neuronal Coherence

12 0.30633327 103 nips-2007-Inferring Elapsed Time from Stochastic Neural Processes

13 0.29849166 59 nips-2007-Continuous Time Particle Filtering for fMRI

14 0.29486686 149 nips-2007-Optimal ROC Curve for a Combination of Classifiers

15 0.29395545 77 nips-2007-Efficient Inference for Distributions on Permutations

16 0.29334912 57 nips-2007-Congruence between model and human attention reveals unique signatures of critical visual events

17 0.28728011 93 nips-2007-GRIFT: A graphical model for inferring visual classification features from human data

18 0.28649741 188 nips-2007-Subspace-Based Face Recognition in Analog VLSI

19 0.27653056 124 nips-2007-Managing Power Consumption and Performance of Computing Systems Using Reinforcement Learning

20 0.26900733 28 nips-2007-Augmented Functional Time Series Representation and Forecasting with Gaussian Processes


similar papers computed by lda model

lda for this paper:

topicId topicWeight

[(4, 0.015), (5, 0.046), (13, 0.018), (16, 0.032), (18, 0.024), (19, 0.461), (21, 0.06), (34, 0.019), (35, 0.026), (47, 0.077), (83, 0.081), (85, 0.023), (87, 0.013), (90, 0.043)]

similar papers list:

simIndex simValue paperId paperTitle

1 0.86131924 106 nips-2007-Invariant Common Spatial Patterns: Alleviating Nonstationarities in Brain-Computer Interfacing

Author: Benjamin Blankertz, Motoaki Kawanabe, Ryota Tomioka, Friederike Hohlefeld, Klaus-Robert Müller, Vadim V. Nikulin

Abstract: Brain-Computer Interfaces can suffer from a large variance of the subject conditions within and across sessions. For example vigilance fluctuations in the individual, variable task involvement, workload etc. alter the characteristics of EEG signals and thus challenge a stable BCI operation. In the present work we aim to define features based on a variant of the common spatial patterns (CSP) algorithm that are constructed invariant with respect to such nonstationarities. We enforce invariance properties by adding terms to the denominator of a Rayleigh coefficient representation of CSP such as disturbance covariance matrices from fluctuations in visual processing. In this manner physiological prior knowledge can be used to shape the classification engine for BCI. As a proof of concept we present a BCI classifier that is robust to changes in the level of parietal α -activity. In other words, the EEG decoding still works when there are lapses in vigilance.

same-paper 2 0.83746594 173 nips-2007-Second Order Bilinear Discriminant Analysis for single trial EEG analysis

Author: Christoforos Christoforou, Paul Sajda, Lucas C. Parra

Abstract: Traditional analysis methods for single-trial classification of electroencephalography (EEG) focus on two types of paradigms: phase locked methods, in which the amplitude of the signal is used as the feature for classification, e.g. event related potentials; and second order methods, in which the feature of interest is the power of the signal, e.g. event related (de)synchronization. The procedure for deciding which paradigm to use is ad hoc and is typically driven by knowledge of the underlying neurophysiology. Here we propose a principled method, based on a bilinear model, in which the algorithm simultaneously learns the best first and second order spatial and temporal features for classification of EEG. The method is demonstrated on simulated data as well as on EEG taken from a benchmark data used to test classification algorithms for brain computer interfaces. 1 1.1

3 0.75005537 13 nips-2007-A Unified Near-Optimal Estimator For Dimension Reduction in $l \alpha$ ($0<\alpha\leq 2$) Using Stable Random Projections

Author: Ping Li, Trevor J. Hastie

Abstract: Many tasks (e.g., clustering) in machine learning only require the lα distances instead of the original data. For dimension reductions in the lα norm (0 < α ≤ 2), the method of stable random projections can efficiently compute the lα distances in massive datasets (e.g., the Web or massive data streams) in one pass of the data. The estimation task for stable random projections has been an interesting topic. We propose a simple estimator based on the fractional power of the samples (projected data), which is surprisingly near-optimal in terms of the asymptotic variance. In fact, it achieves the Cram´ r-Rao bound when α = 2 and α = 0+. This e new result will be useful when applying stable random projections to distancebased clustering, classifications, kernels, massive data streams etc.

4 0.54910332 74 nips-2007-EEG-Based Brain-Computer Interaction: Improved Accuracy by Automatic Single-Trial Error Detection

Author: Pierre Ferrez, José Millán

Abstract: Brain-computer interfaces (BCIs), as any other interaction modality based on physiological signals and body channels (e.g., muscular activity, speech and gestures), are prone to errors in the recognition of subject’s intent. An elegant approach to improve the accuracy of BCIs consists in a verification procedure directly based on the presence of error-related potentials (ErrP) in the EEG recorded right after the occurrence of an error. Six healthy volunteer subjects with no prior BCI experience participated in a new human-robot interaction experiment where they were asked to mentally move a cursor towards a target that can be reached within a few steps using motor imagination. This experiment confirms the previously reported presence of a new kind of ErrP. These “Interaction ErrP” exhibit a first sharp negative peak followed by a positive peak and a second broader negative peak (∼290, ∼350 and ∼470 ms after the feedback, respectively). But in order to exploit these ErrP we need to detect them in each single trial using a short window following the feedback associated to the response of the classifier embedded in the BCI. We have achieved an average recognition rate of correct and erroneous single trials of 81.8% and 76.2%, respectively. Furthermore, we have achieved an average recognition rate of the subject’s intent while trying to mentally drive the cursor of 73.1%. These results show that it’s possible to simultaneously extract useful information for mental control to operate a brain-actuated device as well as cognitive states such as error potentials to improve the quality of the braincomputer interaction. Finally, using a well-known inverse model (sLORETA), we show that the main focus of activity at the occurrence of the ErrP are, as expected, in the pre-supplementary motor area and in the anterior cingulate cortex. 1

5 0.4410378 37 nips-2007-Blind channel identification for speech dereverberation using l1-norm sparse learning

Author: Yuanqing Lin, Jingdong Chen, Youngmoo Kim, Daniel D. Lee

Abstract: Speech dereverberation remains an open problem after more than three decades of research. The most challenging step in speech dereverberation is blind channel identification (BCI). Although many BCI approaches have been developed, their performance is still far from satisfactory for practical applications. The main difficulty in BCI lies in finding an appropriate acoustic model, which not only can effectively resolve solution degeneracies due to the lack of knowledge of the source, but also robustly models real acoustic environments. This paper proposes a sparse acoustic room impulse response (RIR) model for BCI, that is, an acoustic RIR can be modeled by a sparse FIR filter. Under this model, we show how to formulate the BCI of a single-input multiple-output (SIMO) system into a l1 norm regularized least squares (LS) problem, which is convex and can be solved efficiently with guaranteed global convergence. The sparseness of solutions is controlled by l1 -norm regularization parameters. We propose a sparse learning scheme that infers the optimal l1 -norm regularization parameters directly from microphone observations under a Bayesian framework. Our results show that the proposed approach is effective and robust, and it yields source estimates in real acoustic environments with high fidelity to anechoic chamber measurements.

6 0.40776676 24 nips-2007-An Analysis of Inference with the Universum

7 0.37934646 127 nips-2007-Measuring Neural Synchrony by Message Passing

8 0.37099013 161 nips-2007-Random Projections for Manifold Learning

9 0.36534789 122 nips-2007-Locality and low-dimensions in the prediction of natural experience from fMRI

10 0.35865274 177 nips-2007-Simplified Rules and Theoretical Analysis for Information Bottleneck Optimization and PCA with Spiking Neurons

11 0.35844079 154 nips-2007-Predicting Brain States from fMRI Data: Incremental Functional Principal Component Regression

12 0.35688016 164 nips-2007-Receptive Fields without Spike-Triggering

13 0.35434884 202 nips-2007-The discriminant center-surround hypothesis for bottom-up saliency

14 0.35052606 25 nips-2007-An in-silico Neural Model of Dynamic Routing through Neuronal Coherence

15 0.34751585 179 nips-2007-SpAM: Sparse Additive Models

16 0.33996347 57 nips-2007-Congruence between model and human attention reveals unique signatures of critical visual events

17 0.33944613 36 nips-2007-Better than least squares: comparison of objective functions for estimating linear-nonlinear models

18 0.33925921 138 nips-2007-Near-Maximum Entropy Models for Binary Neural Representations of Natural Images

19 0.33486184 26 nips-2007-An online Hebbian learning rule that performs Independent Component Analysis

20 0.3345331 53 nips-2007-Compressed Regression