nips nips2012 nips2012-276 knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Jonathan Huang, Daniel Alexander
Abstract: Accurate and detailed models of neurodegenerative disease progression are crucially important for reliable early diagnosis and the determination of effective treatments. We introduce the ALPACA (Alzheimer’s disease Probabilistic Cascades) model, a generative model linking latent Alzheimer’s progression dynamics to observable biomarker data. In contrast with previous works which model disease progression as a fixed event ordering, we explicitly model the variability over such orderings among patients which is more realistic, particularly for highly detailed progression models. We describe efficient learning algorithms for ALPACA and discuss promising experimental results on a real cohort of Alzheimer’s patients from the Alzheimer’s Disease Neuroimaging Initiative. 1
Reference: text
sentIndex sentText sentNum sentScore
1 edu Abstract Accurate and detailed models of neurodegenerative disease progression are crucially important for reliable early diagnosis and the determination of effective treatments. [sent-6, score-0.539]
2 We introduce the ALPACA (Alzheimer’s disease Probabilistic Cascades) model, a generative model linking latent Alzheimer’s progression dynamics to observable biomarker data. [sent-7, score-0.446]
3 In contrast with previous works which model disease progression as a fixed event ordering, we explicitly model the variability over such orderings among patients which is more realistic, particularly for highly detailed progression models. [sent-8, score-0.956]
4 1 Introduction Models of disease progression are among the core tools of modern medicine for early disease diagnosis, treatment determination and for explaining symptoms to patients. [sent-10, score-0.698]
5 Despite their utility, traditional models of disease progression [3, 17] have largely been limited to coarse symptomatic staging which divides patients into a small number of groups by thresholding a crude clinical score of how far the disease has progressed. [sent-14, score-1.036]
6 The models are thus only as precise as these crude clinical scores — although providing insight into disease mechanisms, they provide little benefit for early diagnosis or accurate patient staging. [sent-15, score-0.704]
7 The recent availability of cross sectional datasets such as the Alzheimer’s Disease Neuroimaging Initiative data has generated intense speculation in the neurology community about the nature of the cascade of events in AD and the ordering in which biomarkers show abnormality. [sent-17, score-0.57]
8 Beckett [2] was the first, nearly two decades ago, to propose a data driven model of disease progression using a distribution over orderings of clinical events. [sent-20, score-0.565]
9 Examples of events in the model of [8] include (but are not limited to) clinical events (such as a transition from Presymptomatic Alzheimer’s to Mild Cognitive Impairment) and the onset of atrophy (reduction of tissue volume). [sent-24, score-0.597]
10 By assuming a single universal ordering of events within the disease progression, the method of [8] is able to scale to much larger collections of events, thus achieving much more detailed characterizations of disease progression compared to that of [2]. [sent-25, score-1.16]
11 1 The assumption made in [8] of a universal ordering common to all patients within a disease cohort, is a major oversimplification of reality, however, where the event ordering can vary considerably among patients even if it is consistent enough to distinguish different diseases. [sent-26, score-1.173]
12 In practice, the assumption of a universal ordering within the model means we cannot recover the diversity of orderings over population groups and can make fitting the model to patient data unstable. [sent-27, score-0.631]
13 Additionally, like [8], our method can achieve the scalability that is required to produced fine-grained disease progression models. [sent-31, score-0.424]
14 • We propose the Alzheimer’s disease Probabilistic Cascades (ALPACA) model, a probabilistic model of disease cascades, allowing for patients to have distinct event orderings. [sent-33, score-0.857]
15 • We develop efficient probabilistic inference and learning algorithms for ALPACA, including a novel patient “staging” method, which predicts a patient’s full trajectory through clinical and atrophy events from sparse and noisy measurement data. [sent-34, score-0.8]
16 2 Preliminaries: Snapshots of neurodegenerative disease cascades We model a neurodegenerative disease cascade as an ordering of a discrete set of N events, {e1 , . [sent-36, score-0.991]
17 These events represent changes in patient state, such as a sufficiently low score on a memory test for a clinical diagnosis of AD, or the first measurement of tissue pathology, such as significant atrophy in the hippocampus (memory related brain area). [sent-40, score-0.844]
18 An ordering over events is represented as a permutation σ which corresponds events to the positions within the ordering at which they occur. [sent-41, score-0.882]
19 In practice, the ordering σ for a particular patient can only be observed indirectly via snapshots which probe at a particular point in time whether each event has occurred or not. [sent-46, score-0.953]
20 We denote a snapshot by a vector of N measurements z = (ze1 , . [sent-47, score-0.49]
21 , zen ), where each zei is a real valued measurement reflecting a noisy diagnosis as to whether event i of the disease progression has occurred prior to measuring z. [sent-50, score-0.825]
22 1 Were it not for noise within the measurement process, a single snapshot z would partition the event set into two disjoint subsets: events that have occurred already (e. [sent-51, score-1.035]
23 Where prior models [8] considered data in which a patient is only associated with a single snapshot (taken at a single time point), we allow for multiple snapshots of a patient to be taken spaced throughout that patient’s disease cascade. [sent-62, score-1.511]
24 For example, if σ = e3 |e1 |e4 |e5 |e6 |e2 , then k = 2 snapshots might partition the event ordering into sets X1 = {e1 , e3 }, X2 = {e4 , e5 }, X3 = {e2 , e6 }, reflecting that events in X1 occur before events in X2 , which occur before events in X3 . [sent-64, score-1.278]
25 Such partitions can also be thought of as partial rankings over the events (and indeed, we will exploit recent methods for learning with partial rankings in our own approach, [11]). [sent-65, score-0.4]
26 To denote partial rankings, we again use vertical bars, separating the events that occur between snapshots. [sent-66, score-0.307]
27 Instead of reasoning with continuous snapshot times, we use the fact that many distinct snapshot times can result in the same partial ranking, to reason instead with discrete snapshot sets. [sent-70, score-1.446]
28 By snapshot set, we refer to the collection of positions in the full event ordering just before each snapshot is taken. [sent-71, score-1.337]
29 In our running example, the snapshot set is τ = {2, 4}. [sent-72, score-0.464]
30 Given a full ordering σ, the partial ranking which arises from snapshot data (assuming no noise) is fully determined by τ . [sent-73, score-0.842]
31 3 ALPACA: the Alzheimer’s disease Probabilistic Cascades model We now present ALPACA, a generative model of noisy snapshots in which the event ordering for each patient is a latent variable. [sent-76, score-1.142]
32 ALPACA makes two main assumptions: (1), that the measured outcomes for each patient are independent of each other and (2), conditioned on the event ordering of each 1 For notational simplicity, we assume that measurements corresponding to each event are scalar valued. [sent-77, score-0.957]
33 2 patient and the time at which a snapshot is taken, the measurements for each event are independent. [sent-79, score-0.99]
34 In contrast with [8], we do not assume that multiple snapshot vectors for the same patient are independent of each other. [sent-80, score-0.778]
35 Draw an ordering of the events σ (j) from a Mallows distribution P (σ; σ0 , λ) over orderings. [sent-86, score-0.441]
36 Draw a snapshot set τ (j) from a uniform distribution P (τ ) over subsets of the event set. [sent-88, score-0.65]
37 For each element of the snapshot set, τi = τ1 , . [sent-90, score-0.464]
38 , if event e has occurred prior to time τi ), draw zi,e ∼ (j) N (µoccurred , coccurred ). [sent-98, score-0.332]
39 e e e e (j) In the above basic model, each entry of a snapshot vector, zi,e , is generated by sampling from a univariate measurement model (assumed in this case to be Gaussian). [sent-100, score-0.574]
40 If event e has already occurred, (j) (j) the observation zi,e is sampled from the distribution N (µoccurred , coccurred ) — otherwise zi,e is e e sampled from a measurement distribution estimated from a control population of healthy individuals, N (µhealthy , chealthy ). [sent-101, score-0.395]
41 For notational simplicity, we denote the collection of snapshots for patient e e (j) (j) j by z·,· = {zi,e }i=1,. [sent-102, score-0.459]
42 The Kendall’s tau distance penalizes the number of inversions, or pairs of events for which σ and σ0 disagree over relative ordering. [sent-112, score-0.306]
43 Our algorithms are thus applicable for more general classes of distributions over orderings as well as snapshot sets. [sent-117, score-0.537]
44 With respect to the event-based characterization of disease progression, a critical problem is that of patient staging, the problem of determining the extent to which a disease has progressed for a particular patient given corresponding measurement data. [sent-119, score-1.258]
45 ALPACA offers a simple and natural formulation of the patient staging problem as a probabilistic inference query. [sent-120, score-0.465]
46 In particular, given the measurements corresponding to a particular patient, we perform patient staging by: (1) computing a posterior distribution over the event ordering σ (j) , then (2) computing a posterior distribution over the most recent element of the snapshot set τ (j) . [sent-121, score-1.423]
47 To visualize the posterior distribution over the event ordering σ (j) , we plot a simple “first-order staging diagram”, displaying the probability that event e has occurred (or will occur) in position q according to the posterior. [sent-122, score-0.87]
48 Two major features differentiate ALPACA from traditional patient staging approaches, in which patients are binned into a small number of imprecisely defined stages. [sent-123, score-0.563]
49 In particular, our method is more fine-grained, allowing for a detailed picture of what the patient has undergone as well as a prediction of what is to come next. [sent-124, score-0.314]
50 Given a collection of K (j) snapshots for a patient j, the critical inference problem that we must solve is that of computing a posterior distribution over the latent event order and snapshot set for that patient. [sent-129, score-1.176]
51 One reason is that it is not obvious how to tractably sample the event ordering σ conditioned on its Markov blanket, given that the corresponding likelihood function is not conjugate prior to the Mallows model. [sent-134, score-0.431]
52 This augmented model is equivalent to the original model, but has the advantage that it reduces the sampling step for the event ordering σ to a well understood problem (described below). [sent-138, score-0.437]
53 1) Observe that since the snapshot set τ is fully determined by the partial ranking γ, it is not necessary to condition on τ in Equation 4. [sent-141, score-0.619]
54 1 (right), since γ is fully determined given both the event ordering σ and the snapshot set τ , one can sample τ first, and deterministically reconstruct γ. [sent-144, score-0.873]
55 We now turn to the problem of sampling a snapshot set τ (j) of size K (j) from Equation 4. [sent-157, score-0.492]
56 In the following, we present a dynamic programming algorithm for sampling snapshot sets with running time much lower than the exhaustive setting (even for small K (j) ). [sent-161, score-0.492]
57 Our core insight is to exploit conditional independence relations within the posterior distribution over snapshot sets. [sent-162, score-0.506]
58 As we show in the following, however, we can bijectively associate each snapshot set with a trajectory through a certain grid. [sent-164, score-0.464]
59 With respect to this grid-based representation of snapshot sets, we then show that the posterior distribution can be viewed as that of a particular hidden Markov model (HMM). [sent-165, score-0.506]
60 , given a snapshot set τ , there is a unique staircase walk pτ mapping to τ ). [sent-186, score-0.669]
61 (b): Grid structured state space G for sampling snapshot sets with edges labeled with transition probabilities according to Equation 4. [sent-188, score-0.492]
62 The example path (highlighted) is p = ((2, 3), (2, 2), (1, 2), (1, 1), (0, 1), (0, 0)), corresponding to the snapshot set τ = {4, 2}. [sent-191, score-0.464]
63 Conditioned on σ = σ (j) , the posterior probability P (τ = τ (j) | σ = σ (j) , z·,· ) is equal to the posterior probability of the staircase walk pτ (j) under the hidden Markov model defined by Equations 4. [sent-206, score-0.289]
64 To sample a snapshot set from the conditional distribution in Equation 4. [sent-209, score-0.464]
65 2, we therefore sample staircase walks from the above HMM and convert the resulting samples to snapshot sets. [sent-210, score-0.673]
66 We claim that the complexity of sampling a snapshot set is also O(N 2 ). [sent-214, score-0.492]
67 The one exception is when the size of the snapshot set is one less than the number of events (K (j) = N − 1). [sent-225, score-0.682]
68 2 Note that to have so many snapshots for a single patient would be rare indeed. [sent-230, score-0.459]
69 5 Even when K (j) < N − 1, mixing times for the chain can be longer for larger snapshot sets (where K (j) is close to N − 1). [sent-231, score-0.503]
70 For example, when K (j) = N − 2, it is possible to show that the T th ordering in the Gibbs chain can differ from the (T + 1)th ordering by at most an adjacent swap. [sent-232, score-0.446]
71 For each patient in the cohort, use the inference algorithm described in Section 4. [sent-244, score-0.339]
72 Note that the sampled snapshot sets ({τ (j) }) do not play a role in the M-step described here, but can be used to estimate parameters for the more complex snapshot set distributions described in Section 5. [sent-252, score-0.928]
73 In clinical datasets, it is more conceivable that different biomarkers within a disease cascade change over different timescales, thus leading to higher positional variance for certain events and lower positional variance for others. [sent-259, score-0.756]
74 ([11]), in particular, proved that these hierarchical riffle independent models form a natural conjugate prior family for partial ranking likelihood functions and introduced efficient algorithms for conditioning on partial ranking observations. [sent-264, score-0.31]
75 It is finally interesting to note that it would not be trivial to use traditional Markov chains to capture the dependencies in the event sequence due to the fact that observations come in snapshot form instead of being indexed by time as they would be in an ordinary hidden Markov model. [sent-265, score-0.65]
76 (Left) first order staging diagram, the (e, q)th entry is the probability that event e has/will occur in position q. [sent-287, score-0.369]
77 (Right) posterior probability distribution over the position in the event ordering at which the patient snapshot was taken. [sent-288, score-1.251]
78 Setting 0 ≤ α < 1/2, however, reflects a prior bias for snapshots to have been taken earlier in the disease cascade, while setting 1/2 < α ≤ 1 reflects a prior bias for snapshots to have been taken later in the disease cascade. [sent-291, score-0.838]
79 Since we are interested in the ability of the model to recover the true central ranking, we evaluate based on the Kendall’s tau distance between the ground truth central ranking and the central rankings learned by our algorithms. [sent-294, score-0.331]
80 2(a) illustrates the results on a problem with N = 10 events and 250 patients (with K (j) set to be 1, 2, or 3 randomly for each patient) as cM AX varies between [0. [sent-297, score-0.341]
81 2(b) shows, as expected, that recovery rates for the central ordering improve as the number of patients increases. [sent-307, score-0.381]
82 2(c) shows example Gibbs trace plots with N = 20 events and varying sizes of the snapshot set, K (j) . [sent-312, score-0.682]
83 [8] (which assumes that all patients follow a single ordering σ ∗ ) by searching exhaustively over the collection of all 7! [sent-320, score-0.346]
84 We use a single Gaussian for each of the healthy and occurred measurement distributions (as described in [8]), assuming that all patients in the control group are healthy. [sent-330, score-0.341]
85 3 The results show that by allowing for the event ordering σ to vary across patients, the ALPACA model significantly outperforms the single ordering model (shown in the σ ∗ column) in BIC score with respect to all of the tried settings of α. [sent-331, score-0.653]
86 The optimal central ordering inferred by the Fonteijn model is: σ ∗ = adas|hippovol|hippoatrophy|brainatrophy|abeta|tau|brainvol, while ALPACA infers the central ordering: σ0 = adas|hippovol|abeta|hippoatrophy|tau|brainatrophy|brainvol. [sent-334, score-0.293]
87 Observe that the two event orderings are largely in agreement with each other with CSF Aβ42 and CSF tau events shifted to being earlier in the event ordering, which is more consistent with current thinking in neurology [12, 5, 1], which places the two CSF events first. [sent-335, score-1.004]
88 It is surprising that the hippocampal volume and atrophy events are inferred in both models to occur before the CSF events [13], but we believe that this may be due to the significant proportion of misdiagnosed patients in the data. [sent-337, score-0.741]
89 2(e) shows the patient staging result for an example patient from the ADNI data. [sent-341, score-0.754]
90 The left matrix visualizes the probability that each event will occur in each position of the event ordering given snapshot data from this patient, while the right histogram visualizes where in the event ordering the patient was situated when the snapshot was taken. [sent-342, score-2.303]
91 7 Conclusions We have developed the Alzheimer’s disease Probabilistic Cascades model for event ordering within the Alzheimer’s disease cascade. [sent-343, score-0.957]
92 In its most basic form, ALPACA is a simple model with generative semantics, allowing one to learn the central ordering of events that occur within a disease progression as well as to quantify the variance of this ordering across patients. [sent-344, score-1.158]
93 Our preliminary results show that relaxing the notion that a single ordering over events exists for all patients allows ALPACA to achieve a much better fit to snapshot data from a cohort of Alzheimer’s patients. [sent-345, score-1.117]
94 One of our main contributions is to show how the combinatorial structure of event ordering models can be exploited for algorithmic efficiency. [sent-346, score-0.409]
95 There may exist biomarkers for Alzheimer’s which are more effective than those considered in our current work for the purposes of patient staging. [sent-348, score-0.381]
96 Identifying such biomarker events remains an open question crucial to the success of data-driven models of disease cascades. [sent-349, score-0.514]
97 We have discussed several such possible extensions, from more general measurement models to more general riffle independent ordering models. [sent-351, score-0.305]
98 The alzheimer’s disease neuroimaging initiative: progress report and future plans. [sent-376, score-0.318]
99 s disease biomarkers in the alzheimer’s disease neuroimaging initiative cohort. [sent-394, score-0.682]
100 An event-based model for disease progression and its application in familial alzheimer’s disease and huntington’s disease. [sent-415, score-0.723]
wordName wordTfidf (topN-words)
[('snapshot', 0.464), ('patient', 0.314), ('alpaca', 0.304), ('disease', 0.274), ('ordering', 0.223), ('alzheimer', 0.221), ('events', 0.218), ('event', 0.186), ('staircase', 0.164), ('progression', 0.15), ('snapshots', 0.145), ('staging', 0.126), ('mallows', 0.123), ('patients', 0.123), ('ranking', 0.101), ('atrophy', 0.093), ('cohort', 0.089), ('csf', 0.089), ('tau', 0.088), ('occurred', 0.085), ('measurement', 0.082), ('gibbs', 0.074), ('orderings', 0.073), ('clinical', 0.068), ('adni', 0.068), ('biomarkers', 0.067), ('neurodegenerative', 0.067), ('adas', 0.063), ('cascades', 0.059), ('partial', 0.054), ('healthy', 0.051), ('fonteijn', 0.051), ('positional', 0.051), ('rif', 0.051), ('diagnosis', 0.048), ('jonathan', 0.048), ('walks', 0.045), ('neuroimaging', 0.044), ('posterior', 0.042), ('walk', 0.041), ('hmm', 0.041), ('mixing', 0.039), ('grid', 0.039), ('kendall', 0.039), ('abeta', 0.038), ('brainatrophy', 0.038), ('chealthy', 0.038), ('clifford', 0.038), ('coccurred', 0.038), ('hippoatrophy', 0.038), ('hippovol', 0.038), ('sampler', 0.037), ('rankings', 0.037), ('central', 0.035), ('neurology', 0.035), ('occur', 0.035), ('pg', 0.034), ('aisen', 0.033), ('shaw', 0.033), ('ronald', 0.032), ('bic', 0.032), ('yt', 0.032), ('petersen', 0.031), ('leslie', 0.029), ('jack', 0.029), ('alzheimers', 0.029), ('hippocampal', 0.029), ('sampling', 0.028), ('cascade', 0.027), ('measurements', 0.026), ('huang', 0.025), ('brainvol', 0.025), ('familial', 0.025), ('jagust', 0.025), ('laurel', 0.025), ('misdiagnosed', 0.025), ('oversimpli', 0.025), ('rachael', 0.025), ('sarah', 0.025), ('scahill', 0.025), ('schott', 0.025), ('tgibbs', 0.025), ('trojanowski', 0.025), ('inference', 0.025), ('draw', 0.023), ('initiative', 0.023), ('xt', 0.023), ('conditioned', 0.022), ('neurological', 0.022), ('biomarker', 0.022), ('pathology', 0.022), ('position', 0.022), ('ecting', 0.022), ('john', 0.022), ('universal', 0.021), ('worst', 0.021), ('subjects', 0.021), ('michael', 0.021), ('score', 0.021), ('gracefully', 0.021)]
simIndex simValue paperId paperTitle
same-paper 1 1.000001 276 nips-2012-Probabilistic Event Cascades for Alzheimer's disease
Author: Jonathan Huang, Daniel Alexander
Abstract: Accurate and detailed models of neurodegenerative disease progression are crucially important for reliable early diagnosis and the determination of effective treatments. We introduce the ALPACA (Alzheimer’s disease Probabilistic Cascades) model, a generative model linking latent Alzheimer’s progression dynamics to observable biomarker data. In contrast with previous works which model disease progression as a fixed event ordering, we explicitly model the variability over such orderings among patients which is more realistic, particularly for highly detailed progression models. We describe efficient learning algorithms for ALPACA and discuss promising experimental results on a real cohort of Alzheimer’s patients from the Alzheimer’s Disease Neuroimaging Initiative. 1
2 0.22731018 266 nips-2012-Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task
Author: Jenna Wiens, Eric Horvitz, John V. Guttag
Abstract: A patient’s risk for adverse events is affected by temporal processes including the nature and timing of diagnostic and therapeutic activities, and the overall evolution of the patient’s pathophysiology over time. Yet many investigators ignore this temporal aspect when modeling patient outcomes, considering only the patient’s current or aggregate state. In this paper, we represent patient risk as a time series. In doing so, patient risk stratification becomes a time-series classification task. The task differs from most applications of time-series analysis, like speech processing, since the time series itself must first be extracted. Thus, we begin by defining and extracting approximate risk processes, the evolving approximate daily risk of a patient. Once obtained, we use these signals to explore different approaches to time-series classification with the goal of identifying high-risk patterns. We apply the classification to the specific task of identifying patients at risk of testing positive for hospital acquired Clostridium difficile. We achieve an area under the receiver operating characteristic curve of 0.79 on a held-out set of several hundred patients. Our two-stage approach to risk stratification outperforms classifiers that consider only a patient’s current state (p<0.05). 1
Author: Hua Wang, Feiping Nie, Heng Huang, Jingwen Yan, Sungeun Kim, Shannon Risacher, Andrew Saykin, Li Shen
Abstract: Alzheimer’s disease (AD) is a neurodegenerative disorder characterized by progressive impairment of memory and other cognitive functions. Regression analysis has been studied to relate neuroimaging measures to cognitive status. However, whether these measures have further predictive power to infer a trajectory of cognitive performance over time is still an under-explored but important topic in AD research. We propose a novel high-order multi-task learning model to address this issue. The proposed model explores the temporal correlations existing in imaging and cognitive data by structured sparsity-inducing norms. The sparsity of the model enables the selection of a small number of imaging measures while maintaining high prediction accuracy. The empirical studies, using the longitudinal imaging and cognitive data of the ADNI cohort, have yielded promising results.
4 0.12959923 53 nips-2012-Bayesian Pedigree Analysis using Measure Factorization
Author: Bonnie Kirkpatrick, Alexandre Bouchard-côté
Abstract: Pedigrees, or family trees, are directed graphs used to identify sites of the genome that are correlated with the presence or absence of a disease. With the advent of genotyping and sequencing technologies, there has been an explosion in the amount of data available, both in the number of individuals and in the number of sites. Some pedigrees number in the thousands of individuals. Meanwhile, analysis methods have remained limited to pedigrees of < 100 individuals which limits analyses to many small independent pedigrees. Disease models, such those used for the linkage analysis log-odds (LOD) estimator, have similarly been limited. This is because linkage analysis was originally designed with a different task in mind, that of ordering the sites in the genome, before there were technologies that could reveal the order. LODs are difficult to interpret and nontrivial to extend to consider interactions among sites. These developments and difficulties call for the creation of modern methods of pedigree analysis. Drawing from recent advances in graphical model inference and transducer theory, we introduce a simple yet powerful formalism for expressing genetic disease models. We show that these disease models can be turned into accurate and computationally efficient estimators. The technique we use for constructing the variational approximation has potential applications to inference in other large-scale graphical models. This method allows inference on larger pedigrees than previously analyzed in the literature, which improves disease site prediction. 1
5 0.10507084 273 nips-2012-Predicting Action Content On-Line and in Real Time before Action Onset – an Intracranial Human Study
Author: Uri Maoz, Shengxuan Ye, Ian Ross, Adam Mamelak, Christof Koch
Abstract: The ability to predict action content from neural signals in real time before the action occurs has been long sought in the neuroscientific study of decision-making, agency and volition. On-line real-time (ORT) prediction is important for understanding the relation between neural correlates of decision-making and conscious, voluntary action as well as for brain-machine interfaces. Here, epilepsy patients, implanted with intracranial depth microelectrodes or subdural grid electrodes for clinical purposes, participated in a “matching-pennies” game against an opponent. In each trial, subjects were given a 5 s countdown, after which they had to raise their left or right hand immediately as the “go” signal appeared on a computer screen. They won a fixed amount of money if they raised a different hand than their opponent and lost that amount otherwise. The question we here studied was the extent to which neural precursors of the subjects’ decisions can be detected in intracranial local field potentials (LFP) prior to the onset of the action. We found that combined low-frequency (0.1–5 Hz) LFP signals from 10 electrodes were predictive of the intended left-/right-hand movements before the onset of the go signal. Our ORT system predicted which hand the patient would raise 0.5 s before the go signal with 68±3% accuracy in two patients. Based on these results, we constructed an ORT system that tracked up to 30 electrodes simultaneously, and tested it on retrospective data from 7 patients. On average, we could predict the correct hand choice in 83% of the trials, which rose to 92% if we let the system drop 3/10 of the trials on which it was less confident. Our system demonstrates— for the first time—the feasibility of accurately predicting a binary action on single trials in real time for patients with intracranial recordings, well before the action occurs. 1 1
6 0.099608473 356 nips-2012-Unsupervised Structure Discovery for Semantic Analysis of Audio
7 0.096671395 169 nips-2012-Label Ranking with Partial Abstention based on Thresholded Probabilistic Models
8 0.088683881 182 nips-2012-Learning Networks of Heterogeneous Influence
9 0.087713473 363 nips-2012-Wavelet based multi-scale shape features on arbitrary surfaces for cortical thickness discrimination
10 0.083590977 284 nips-2012-Q-MKL: Matrix-induced Regularization in Multi-Kernel Learning with Applications to Neuroimaging
11 0.074872874 219 nips-2012-Modelling Reciprocating Relationships with Hawkes Processes
12 0.071542658 165 nips-2012-Iterative ranking from pair-wise comparisons
13 0.064872779 278 nips-2012-Probabilistic n-Choose-k Models for Classification and Ranking
14 0.063849851 205 nips-2012-MCMC for continuous-time discrete-state systems
15 0.059958443 323 nips-2012-Statistical Consistency of Ranking Methods in A Rank-Differentiable Probability Space
16 0.059189178 252 nips-2012-On Multilabel Classification and Ranking with Partial Feedback
17 0.058788404 286 nips-2012-Random Utility Theory for Social Choice
18 0.057587411 141 nips-2012-GenDeR: A Generic Diversified Ranking Algorithm
19 0.057221662 115 nips-2012-Efficient high dimensional maximum entropy modeling via symmetric partition functions
20 0.054826468 138 nips-2012-Fully Bayesian inference for neural models with negative-binomial spiking
topicId topicWeight
[(0, 0.132), (1, 0.009), (2, 0.008), (3, 0.057), (4, -0.048), (5, -0.056), (6, 0.001), (7, 0.057), (8, -0.024), (9, 0.079), (10, -0.043), (11, -0.006), (12, -0.014), (13, -0.04), (14, 0.012), (15, -0.081), (16, 0.012), (17, -0.011), (18, 0.012), (19, -0.113), (20, 0.074), (21, 0.067), (22, -0.199), (23, -0.067), (24, 0.11), (25, -0.186), (26, -0.012), (27, 0.13), (28, -0.098), (29, 0.089), (30, 0.122), (31, -0.117), (32, -0.064), (33, 0.069), (34, -0.045), (35, -0.059), (36, 0.114), (37, -0.095), (38, -0.019), (39, -0.049), (40, -0.075), (41, 0.152), (42, -0.096), (43, 0.058), (44, 0.193), (45, -0.055), (46, -0.077), (47, 0.007), (48, 0.055), (49, -0.104)]
simIndex simValue paperId paperTitle
same-paper 1 0.93230665 276 nips-2012-Probabilistic Event Cascades for Alzheimer's disease
Author: Jonathan Huang, Daniel Alexander
Abstract: Accurate and detailed models of neurodegenerative disease progression are crucially important for reliable early diagnosis and the determination of effective treatments. We introduce the ALPACA (Alzheimer’s disease Probabilistic Cascades) model, a generative model linking latent Alzheimer’s progression dynamics to observable biomarker data. In contrast with previous works which model disease progression as a fixed event ordering, we explicitly model the variability over such orderings among patients which is more realistic, particularly for highly detailed progression models. We describe efficient learning algorithms for ALPACA and discuss promising experimental results on a real cohort of Alzheimer’s patients from the Alzheimer’s Disease Neuroimaging Initiative. 1
2 0.64847147 53 nips-2012-Bayesian Pedigree Analysis using Measure Factorization
Author: Bonnie Kirkpatrick, Alexandre Bouchard-côté
Abstract: Pedigrees, or family trees, are directed graphs used to identify sites of the genome that are correlated with the presence or absence of a disease. With the advent of genotyping and sequencing technologies, there has been an explosion in the amount of data available, both in the number of individuals and in the number of sites. Some pedigrees number in the thousands of individuals. Meanwhile, analysis methods have remained limited to pedigrees of < 100 individuals which limits analyses to many small independent pedigrees. Disease models, such those used for the linkage analysis log-odds (LOD) estimator, have similarly been limited. This is because linkage analysis was originally designed with a different task in mind, that of ordering the sites in the genome, before there were technologies that could reveal the order. LODs are difficult to interpret and nontrivial to extend to consider interactions among sites. These developments and difficulties call for the creation of modern methods of pedigree analysis. Drawing from recent advances in graphical model inference and transducer theory, we introduce a simple yet powerful formalism for expressing genetic disease models. We show that these disease models can be turned into accurate and computationally efficient estimators. The technique we use for constructing the variational approximation has potential applications to inference in other large-scale graphical models. This method allows inference on larger pedigrees than previously analyzed in the literature, which improves disease site prediction. 1
3 0.61134905 266 nips-2012-Patient Risk Stratification for Hospital-Associated C. diff as a Time-Series Classification Task
Author: Jenna Wiens, Eric Horvitz, John V. Guttag
Abstract: A patient’s risk for adverse events is affected by temporal processes including the nature and timing of diagnostic and therapeutic activities, and the overall evolution of the patient’s pathophysiology over time. Yet many investigators ignore this temporal aspect when modeling patient outcomes, considering only the patient’s current or aggregate state. In this paper, we represent patient risk as a time series. In doing so, patient risk stratification becomes a time-series classification task. The task differs from most applications of time-series analysis, like speech processing, since the time series itself must first be extracted. Thus, we begin by defining and extracting approximate risk processes, the evolving approximate daily risk of a patient. Once obtained, we use these signals to explore different approaches to time-series classification with the goal of identifying high-risk patterns. We apply the classification to the specific task of identifying patients at risk of testing positive for hospital acquired Clostridium difficile. We achieve an area under the receiver operating characteristic curve of 0.79 on a held-out set of several hundred patients. Our two-stage approach to risk stratification outperforms classifiers that consider only a patient’s current state (p<0.05). 1
Author: Hua Wang, Feiping Nie, Heng Huang, Jingwen Yan, Sungeun Kim, Shannon Risacher, Andrew Saykin, Li Shen
Abstract: Alzheimer’s disease (AD) is a neurodegenerative disorder characterized by progressive impairment of memory and other cognitive functions. Regression analysis has been studied to relate neuroimaging measures to cognitive status. However, whether these measures have further predictive power to infer a trajectory of cognitive performance over time is still an under-explored but important topic in AD research. We propose a novel high-order multi-task learning model to address this issue. The proposed model explores the temporal correlations existing in imaging and cognitive data by structured sparsity-inducing norms. The sparsity of the model enables the selection of a small number of imaging measures while maintaining high prediction accuracy. The empirical studies, using the longitudinal imaging and cognitive data of the ADNI cohort, have yielded promising results.
5 0.49502882 363 nips-2012-Wavelet based multi-scale shape features on arbitrary surfaces for cortical thickness discrimination
Author: Won H. Kim, Deepti Pachauri, Charles Hatt, Moo. K. Chung, Sterling Johnson, Vikas Singh
Abstract: Hypothesis testing on signals defined on surfaces (such as the cortical surface) is a fundamental component of a variety of studies in Neuroscience. The goal here is to identify regions that exhibit changes as a function of the clinical condition under study. As the clinical questions of interest move towards identifying very early signs of diseases, the corresponding statistical differences at the group level invariably become weaker and increasingly hard to identify. Indeed, after a multiple comparisons correction is adopted (to account for correlated statistical tests over all surface points), very few regions may survive. In contrast to hypothesis tests on point-wise measurements, in this paper, we make the case for performing statistical analysis on multi-scale shape descriptors that characterize the local topological context of the signal around each surface vertex. Our descriptors are based on recent results from harmonic analysis, that show how wavelet theory extends to non-Euclidean settings (i.e., irregular weighted graphs). We provide strong evidence that these descriptors successfully pick up group-wise differences, where traditional methods either fail or yield unsatisfactory results. Other than this primary application, we show how the framework allows performing cortical surface smoothing in the native space without mappint to a unit sphere. 1
6 0.45940971 46 nips-2012-Assessing Blinding in Clinical Trials
7 0.44853488 219 nips-2012-Modelling Reciprocating Relationships with Hawkes Processes
8 0.41615447 356 nips-2012-Unsupervised Structure Discovery for Semantic Analysis of Audio
10 0.38041151 299 nips-2012-Scalable imputation of genetic data with a discrete fragmentation-coagulation process
11 0.37449431 182 nips-2012-Learning Networks of Heterogeneous Influence
12 0.34341162 169 nips-2012-Label Ranking with Partial Abstention based on Thresholded Probabilistic Models
13 0.34205946 286 nips-2012-Random Utility Theory for Social Choice
14 0.33976024 155 nips-2012-Human memory search as a random walk in a semantic network
15 0.29925585 137 nips-2012-From Deformations to Parts: Motion-based Segmentation of 3D Objects
16 0.29655594 157 nips-2012-Identification of Recurrent Patterns in the Activation of Brain Networks
17 0.28323203 205 nips-2012-MCMC for continuous-time discrete-state systems
18 0.28256723 115 nips-2012-Efficient high dimensional maximum entropy modeling via symmetric partition functions
19 0.27595615 72 nips-2012-Cocktail Party Processing via Structured Prediction
20 0.26752126 141 nips-2012-GenDeR: A Generic Diversified Ranking Algorithm
topicId topicWeight
[(0, 0.031), (11, 0.02), (17, 0.011), (21, 0.025), (38, 0.106), (39, 0.011), (42, 0.02), (54, 0.032), (55, 0.014), (74, 0.041), (76, 0.165), (80, 0.083), (92, 0.039), (94, 0.327)]
simIndex simValue paperId paperTitle
1 0.96290219 362 nips-2012-Waveform Driven Plasticity in BiFeO3 Memristive Devices: Model and Implementation
Author: Christian Mayr, Paul Stärke, Johannes Partzsch, Love Cederstroem, Rene Schüffny, Yao Shuai, Nan Du, Heidemarie Schmidt
Abstract: Memristive devices have recently been proposed as efficient implementations of plastic synapses in neuromorphic systems. The plasticity in these memristive devices, i.e. their resistance change, is defined by the applied waveforms. This behavior resembles biological synapses, whose plasticity is also triggered by mechanisms that are determined by local waveforms. However, learning in memristive devices has so far been approached mostly on a pragmatic technological level. The focus seems to be on finding any waveform that achieves spike-timing-dependent plasticity (STDP), without regard to the biological veracity of said waveforms or to further important forms of plasticity. Bridging this gap, we make use of a plasticity model driven by neuron waveforms that explains a large number of experimental observations and adapt it to the characteristics of the recently introduced BiFeO3 memristive material. Based on this approach, we show STDP for the first time for this material, with learning window replication superior to previous memristor-based STDP implementations. We also demonstrate in measurements that it is possible to overlay short and long term plasticity at a memristive device in the form of the well-known triplet plasticity. To the best of our knowledge, this is the first implementations of triplet plasticity on any physical memristive device. 1
same-paper 2 0.78553462 276 nips-2012-Probabilistic Event Cascades for Alzheimer's disease
Author: Jonathan Huang, Daniel Alexander
Abstract: Accurate and detailed models of neurodegenerative disease progression are crucially important for reliable early diagnosis and the determination of effective treatments. We introduce the ALPACA (Alzheimer’s disease Probabilistic Cascades) model, a generative model linking latent Alzheimer’s progression dynamics to observable biomarker data. In contrast with previous works which model disease progression as a fixed event ordering, we explicitly model the variability over such orderings among patients which is more realistic, particularly for highly detailed progression models. We describe efficient learning algorithms for ALPACA and discuss promising experimental results on a real cohort of Alzheimer’s patients from the Alzheimer’s Disease Neuroimaging Initiative. 1
3 0.71589231 79 nips-2012-Compressive neural representation of sparse, high-dimensional probabilities
Author: Xaq Pitkow
Abstract: This paper shows how sparse, high-dimensional probability distributions could be represented by neurons with exponential compression. The representation is a novel application of compressive sensing to sparse probability distributions rather than to the usual sparse signals. The compressive measurements correspond to expected values of nonlinear functions of the probabilistically distributed variables. When these expected values are estimated by sampling, the quality of the compressed representation is limited only by the quality of sampling. Since the compression preserves the geometric structure of the space of sparse probability distributions, probabilistic computation can be performed in the compressed domain. Interestingly, functions satisfying the requirements of compressive sensing can be implemented as simple perceptrons. If we use perceptrons as a simple model of feedforward computation by neurons, these results show that the mean activity of a relatively small number of neurons can accurately represent a highdimensional joint distribution implicitly, even without accounting for any noise correlations. This comprises a novel hypothesis for how neurons could encode probabilities in the brain. 1
4 0.69922972 357 nips-2012-Unsupervised Template Learning for Fine-Grained Object Recognition
Author: Shulin Yang, Liefeng Bo, Jue Wang, Linda G. Shapiro
Abstract: Fine-grained recognition refers to a subordinate level of recognition, such as recognizing different species of animals and plants. It differs from recognition of basic categories, such as humans, tables, and computers, in that there are global similarities in shape and structure shared cross different categories, and the differences are in the details of object parts. We suggest that the key to identifying the fine-grained differences lies in finding the right alignment of image regions that contain the same object parts. We propose a template model for the purpose, which captures common shape patterns of object parts, as well as the cooccurrence relation of the shape patterns. Once the image regions are aligned, extracted features are used for classification. Learning of the template model is efficient, and the recognition results we achieve significantly outperform the stateof-the-art algorithms. 1
5 0.67416567 68 nips-2012-Clustering Aggregation as Maximum-Weight Independent Set
Author: Nan Li, Longin J. Latecki
Abstract: We formulate clustering aggregation as a special instance of Maximum-Weight Independent Set (MWIS) problem. For a given dataset, an attributed graph is constructed from the union of the input clusterings generated by different underlying clustering algorithms with different parameters. The vertices, which represent the distinct clusters, are weighted by an internal index measuring both cohesion and separation. The edges connect the vertices whose corresponding clusters overlap. Intuitively, an optimal aggregated clustering can be obtained by selecting an optimal subset of non-overlapping clusters partitioning the dataset together. We formalize this intuition as the MWIS problem on the attributed graph, i.e., finding the heaviest subset of mutually non-adjacent vertices. This MWIS problem exhibits a special structure. Since the clusters of each input clustering form a partition of the dataset, the vertices corresponding to each clustering form a maximal independent set (MIS) in the attributed graph. We propose a variant of simulated annealing method that takes advantage of this special structure. Our algorithm starts from each MIS, which is close to a distinct local optimum of the MWIS problem, and utilizes a local search heuristic to explore its neighborhood in order to find the MWIS. Extensive experiments on many challenging datasets show that: 1. our approach to clustering aggregation automatically decides the optimal number of clusters; 2. it does not require any parameter tuning for the underlying clustering algorithms; 3. it can combine the advantages of different underlying clustering algorithms to achieve superior performance; 4. it is robust against moderate or even bad input clusterings. 1
6 0.63966542 364 nips-2012-Weighted Likelihood Policy Search with Model Selection
9 0.59112549 363 nips-2012-Wavelet based multi-scale shape features on arbitrary surfaces for cortical thickness discrimination
10 0.58307225 63 nips-2012-CPRL -- An Extension of Compressive Sensing to the Phase Retrieval Problem
11 0.56293768 318 nips-2012-Sparse Approximate Manifolds for Differential Geometric MCMC
12 0.5617187 209 nips-2012-Max-Margin Structured Output Regression for Spatio-Temporal Action Localization
13 0.56166381 203 nips-2012-Locating Changes in Highly Dependent Data with Unknown Number of Change Points
15 0.55967712 280 nips-2012-Proper losses for learning from partial labels
16 0.55965781 74 nips-2012-Collaborative Gaussian Processes for Preference Learning
17 0.55965322 188 nips-2012-Learning from Distributions via Support Measure Machines
18 0.55962288 277 nips-2012-Probabilistic Low-Rank Subspace Clustering
19 0.55928862 111 nips-2012-Efficient Sampling for Bipartite Matching Problems
20 0.55913824 353 nips-2012-Transferring Expectations in Model-based Reinforcement Learning