105 research outputs found

    Two-stage clustering in genotype-by-environment analyses with missing data

    Get PDF
    Cluster analysis has been commonly used in genotype-by-environment (G x E) analyses, but current methods are inadequate when the data matrix is incomplete. This paper proposes a new method, referred to as two-stage clustering, which relies on a partitioning of squared Euclidean distance into two independent components, the G x E interaction and the genotype main effect. These components are used in the first and second stages of clustering respectively. Two-stage clustering forms the basis for imputing missing values in the G x E matrix so that a more complete data array is available for other GxE analyses. Imputation for a given genotype uses information from genotypes with similar interaction profiles. This imputation method is shown to improve on an existing nearest cluster method that confounds the G x E interaction and the genotype main effect

    Latent cluster analysis of ALS phenotypes identifies prognostically differing groups

    Get PDF
    BACKGROUND Amyotrophic lateral sclerosis (ALS) is a degenerative disease predominantly affecting motor neurons and manifesting as several different phenotypes. Whether these phenotypes correspond to different underlying disease processes is unknown. We used latent cluster analysis to identify groupings of clinical variables in an objective and unbiased way to improve phenotyping for clinical and research purposes. METHODS Latent class cluster analysis was applied to a large database consisting of 1467 records of people with ALS, using discrete variables which can be readily determined at the first clinic appointment. The model was tested for clinical relevance by survival analysis of the phenotypic groupings using the Kaplan-Meier method. RESULTS The best model generated five distinct phenotypic classes that strongly predicted survival (p<0.0001). Eight variables were used for the latent class analysis, but a good estimate of the classification could be obtained using just two variables: site of first symptoms (bulbar or limb) and time from symptom onset to diagnosis (p<0.00001). CONCLUSION The five phenotypic classes identified using latent cluster analysis can predict prognosis. They could be used to stratify patients recruited into clinical trials and generating more homogeneous disease groups for genetic, proteomic and risk factor research

    RNAseq Analyses Identify Tumor Necrosis Factor-Mediated Inflammation as a Major Abnormality in ALS Spinal Cord

    Get PDF
    ALS is a rapidly progressive, devastating neurodegenerative illness of adults that produces disabling weakness and spasticity arising from death of lower and upper motor neurons. No meaningful therapies exist to slow ALS progression, and molecular insights into pathogenesis and progression are sorely needed. In that context, we used high-depth, next generation RNA sequencing (RNAseq, Illumina) to define gene network abnormalities in RNA samples depleted of rRNA and isolated from cervical spinal cord sections of 7 ALS and 8 CTL samples. We aligned \u3e50 million 2X150 bp paired-end sequences/sample to the hg19 human genome and applied three different algorithms (Cuffdiff2, DEseq2, EdgeR) for identification of differentially expressed genes (DEG’s). Ingenuity Pathways Analysis (IPA) and Weighted Gene Co-expression Network Analysis (WGCNA) identified inflammatory processes as significantly elevated in our ALS samples, with tumor necrosis factor (TNF) found to be a major pathway regulator (IPA) and TNFΞ±-induced protein 2 (TNFAIP2) as a major network β€œhub” gene (WGCNA). Using the oPOSSUM algorithm, we analyzed transcription factors (TF) controlling expression of the nine DEG/hub genes in the ALS samples and identified TF’s involved in inflammation (NFkB, REL, NFkB1) and macrophage function (NR1H2::RXRA heterodimer). Transient expression in human iPSC-derived motor neurons of TNFAIP2 (also a DEG identified by all three algorithms) reduced cell viability and induced caspase 3/7 activation. Using high-density RNAseq, multiple algorithms for DEG identification, and an unsupervised gene co-expression network approach, we identified significant elevation of inflammatory processes in ALS spinal cord with TNF as a major regulatory molecule. Overexpression of the DEG TNFAIP2 in human motor neurons, the population most vulnerable to die in ALS, increased cell death and caspase 3/7 activation. We propose that therapies targeted to reduce inflammatory TNFΞ± signaling may be helpful in ALS patients

    Amyotrophic Lateral Sclerosis Multiprotein Biomarkers in Peripheral Blood Mononuclear Cells

    Get PDF
    Amyotrophic lateral sclerosis (ALS) is a fatal progressive motor neuron disease, for which there are still no diagnostic/prognostic test and therapy. Specific molecular biomarkers are urgently needed to facilitate clinical studies and speed up the development of effective treatments.We used a two-dimensional difference in gel electrophoresis approach to identify in easily accessible clinical samples, peripheral blood mononuclear cells (PBMC), a panel of protein biomarkers that are closely associated with ALS. Validations and a longitudinal study were performed by immunoassays on a selected number of proteins. The same proteins were also measured in PBMC and spinal cord of a G93A SOD1 transgenic rat model. We identified combinations of protein biomarkers that can distinguish, with high discriminatory power, ALS patients from healthy controls (98%), and from patients with neurological disorders that may resemble ALS (91%), between two levels of disease severity (90%), and a number of translational biomarkers, that link responses between human and animal model. We demonstrated that TDP-43, cyclophilin A and ERp57 associate with disease progression in a longitudinal study. Moreover, the protein profile changes detected in peripheral blood mononuclear cells of ALS patients are suggestive of possible intracellular pathogenic mechanisms such as endoplasmic reticulum stress, nitrative stress, disturbances in redox regulation and RNA processing.Our results indicate that PBMC multiprotein biomarkers could contribute to determine amyotrophic lateral sclerosis diagnosis, differential diagnosis, disease severity and progression, and may help to elucidate pathogenic mechanisms

    Plasma Neurofilament Heavy Chain Levels Correlate to Markers of Late Stage Disease Progression and Treatment Response in SOD1(G93A) Mice that Model ALS

    Get PDF
    Background: Amyotrophic lateral sclerosis (ALS) is an incurable neurodegenerative disorder characterised by progressive degeneration of motor neurons leading to death, typically within 3–5 years of symptom onset. The diagnosis of ALS is largely reliant on clinical assessment and electrophysiological findings. Neither specific investigative tools nor reliable biomarkers are currently available to enable an early diagnosis or monitoring of disease progression, hindering the design of treatment trials. Methodology/Principal Findings: In this study, using the well-established SOD1G93A mouse model of ALS and a new in-house ELISA method, we have validated that plasma neurofilament heavy chain protein (NfH) levels correlate with both functional markers of late stage disease progression and treatment response. We detected a significant increase in plasma levels of phosphorylated NfH during disease progression in SOD1G93A mice from 105 days onwards. Moreover, increased plasma NfH levels correlated with the decline in muscle force, motor unit survival and, more significantly, with the loss of spinal motor neurons in SOD1 mice during this critical period of decline. Importantly, mice treated with the disease modifying compound arimoclomol had lower plasma NfH levels, suggesting plasma NfH levels could be validated as an outcome measure for treatment trials. Conclusions/Significance: These results show that plasma NfH levels closely reflect later stages of disease progression and therapeutic response in the SOD1G93A mouse model of ALS and may potentially be a valuable biomarker of later disease progression in ALS

    Error Rate Estimation On the Basis of Posterior Probabilities

    No full text
    The so-called posterior probability estimator, e, formed by averaging the minimum of the posterior probabilities over a set of initial or additional observations (which need not be classified) is considered in the context of estimating the overall actual error rate for the linear discriminant function appropriate for two multivariate normal populations with a common covariance matrix. The bias of e is examined by deriving asymptotic approximations under three different models, the normal, logistic, and mixture models. The properties of e are investigated further by a series of simulation experiments for the logistic and mixture models for which there are few other available estimators

    A Comparison of the Mixture and Classification Approaches to Cluster-Analysis

    No full text
    This paper examines the relative performance of two commonly used clustering methods based on maximum likelihood in the context of classifying a sample of observations of unknown origin arising from two normal populations with a common covariance matrix. The associated properties of the two methods are compared by conducting a series of simulation experiments under both mixture and separate sampling schemes
    • …
    corecore