106 research outputs found

    Orthogonal rotation in PCAMIX

    Full text link
    Kiers (1991) considered the orthogonal rotation in PCAMIX, a principal component method for a mixture of qualitative and quantitative variables. PCAMIX includes the ordinary principal component analysis (PCA) and multiple correspondence analysis (MCA) as special cases. In this paper, we give a new presentation of PCAMIX where the principal components and the squared loadings are obtained from a Singular Value Decomposition. The loadings of the quantitative variables and the principal coordinates of the categories of the qualitative variables are also obtained directly. In this context, we propose a computationaly efficient procedure for varimax rotation in PCAMIX and a direct solution for the optimal angle of rotation. A simulation study shows the good computational behavior of the proposed algorithm. An application on a real data set illustrates the interest of using rotation in MCA. All source codes are available in the R package "PCAmixdata"

    Eigengene networks for studying the relationships between co-expression modules

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>There is evidence that genes and their protein products are organized into functional modules according to cellular processes and pathways. Gene co-expression networks have been used to describe the relationships between gene transcripts. Ample literature exists on how to detect biologically meaningful modules in networks but there is a need for methods that allow one to study the relationships between modules.</p> <p>Results</p> <p>We show that network methods can also be used to describe the relationships between co-expression modules and present the following methodology. First, we describe several methods for detecting modules that are shared by two or more networks (referred to as consensus modules). We represent the gene expression profiles of each module by an eigengene. Second, we propose a method for constructing an eigengene network, where the edges are undirected but maintain information on the sign of the co-expression information. Third, we propose methods for differential eigengene network analysis that allow one to assess the preservation of network properties across different data sets. We illustrate the value of eigengene networks in studying the relationships between consensus modules in human and chimpanzee brains; the relationships between consensus modules in brain, muscle, liver, and adipose mouse tissues; and the relationships between male-female mouse consensus modules and clinical traits. In some applications, we find that module eigengenes can be organized into higher level clusters which we refer to as meta-modules.</p> <p>Conclusion</p> <p>Eigengene networks can be effective and biologically meaningful tools for studying the relationships between modules of a gene co-expression network. The proposed methods may reveal a higher order organization of the transcriptome. R software tutorials, the data, and supplementary material can be found at the following webpage: <url>http://www.genetics.ucla.edu/labs/horvath/CoexpressionNetwork/EigengeneNetwork</url>.</p

    In silico approach to screen compounds active against parasitic nematodes of major socio-economic importance

    Get PDF
    Infections due to parasitic nematodes are common causes of morbidity and fatality around the world especially in developing nations. At present however, there are only three major classes of drugs for treating human nematode infections. Additionally the scientific knowledge on the mechanism of action and the reason for the resistance to these drugs is poorly understood. Commercial incentives to design drugs that are endemic to developing countries are limited therefore, virtual screening in academic settings can play a vital role is discovering novel drugs useful against neglected diseases. In this study we propose to build robust machine learning model to classify and screen compounds active against parasitic nematodes.A set of compounds active against parasitic nematodes were collated from various literature sources including PubChem while the inactive set was derived from DrugBank database. The support vector machine (SVM) algorithm was used for model development, and stratified ten-fold cross validation was used to evaluate the performance of each classifier. The best results were obtained using the radial basis function kernel. The SVM method achieved an accuracy of 81.79% on an independent test set. Using the model developed above, we were able to indentify novel compounds with potential anthelmintic activity.In this study, we successfully present the SVM approach for predicting compounds active against parasitic nematodes which suggests the effectiveness of computational approaches for antiparasitic drug discovery. Although, the accuracy obtained is lower than the previously reported in a similar study but we believe that our model is more robust because we intentionally employed stringent criteria to select inactive dataset thus making it difficult for the model to classify compounds. The method presents an alternative approach to the existing traditional methods and may be useful for predicting hitherto novel anthelmintic compounds.12 page(s

    The Self-Assessment Scale of Cognitive Complaints in Schizophrenia: A validation study in Tunisian population

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Despite a huge well-documented literature on cognitive deficits in schizophrenia, little is known about the own perception of patients regarding their cognitive functioning. The purpose of our study was to create a scale to collect subjective cognitive complaints of patients suffering from schizophrenia with Tunisian Arabic dialect as mother tongue and to proceed to a validation study of this scale.</p> <p>Methods</p> <p>The authors constructed the Self-Assessment Scale of Cognitive Complaints in Schizophrenia (SASCCS) based on a questionnaire covering five cognitive domains which are the most frequently reported in the literature to be impaired in schizophrenia. The scale consisted of 21 likert-type questions dealing with memory, attention, executive functions, language and praxia. In a second time, the authors proceeded to the study of psychometric qualities of the scale among 105 patients suffering from schizophrenia spectrum disorders (based on DSM- IV criteria). Patients were evaluated using the Positive and Negative Syndrome Scale (PANSS), the Global Assessment Functioning Scale (GAF scale) and the Calgary Depression Scale (CDS).</p> <p>Results</p> <p>The scale's reliability was proven to be good through Cronbach alpha coefficient equal to 0.85 and showing its good internal consistency. The intra-class correlation coefficient at 11 weeks was equal to 0.77 suggesting a good stability over time. Principal component analysis with Oblimin rotation was performed and yielded to six factors accounting for 58.28% of the total variance of the scale.</p> <p>Conclusion</p> <p>Given the good psychometric properties that have been revealed in this study, the SASCCS seems to be reliable to measure schizophrenic patients' perception of their own cognitive impairment. This kind of evaluation can't substitute for objective measures of cognitive performances in schizophrenia. The purpose of such an evaluation is to permit to the patient to express his own well-being and satisfaction of quality of life.</p

    The Glasgow Norms:Ratings of 5,500 words on nine scales

    Get PDF
    The Glasgow Norms are a set of normative ratings for 5,553 English words on nine psycholinguistic dimensions: arousal, valence, dominance, concreteness, imageability, familiarity, age of acquisition, semantic size, and gender association. The Glasgow Norms are unique in several respects. First, the corpus itself is relatively large, while simultaneously providing norms across a substantial number of lexical dimensions. Second, for any given subset of words, the same participants provided ratings across all nine dimensions (33 participants/word, on average). Third, two novel dimensions—semantic size and gender association—are included. Finally, the corpus contains a set of 379 ambiguous words that are presented either alone (e.g., toast) or with information that selects an alternative sense (e.g., toast (bread), toast (speech)). The relationships between the dimensions of the Glasgow Norms were initially investigated by assessing their correlations. In addition, a principal component analysis revealed four main factors, accounting for 82% of the variance (Visualization, Emotion, Salience, and Exposure). The validity of the Glasgow Norms was established via comparisons of our ratings to 18 different sets of current psycholinguistic norms. The dimension of size was tested with megastudy data, confirming findings from past studies that have explicitly examined this variable. Alternative senses of ambiguous words (i.e., disambiguated forms), when discordant on a given dimension, seemingly led to appropriately distinct ratings. Informal comparisons between the ratings of ambiguous words and of their alternative senses showed different patterns that likely depended on several factors (the number of senses, their relative strengths, and the rating scales themselves). Overall, the Glasgow Norms provide a valuable resource—in particular, for researchers investigating the role of word recognition in language comprehension

    Changing expression of vertebrate immunity genes in an anthropogenic environment: a controlled experiment

    Get PDF
    Background: The effect of anthropogenic environments on the function of the vertebrate immune system is a problem of general importance. For example, it relates to the increasing rates of immunologically-based disease in modern human populations and to the desirability of identifying optimal immune function in domesticated animals. Despite this importance, our present understanding is compromised by a deficit of experimental studies that make adequately matched comparisons between wild and captive vertebrates. Results: We transferred post-larval fishes (three-spined sticklebacks), collected in the wild, to an anthropogenic (captive) environment. We then monitored, over 11 months, how the systemic expression of immunity genes changed in comparison to cohort-matched wild individuals in the originator population (total n = 299). We found that a range of innate (lyz, defbl2, il1r-like, tbk1)and adaptive (cd8a, igmh) immunity genes were up-regulated in captivity, accompanied by an increase in expression of the antioxidant enzyme, gpx4a. For some genes previously known to show seasonality in the wild, this appeared to be reduced in captive fishes. Captive fishes tended to express immunity genes, including igzh, foxp3b, lyz, defbl2, and il1r-like, more variably. Furthermore, although gene co-expression patterns (analyzed through gene-by-gene correlations and mutual information theory based networks) shared common structure in wild and captive fishes, there was also significant divergence. For one gene in particular, defbl2, high expression was associated with adverse health outcomes in captive fishes. Conclusion: Taken together, these results demonstrate widespread regulatory changes in the immune system in captive populations, and that the expression of immunity genes is more constrained in the wild. An increase in constitutive systemic immune activity, such as we observed here, may alter the risk of immunopathology and contribute to variance in health in vertebrate populations exposed to anthropogenic environments

    Discovery of a Non-Peptidic Inhibitor of West Nile Virus NS3 Protease by High-Throughput Docking

    Get PDF
    An estimated 2.5 billion people are at risk of diseases caused by dengue and West Nile virus. As of today, there are neither vaccines to prevent nor drugs to cure the severe infections caused by these viruses. The NS3 protease is one of the most promising targets for drug development against West Nile virus because it is an essential enzyme for viral replication and because success has been demonstrated with the closely related hepatitis C virus protease. We have discovered a small molecule that inhibits the NS3 protease of West Nile virus by computer-aided high-throughput docking, and validated it using three experimental techniques. The inhibitor has potential to be developed to a drug candidate to combat West Nile virus infections

    A comparative analysis of predictive models of morbidity in intensive care unit after cardiac surgery – Part I: model planning

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Different methods have recently been proposed for predicting morbidity in intensive care units (ICU). The aim of the present study was to critically review a number of approaches for developing models capable of estimating the probability of morbidity in ICU after heart surgery. The study is divided into two parts. In this first part, popular models used to estimate the probability of class membership are grouped into distinct categories according to their underlying mathematical principles. Modelling techniques and intrinsic strengths and weaknesses of each model are analysed and discussed from a theoretical point of view, in consideration of clinical applications.</p> <p>Methods</p> <p>Models based on Bayes rule, <it>k-</it>nearest neighbour algorithm, logistic regression, scoring systems and artificial neural networks are investigated. Key issues for model design are described. The mathematical treatment of some aspects of model structure is also included for readers interested in developing models, though a full understanding of mathematical relationships is not necessary if the reader is only interested in perceiving the practical meaning of model assumptions, weaknesses and strengths from a user point of view.</p> <p>Results</p> <p>Scoring systems are very attractive due to their simplicity of use, although this may undermine their predictive capacity. Logistic regression models are trustworthy tools, although they suffer from the principal limitations of most regression procedures. Bayesian models seem to be a good compromise between complexity and predictive performance, but model recalibration is generally necessary. <it>k</it>-nearest neighbour may be a valid non parametric technique, though computational cost and the need for large data storage are major weaknesses of this approach. Artificial neural networks have intrinsic advantages with respect to common statistical models, though the training process may be problematical.</p> <p>Conclusion</p> <p>Knowledge of model assumptions and the theoretical strengths and weaknesses of different approaches are fundamental for designing models for estimating the probability of morbidity after heart surgery. However, a rational choice also requires evaluation and comparison of actual performances of locally-developed competitive models in the clinical scenario to obtain satisfactory agreement between local needs and model response. In the second part of this study the above predictive models will therefore be tested on real data acquired in a specialized ICU.</p

    Genetic contributions to two special factors of neuroticism are associated with affluence, higher intelligence, better health, and longer life

    Get PDF
    Higher scores on the personality trait of neuroticism, the tendency to experience negative emotions, are associated with worse mental and physical health. Studies examining links between neuroticism and health typically operationalize neuroticism by summing the items from a neuroticism scale. However, neuroticism is made up of multiple heterogeneous facets, each contributing to the effect of neuroticism as a whole. A recent study showed that a 12-item neuroticism scale described one broad trait of general neuroticism and two special factors, one characterizing the extent to which people worry and feel vulnerable, and the other characterizing the extent to which people are anxious and tense. This study also found that, although individuals who were higher on general neuroticism lived shorter lives, individuals whose neuroticism was characterized by worry and vulnerability lived longer lives. Here, we examine the genetic contributions to the two special factors of neuroticism—anxiety/tension and worry/vulnerability—and how they contrast with that of general neuroticism. First, we show that, whereas the polygenic load for neuroticism is associated with the genetic risk of coronary artery disease, lower intelligence, lower socioeconomic status (SES), and poorer self-rated health, the genetic variants associated with high levels of anxiety/tension, and high levels of worry/vulnerability are associated with genetic variants linked to higher SES, higher intelligence, better self-rated health, and longer life. Second, we identify genetic variants that are uniquely associated with these protective aspects of neuroticism. Finally, we show that different neurological pathways are linked to each of these neuroticism phenotypes.</p
    corecore