67 research outputs found

    Patterns of Learning Object Reuse in the Connexions Repository

    Get PDF
    Doctoral Dissertation abstract: Since the term learning object was first published, there has been either an explicit or implicit expectation of reuse. There has also been a lot of speculation about why learning objects are, or are not, reused. This study quantitatively examined the actual amount and type of learning object use, to include reuse, modification, and translation, within a single open educational resource repositoryâConnexions. The results indicate that about a quarter of used objects are subsequently reused, modified, or translated. While these results are repository specific, they represent an important first step in providing an empirical evaluation of the frequency and some reasons for reuse, as well as establishing metrics and terminology for future studies

    Design and data analysis of kinome microarrays

    Get PDF
    Catalyzed by protein kinases, phosphorylation is the most important post-translational modification in eukaryotes and is involved in the regulation of almost all cellular processes. Investigating phosphorylation events and how they change in response to different biological conditions is integral to understanding cellular signaling processes in general, as well as to defining the role of phosphorylation in health and disease. A recently-developed technology for studying phosphorylation events is the kinome microarray, which consists of several hundred "spots" arranged in a grid-like pattern on a glass slide. Each spot contains many peptides of a particular amino acid sequence chemically fixed to the slide, with different spots containing peptides with different sequences. Each peptide is a subsequence of a full protein, containing an amino acid residue that is known or suspected to undergo phosphorylation in vivo, as well as several surrounding residues. When a kinome microarray is exposed to cell lysate, the protein kinases in the lysate catalyze the phosphorylation of the peptides on the array. By measuring the degree to which the peptides comprising each spot are phosphorylated, insight can be gained into the upregulation or downregulation of signaling pathways in response to different biological treatments or conditions. There are two main computational challenges associated with kinome microarrays. The first is array design, which involves selecting the peptides to be included on a given array. The level of difficulty of this task depends largely on the number of phosphorylation sites that have been experimentally identified in the proteome of the organism being studied. For instance, thousands of phosphorylation sites are known for human and mouse, allowing considerable freedom to select peptides that are relevant to the problem being examined. In contrast, few sites are known for, say, honeybee and soybean. For such organisms, it is useful to expand the set of possible peptides by using computational techniques to predict probable phosphorylation sites. In this thesis, existing techniques for the computational prediction of phosphorylation sites are reviewed. In addition, two novel methods are described for predicting phosphorylation events in organisms with few known sites, with each method using a fundamentally different approach. The first technique, called PHOSFER, uses a random forest-based machine-learning strategy, while the second, called DAPPLE, takes advantage of sequence homology between known sites and the proteome of interest. Both methods are shown to allow quicker or more accurate predictions in organisms with few known sites than comparable previous techniques. Therefore, the use of kinome microarrays is no longer limited to the study of organisms having many known phosphorylation sites; rather, this technology can potentially be applied to any organism having a sequenced genome. It is shown that PHOSFER and DAPPLE are suitable for identifying phosphorylation sites in a wide variety of organisms, including cow, honeybee, and soybean. The second computational challenge is data analysis, which involves the normalization, clustering, statistical analysis, and visualization of data resulting from the arrays. While software designed for the analysis of DNA microarrays has also been used for kinome arrays, differences between the two technologies prompted the development of PIIKA, a software package specifically designed for the analysis of kinome microarray data. By comparing with methods used for DNA microarrays, it is shown that PIIKA improves the ability to identify biological pathways that are differentially regulated in a treatment condition compared to a control condition. Also described is an updated version, PIIKA 2, which contains improvements and new features in the areas of clustering, statistical analysis, and data visualization. Given the previous absence of dedicated tools for analyzing kinome microarray data, as well as their wealth of features, PIIKA and PIIKA 2 represent an important step in maximizing the scientific value of this technology. In addition to the above techniques, this thesis presents three studies involving biological applications of kinome microarray analysis. The first study demonstrates the existence of "kinotypes" - species- or individual-specific kinome profiles - which has implications for personalized medicine and for the use of model organisms in the study of human disease. The second study uses kinome analysis to characterize how the calf immune system responds to infection by the bacterium Mycobacterium avium subsp. paratuberculosis. Finally, the third study uses kinome arrays to study parasitism of honeybees by the mite Varroa destructor, which is thought to be a major cause of colony collapse disorder. In order to make the methods described above readily available, a website called the SAskatchewan PHosphorylation Internet REsource (SAPHIRE) has been developed. Located at the URL http://saphire.usask.ca, SAPHIRE allows researchers to easily make use of PHOSFER, DAPPLE, and PIIKA 2. These resources facilitate both the design and data analysis of kinome microarrays, making them an even more effective technique for studying cellular signaling

    Dissection of pleiotropic effects in genome-wide association studies of phenotypes related to cardiometabolic health

    Get PDF
    In the past seven years, Genome-Wide Association Studies (GWAS) have identified hundreds of variants associated with cardiometabolic quantitative traits and diseases. Many genetic loci appear to harbour variants associated with multiple phenotypes (cross-phenotype associations, CP). CP associations highlight that phenotypes may share common underlying genetic mechanisms that might, or might not, be consistent with epidemiological expectations and, therefore, add complexity to the relationships between human phenotypes. Pleiotropy occurs when the same genetic causal element affects more than one phenotype “in parallel” and can explain the presence of CP associations. It can appear at a single variant level, where a single causal variant is related to multiple phenotypes, or at a locus level, that is when multiple variants in the same gene or locus are associated with different phenotypes by affecting the same functional element. However, other potential genetic mechanisms, that can explain CP associations, exist. Among them, mediation occurs when a genetic variant is directly associated with a phenotype and that phenotype is itself causal for a second phenotype or more phenotypes; multi-phenotype allelic heterogeneity is a phenomenon which involves independent uncorrelated variants within the same locus which cause changes in multiple phenotypes, by affecting them through independent pathways related to distinct functional elements. The identification and characterisation of CP associations across the genome may help uncovering the mechanistic basis of physiological processes that underlie variability of cardiometabolic quantitative traits, and of pathogenetic processes leading to metabolic disorders. The definition of specific patterns of effect combinations on cardiometabolic phenotypes will highlight novel biological pathways, targets for translational research, for therapeutic intervention, and for the understanding of the pathophysiology of human metabolism. Based on this hypothesis, and in collaboration with the Cross-Consortia pleiotropy group and with the European Network for Genetic and Genomic Epidemiology (ENGAGE) consortium, my PhD project focused on dissection of CP effects, pleiotropy in particular, at common variants across the genome in association with cardiometabolic phenotypes. The objective was to improve our understanding of the extent of shared genetics between cardiometabolic phenotypes and of the influences of DNA sequence variation on risk of metabolic diseases, considering phenotypes as a range of inter-related manifestations of biological mechanisms rather than as isolated events. My research has been divided into three sub-projects: Project 1: Clustering and pathway analysis of univariate GWAS results for the detection of pleiotropic effects. We explored multi-phenotype effects at hundreds of established cardiometabolic genetic variants from published univariate GWAS meta-analyses on more than 20 respective phenotypes, by defining clusters of loci with similar multiple effects, comparing them to known epidemiological expectations, and identifying enriched biological networks within the most interesting groups of loci. Our results highlighted that many variants at cardiometabolic loci have multiple associations that characterise different aspects of metabolism. Cardiometabolic loci can be grouped according to their shared multi-phenotype effects and metabolic syndrome represents just one possible combination; in fact, several other unexpected combinations might be observed, for example healthy obesity/unhealthy leanness. We also highlighted that genetic loci with similar cardiometabolic effects are involved in shared biological pathways. Some of these may be expected, for instance, regulation of lipids metabolism or cholesterol transport for groups of loci with strong effects on lipids, and circulatory system processes for genes near blood pressure-association signals. Sometimes groups of loci affected fundamental cell functions, such as regulation of cellular processes, for the loci with effects on obesity and anthropometric traits. The enriched connectivity within pathway networks revealed new potential candidate genes and tissues of action that are more likely to have causal effect on phenotypes. Project 2: Validating pleiotropy and analysis of locus architecture in potential pleiotropic regions. We aimed to dissect the architecture of established cardiometabolic loci showing multiple associations for a better definition of the underlying mechanisms of multi-phenotype effects and for the discernment of potential pleiotropy from allelic heterogeneity. To this aim, we applied an approximate conditional analysis, based on observed linkage disequilibrium patterns, which led us to the discovery of multiple associations at adjacent variants that underlie the same genetic cause for variability of different phenotypes. Our results also highlighted that a substantial proportion of metabolic loci incorporate complex patterns of multi-phenotype allelic heterogeneity, thus suggesting an important contribution of this mechanism into cross-phenotype effects. Project 3: Application of a multivariate statistical approach for the study of pleiotropy within cardiometabolic phenotypes. We developed and applied a statistical strategy for joint multivariate analysis of multiple correlated phenotypes using individual genetic data from the ENGAGE consortium to discover new uncovered multiple associations and to follow-up GWAS meta-analysis at two loci, FTO and FADS1. Using this approach we were able to take into account correlation between phenotypes, and we achieved a boost in power; moreover, we improved precision of parameter estimates and of the identification of novel candidate genes. Our results allowed us to identify several variants jointly associated with multiple lipid traits and body mass index. Our approach was useful for the identification of mediation: we, in fact, confirmed mediation underlying causal relationship between adiposity and other cardiometabolic phenotypes at the FTO locus. Additionally, we demonstrated that multiple effects on cardiometabolic phenotypes attributable to the FADS1 locus are mediated by its independent, thus pleiotropic, effect on total cholesterol and triglycerides. In conclusion, we applied several statistical approaches which allowed dissecting suggestive CP effects and their mechanisms, including pleiotropy, mediation and allelic heterogeneity. Our analyses have demonstrated the complexity of the relationships between cardiometabolic phenotypes related to the variability of both, underlying genetic mechanisms and genetic loci architecture

    Intégration d'instructions data-parallèles dans le langage PSC et compilation pour processeur SIMD (INTEL SSE)

    Get PDF
    II existe des instructions data-parallèles dans les processeurs modernes. Ces instructions permettent d'effectuer la même opération sur plusieurs données différentes en parallèle. Présentement il est difficile de programmer des logiciels qui utilisent ces instructions data-parallèles avec les solutions existantes, Nous avons donc exploré l'utilisation d'un langage destiné à la programmation des circuits parallèles comme les FPGA (Field Programmable Gate Array) pour fabriqué un logiciel qui permet d'utiliser ces instructions data-parallèles de manière simple et efficace. Un langage de haut niveau pour la programmation des FPGA. le langage psC- Parallel and Synchronous C- a été choisi, Sa syntaxe proche du C, son paradigme entièrement parallèle et la disponibilité du code source ont justifié ce choix, II y a plusieurs années, les gens pensaient qu'aujourd'hui l'optimisation ne serait plus aussi importante qu'elle l'était pour eux. Ils disaient que la quantité de mémoire et la puissance de calculs des processeurs ferait en sorte que le gain en temps ne vaudrait pas l'effort de programmation nécessaire pour programmer du code optimisé. Maintenant, nous savons que ce n'est pas le cas. Les processeurs ont certes eu un gain de performance important, mais les tâches qu'ils accomplissent nécessitent de plus en plus de puissance de calculs et de mémoire. Aujourd'hui, une bonne partie de la puissance de calculs s'obtient par l'utilisation des instructions data-parallèles disponibles dans les processeurs modernes. Pour inclure ces instructions data-parallèles dans un logicieL il n'y a pas beaucoup d'alternatives disponibles. Ce travail a consisté à réaliser un compilateur complet pour machine SIMD. Une nouvelle syntaxe permettant de supporter les instructions data-parallèles a été définie et intégrée à celle du langage psC. L'algorithme de génération de code assembleur pour les instructions data-parallèles de type SSE d'Intel a été implémenté et testé. Finalement, trois applications ont été programmées et les performances de rapidité d'exécution comparées à diverses méthodes classiques de programmation. Les résultats montrent que les performances obtenu par le langage psC est toujours situé entre celui obtenu par un expert codant en langage assembleur et celui obtenu par les compilateurs C et C++, Ceci correspond à ce qui était désiré. En conclusion, ce travail de recherche a démontré qu'il était possible d'utiliser un langage HL-HDL (High Level Hardware Description Language) pour générer du code qui bénéficie des instructions data-parallèles. Le gain en performance de F implementation psC est présenté pour tous les cas étudié, et se rapproche de F implementation assembleur qui est le maximum atteignable

    Trends and challenges for wind energy harvesting : workshop, March 30-31, 2015, Coimbra, Portugal

    Get PDF
    corecore