1,156,214 research outputs found

    Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?

    Get PDF
    The organization and mining of malaria genomic and post-genomic data is highly motivated by the necessity to predict and characterize new biological targets and new drugs. Biological targets are sought in a biological space designed from the genomic data from Plasmodium falciparum, but using also the millions of genomic data from other species. Drug candidates are sought in a chemical space containing the millions of small molecules stored in public and private chemolibraries. Data management should therefore be as reliable and versatile as possible. In this context, we examined five aspects of the organization and mining of malaria genomic and post-genomic data: 1) the comparison of protein sequences including compositionally atypical malaria sequences, 2) the high throughput reconstruction of molecular phylogenies, 3) the representation of biological processes particularly metabolic pathways, 4) the versatile methods to integrate genomic data, biological representations and functional profiling obtained from X-omic experiments after drug treatments and 5) the determination and prediction of protein structures and their molecular docking with drug candidate structures. Progresses toward a grid-enabled chemogenomic knowledge space are discussed.Comment: 43 pages, 4 figures, to appear in Malaria Journa

    Microbial genomic taxonomy

    Get PDF
    A need for a genomic species definition is emerging from several independent studies worldwide. In this commentary paper, we discuss recent studies on the genomic taxonomy of diverse microbial groups and a unified species definition based on genomics. Accordingly, strains from the same microbial species share >95% Average Amino Acid Identity (AAI) and Average Nucleotide Identity (ANI), >95% identity based on multiple alignment genes, 70% in silico Genome-to-Genome Hybridization similarity (GGDH). Species of the same genus will form monophyletic groups on the basis of 16S rRNA gene sequences, Multilocus Sequence Analysis (MLSA) and supertree analysis. In addition to the established requirements for species descriptions, we propose that new taxa descriptions should also include at least a draft genome sequence of the type strain in order to obtain a clear outlook on the genomic landscape of the novel microbe. The application of the new genomic species definition put forward here will allow researchers to use genome sequences to define simultaneously coherent phenotypic and genomic groups

    The effect of genomic information on optimal contribution selection in livestock breeding programs

    Get PDF
    BACKGROUND: Long-term benefits in animal breeding programs require that increases in genetic merit be balanced with the need to maintain diversity (lost due to inbreeding). This can be achieved by using optimal contribution selection. The availability of high-density DNA marker information enables the incorporation of genomic data into optimal contribution selection but this raises the question about how this information affects the balance between genetic merit and diversity. METHODS: The effect of using genomic information in optimal contribution selection was examined based on simulated and real data on dairy bulls. We compared the genetic merit of selected animals at various levels of co-ancestry restrictions when using estimated breeding values based on parent average, genomic or progeny test information. Furthermore, we estimated the proportion of variation in estimated breeding values that is due to within-family differences. RESULTS: Optimal selection on genomic estimated breeding values increased genetic gain. Genetic merit was further increased using genomic rather than pedigree-based measures of co-ancestry under an inbreeding restriction policy. Using genomic instead of pedigree relationships to restrict inbreeding had a significant effect only when the population consisted of many large full-sib families; with a half-sib family structure, no difference was observed. In real data from dairy bulls, optimal contribution selection based on genomic estimated breeding values allowed for additional improvements in genetic merit at low to moderate inbreeding levels. Genomic estimated breeding values were more accurate and showed more within-family variation than parent average breeding values; for genomic estimated breeding values, 30 to 40% of the variation was due to within-family differences. Finally, there was no difference between constraining inbreeding via pedigree or genomic relationships in the real data. CONCLUSIONS: The use of genomic estimated breeding values increased genetic gain in optimal contribution selection. Genomic estimated breeding values were more accurate and showed more within-family variation, which led to higher genetic gains for the same restriction on inbreeding. Using genomic relationships to restrict inbreeding provided no additional gain, except in the case of very large full-sib families

    GenomeGraphs: integrated genomic data visualization with R.

    Get PDF
    BackgroundBiological studies involve a growing number of distinct high-throughput experiments to characterize samples of interest. There is a lack of methods to visualize these different genomic datasets in a versatile manner. In addition, genomic data analysis requires integrated visualization of experimental data along with constantly changing genomic annotation and statistical analyses.ResultsWe developed GenomeGraphs, as an add-on software package for the statistical programming environment R, to facilitate integrated visualization of genomic datasets. GenomeGraphs uses the biomaRt package to perform on-line annotation queries to Ensembl and translates these to gene/transcript structures in viewports of the grid graphics package. This allows genomic annotation to be plotted together with experimental data. GenomeGraphs can also be used to plot custom annotation tracks in combination with different experimental data types together in one plot using the same genomic coordinate system.ConclusionGenomeGraphs is a flexible and extensible software package which can be used to visualize a multitude of genomic datasets within the statistical programming environment R

    SIG-DB: leveraging homomorphic encryption to Securely Interrogate privately held Genomic DataBases

    Full text link
    Genomic data are becoming increasingly valuable as we develop methods to utilize the information at scale and gain a greater understanding of how genetic information relates to biological function. Advances in synthetic biology and the decreased cost of sequencing are increasing the amount of privately held genomic data. As the quantity and value of private genomic data grows, so does the incentive to acquire and protect such data, which creates a need to store and process these data securely. We present an algorithm for the Secure Interrogation of Genomic DataBases (SIG-DB). The SIG-DB algorithm enables databases of genomic sequences to be searched with an encrypted query sequence without revealing the query sequence to the Database Owner or any of the database sequences to the Querier. SIG-DB is the first application of its kind to take advantage of locality-sensitive hashing and homomorphic encryption to allow generalized sequence-to-sequence comparisons of genomic data.Comment: 38 pages, 3 figures, 4 tables, 1 supplemental table, 7 supplemental figure

    The genomics of neonatal abstinence syndrome

    Get PDF
    Significant variability has been observed in the development and severity of neonatal abstinence syndrome (NAS) among neonates exposed to prenatal opioids. Since maternal opioid dose does not appear to correlate directly with neonatal outcome, maternal, placental, and fetal genomic variants may play important roles in NAS. Previous studies in small cohorts have demonstrated associations of variants in maternal and infant genes that encode the μ-opioid receptor (OPRM1), catechol-O-methyltransferase (COMT), and prepronociceptin (PNOC) with a shorter length of hospital stay and less need for treatment in neonates exposed to opioids in utero. Consistently falling genomic sequencing costs and computational approaches to predict variant function will permit unbiased discovery of genomic variants and gene pathways associated with differences in maternal and fetal opioid pharmacokinetics and pharmacodynamics and with placental opioid transport and metabolism. Discovery of pathogenic variants should permit better delineation of the risk of developing more severe forms of NAS. This review provides a summary of the current role of genomic factors in the development of NAS and suggests strategies for further genomic discovery

    Microarray-based ultra-high resolution discovery of genomic deletion mutations

    Get PDF
    BACKGROUND: Oligonucleotide microarray-based comparative genomic hybridization (CGH) offers an attractive possible route for the rapid and cost-effective genome-wide discovery of deletion mutations. CGH typically involves comparison of the hybridization intensities of genomic DNA samples with microarray chip representations of entire genomes, and has widespread potential application in experimental research and medical diagnostics. However, the power to detect small deletions is low. RESULTS: Here we use a graduated series of Arabidopsis thaliana genomic deletion mutations (of sizes ranging from 4 bp to ~5 kb) to optimize CGH-based genomic deletion detection. We show that the power to detect smaller deletions (4, 28 and 104 bp) depends upon oligonucleotide density (essentially the number of genome-representative oligonucleotides on the microarray chip), and determine the oligonucleotide spacings necessary to guarantee detection of deletions of specified size. CONCLUSIONS: Our findings will enhance a wide range of research and clinical applications, and in particular will aid in the discovery of genomic deletions in the absence of a priori knowledge of their existence

    Genomic variations associated with attenuation in Mycobacterium avium subsp paratuberculosis vaccine strains

    Get PDF
    BACKGROUND: Mycobacterium avium subspecies paratuberculosis (MAP) whole cell vaccines have been widely used tools in the control of Johne's disease in animals despite being unable to provide complete protection. Current vaccine strains derive from stocks created many decades ago; however their genotypes, underlying mechanisms and relative degree of their attenuation are largely unknown. RESULTS: Using mouse virulence studies we confirm that MAP vaccine strains 316 F, II and 2e have diverse but clearly attenuated survival and persistence characteristics compared with wild type strains. Using a pan genomic microarray we characterise the genomic variations in a panel of vaccine strains sourced from stocks spanning over 40 years of maintenance. We describe multiple genomic variations specific for individual vaccine stocks in both deletion (26-32 Kbp) and tandem duplicated (11-40 Kbp) large variable genomic islands and insertion sequence copy numbers. We show individual differences suitable for diagnostic differentiation between vaccine and wild type genotypes and provide evidence for functionality of some of the deleted MAP-specific genes and their possible relation to attenuation. CONCLUSIONS: This study shows how culture environments have influenced MAP genome diversity resulting in large tandem genomic duplications, deletions and transposable element activity. In combination with classical selective systematic subculture this has led to fixation of specific MAP genomic alterations in some vaccine strain lineages which link the resulting attenuated phenotypes with deficiencies in high reactive oxygen species handling
    corecore