824 research outputs found

    Integration and mining of malaria molecular, functional and pharmacological data: how far are we from a chemogenomic knowledge space?

    Get PDF
    The organization and mining of malaria genomic and post-genomic data is highly motivated by the necessity to predict and characterize new biological targets and new drugs. Biological targets are sought in a biological space designed from the genomic data from Plasmodium falciparum, but using also the millions of genomic data from other species. Drug candidates are sought in a chemical space containing the millions of small molecules stored in public and private chemolibraries. Data management should therefore be as reliable and versatile as possible. In this context, we examined five aspects of the organization and mining of malaria genomic and post-genomic data: 1) the comparison of protein sequences including compositionally atypical malaria sequences, 2) the high throughput reconstruction of molecular phylogenies, 3) the representation of biological processes particularly metabolic pathways, 4) the versatile methods to integrate genomic data, biological representations and functional profiling obtained from X-omic experiments after drug treatments and 5) the determination and prediction of protein structures and their molecular docking with drug candidate structures. Progresses toward a grid-enabled chemogenomic knowledge space are discussed.Comment: 43 pages, 4 figures, to appear in Malaria Journa

    PlantFuncSSR: Integrating first and next generation transcriptomics for mining of SSR-functional domains markers

    Get PDF
    © 2016 Sablok, Pérez-Pulido, Do, Seong, Casimiro-Soriguer, La Porta, Ralph, Squartini, Muñoz-Merida and Harikrishna. Analysis of repetitive DNA sequence content and divergence among the repetitive functional classes is a well-accepted approach for estimation of inter- and intrageneric differences in plant genomes. Among these elements, microsatellites, or Simple Sequence Repeats (SSRs), have been widely demonstrated as powerful genetic markers for species and varieties discrimination. We present PlantFuncSSRs platform having more than 364 plant species with more than 2 million functional SSRs. They are provided with detailed annotations for easy functional browsing of SSRs and with information on primer pairs and associated functional domains. PlantFuncSSRs can be leveraged to identify functional-based genic variability among the species of interest, which might be of particular interest in developing functional markers in plants. This comprehensive on-line portal unifies mining of SSRs from first and next generation sequencing datasets, corresponding primer pairs and associated in-depth functional annotation such as gene ontology annotation, gene interactions and its identification from reference protein databases. PlantFuncSSRs is freely accessible at: http://www. bioinfocabd.upo.es/plantssr

    A high‐quality functional genome assembly of delia radicum L. (diptera: anthomyiidae) annotated from egg to adult

    Get PDF
    Abstract Belowground herbivores are overseen and underestimated, even though they can cause significant economic losses in agriculture. The cabbage root fly Delia radicum (Anthomyiidae) is a common pest in Brassica species, including agriculturally important crops, such as oilseed rape. The damage is caused by the larvae, which feed specifically on the taproots of Brassica plants until they pupate. The adults are aboveground‐living generalists feeding on pollen and nectar. Female flies are attracted by chemical cues in Brassica plants for oviposition. An assembled and annotated genome can elucidate which genetic mechanisms underlie the adaptation of D . radicum to its host plants and their specific chemical defences, in particular isothiocyanates. Therefore, we assembled, annotated and analysed the D . radicum genome using a combination of different next‐generation sequencing and bioinformatic approaches. We assembled a chromosome‐level D . radicum genome using PacBio and Hi‐C Illumina sequence data. Combining Canu and 3D‐DNA genome assembler, we constructed a 1.3 Gbp genome with an N50 of 242 Mbp and 6 pseudo‐chromosomes. To annotate the assembled D . radicum genome, we combined homology‐, transcriptome‐ and ab initio‐prediction approaches. In total, we annotated 13,618 genes that were predicted by at least two approaches. We analysed egg, larval, pupal and adult transcriptomes in relation to life‐stage specific molecular functions. This high‐quality annotated genome of D . radicum is a first step to understanding the genetic mechanisms underlying host plant adaptation. As such, it will be an important resource to find novel and sustainable approaches to reduce crop losses to these pests

    Chlamydial contribution to anaerobic metabolism during eukaryotic evolution

    Get PDF
    The origin of eukaryotes is a major open question in evolutionary biology. Multiple hypotheses posit that eukaryotes likely evolved from a syntrophic relationship between an archaeon and an alphaproteobacterium based on H-2 exchange. However, there are no strong indications that modern eukaryotic H-2 metabolism originated from archaea or alphaproteobacteria. Here, we present evidence for the origin of H-2 metabolism genes in eukaryotes from an ancestor of the Anoxychlamydiales-a group of anaerobic chlamydiae, newly described here, from marine sediments. Among Chlamydiae, these bacteria uniquely encode genes for H-2 metabolism and other anaerobiosis-associated pathways. Phylogenetic analyses of several components of H-2 metabolism reveal that Anoxychlamydiales homologs are the closest relatives to eukaryotic sequences. We propose that an ancestor of the Anoxychlamydiales contributed these key genes during the evolution of eukaryotes, supporting a mosaic evolutionary origin of eukaryotic metabolism

    Comparative transcriptome analysis and simple sequence repeat marker development for two closely related Isodon species used as ‘Xihuangcao’ herbs

    Get PDF
    Purpose: To facilitate the molecular identification of original plants, resolve taxonomic problems and identify standards for ‘Xihuangcao’-based products on the market.Methods: A transcriptomic analysis of two closely related species, i.e., Isodon serra (Maxim.) (IS) and I. lophanthoides (Buch.-Ham. ex D. Don) Hara, was conducted by using the Illumina HiSeq 2500 platform, and expressed sequence tag-derived simple sequence repeat (EST-SSR) markers were developed based on these transcriptomes.Results: In total, 149,650 and 103,221 contigs were obtained, with N50 values of 1,400 and 1,516, from the IS and I. lophanthoides RNA-Seq datasets, respectively. These contigs were clustered into 107,777 and 68,220 unigenes, which were functionally annotated to identify the genes involved in therapeutic components. In total, 14,138 and 11,756 EST-SSR motifs were identified, and of these motifs, 7,453 and 6,428 were used to design primers for IS and I. lophanthoides, respectively. After PCR verification and fluorescence-based genotyping, 24 SSR markers with bright bands, high polymorphism, and single amplification were obtained and used to identify closely related Isodon species/varieties.Conclusion: These data could help herbal scientists identify high-quality herbal plants and provide a reference for genetic improvement and population genetic and phylogenetic studies investigating ‘Xihuangcao’ herbs.Keywords: Xihuangcao, Transcriptome, EST-SSR, Molecular marker

    The genome sequence and effector complement of the flax rust pathogen Melampsora lini

    Get PDF
    Rust fungi cause serious yield reductions on crops, including wheat, barley, soybean, coffee, and represent real threats to global food security. Of these fungi, the flax rust pathogen Melampsora lini has been developed most extensively over the past 80 years as a model to understand the molecular mechanisms that underpin pathogenesis. During infection, M. lini secretes virulence effectors to promote disease. The number of these effectors, their function and their degree of conservation across rust fungal species is unknown. To assess this, we sequenced and assembled de novo the genome of M. lini isolate CH5 into 21,130 scaffolds spanning 189 Mbp (scaffold N50 of 31 kbp). Global analysis of the DNA sequence revealed that repetitive elements, primarily retrotransposons, make up at least 45% of the genome. Using ab initio predictions, transcriptome data and homology searches, we identified 16,271 putative protein-coding genes. An analysis pipeline was then implemented to predict the effector complement of M. lini and compare it to that of the poplar rust, wheat stem rust and wheat stripe rust pathogens to identify conserved and species-specific effector candidates. Previous knowledge of four cloned M. lini avirulence effector proteins and two basidiomycete effectors was used to optimize parameters of the effector prediction pipeline. Markov clustering based on sequence similarity was performed to group effector candidates from all four rust pathogens. Clusters containing at least one member from M. lini were further analyzed and prioritized based on features including expression in isolated haustoria and infected leaf tissue and conservation across rust species. Herein, we describe 200 of 940 clusters that ranked highest on our priority list, representing 725 flax rust candidate effectors. Our findings on this important model rust species provide insight into how effectors of rust fungi are conserved across species and how they may act to promote infection on their hosts.This work was funded by a grant from the CSIRO Transformational Biology Capability Platform to Adnane Nemri. Claire Anderson was supported by an ARC Discovery Grant (DP120104044) awarded to David A. Jones and Peter N. Dodds

    Micro and Macro-Evolutionary Studies in Non-Model Species: a Transcriptomic Perspective in Teleosts

    Get PDF
    246 p.Los recursos genómicos y las herramientas bioinformáticas son muy escasas en especies no modelo. ElRNA-Seq puede ser una herramienta efectiva para generar tales recursos genómicos con el objetivo derevelar la variación genética y funciones necesarias en estudios micro- y macro-evolutivos.La primera sección de esta Tesis aborda la presencia o ausencia de los filogrupos Este y Oeste de dospoblaciones cultivadas de Tinca tinca en Europa Central. Nuestro estudio respalda la hipótesis de que losindividuos analizados resultan ser un mosaico genómico de ambos filogrupos, y que las diferencias entrelas dos razas se deben a la composición inicial de los filogrupos en el momento de su fundación.La segunda sección de esta Tesis describe EXFI, un método que utiliza algoritmos de última generaciónpara dividir tránscritos en exones.La tercera sección muestra un método de muestreo optimizado para la caracterización de tránscritos.Apliqué la estrategia multi-tejido para obtener muestras, secuenciar y ensamblar el transcriptoma másexhaustivo de la sardina europea.En la última sección, estudié las relaciones evolutivas dentro de los Clupeiformes. Recopilé un granconjunto de transcriptomas de doce especies, construí su árbol filogenético, y descubrí grupos de genesbajo selección positiva. La principal conclusión es que la evolución ha moldeado la maquinaria molecularde los Clupeiformes hacia un almacenamiento y transporte de lípidos mejorado

    Insights into the Sesquiterpenoid Pathway by Metabolic Profiling and De novo Transcriptome Assembly of Stem-Chicory (Cichorium intybus Cultigroup "Catalogna")

    Get PDF
    Stem-chicory of the "Catalogna" group is a vegetable consumed for bitter-flavored stems. Type and levels of bitter sesquiterpene lactones (STLs) participate in conferring bitterness in vegetables. The content of lactucin-and lactucopocrin-like STLs was higher in "Molfettese" than "Galatina" landrace stalks, regardless of the cultivation sites, consistently with bitterness scores and gustative differences. The "Galatina" transcriptome assembly resulted in 58,872 unigenes, 77% of which were annotated, paving the way to molecular investigation of the STL pathway. Comparative transcriptome analysis allowed the identification of 69,352 SNPs and of 1640 differentially expressed genes that maintained the pattern independently of the site. Enrichment analyses revealed that 4 out of 29 unigenes were up-regulated in "Molfettese" vs "Galatina" within the sesquiterpenoid pathway. The expression of two germacrene A -synthase (GAS) and one -oxidase (GAO) genes of the costunolide branch correlated positively with the contents of lactucin-like molecules, supporting that STL biosynthesis regulation occurs at the transcriptional level. Finally, 46 genes encoding transcription factors (TFs) maintained a differential expression pattern between the two varieties regardless of the growth site; correlation analyses among TFs, GAS, GAO gene expressions and STLs contents suggest that one MYB and one bHLH may act in the pathway
    corecore