6,054 research outputs found

    Sequencing, Analysis, and Annotation of Expressed Sequence Tags for \u3ci\u3eCamelus dromedarius\u3c/i\u3e

    Get PDF
    Despite its economical, cultural, and biological importance, there has not been a large scale sequencing project to date for Camelus dromedarius. With the goal of sequencing complete DNA of the organism, we first established and sequenced camel EST libraries, generating 70,272 reads. Following trimming, chimera check, repeat masking, cluster and assembly, we obtained 23,602 putative gene sequences, out of which over 4,500 potentially novel or fast evolving gene sequences do not carry any homology to other available genomes. Functional annotation of sequences with similarities in nucleotide and protein databases has been obtained using Gene Ontology classification. Comparison to available full length cDNA sequences and Open Reading Frame (ORF) analysis of camel sequences that exhibit homology to known genes show more than 80% of the contigs with an ORF\u3e300 bp and ~40% hits extending to the start codons of full length cDNAs suggesting successful characterization of camel genes. Similarity analyses are done separately for different organisms including human, mouse, bovine, and rat. Accompanying web portal, CAGBASE (http://camel.kacst.edu.sa/), hosts a relational database containing annotated EST sequences and analysis tools with possibility to add sequences from public domain. We anticipate our results to provide a home base for genomic studies of camel and other comparative studies enabling a starting point for whole genome sequencing of the organism

    Metatranscriptome analysis of microbial communities in rice microcosms

    Get PDF
    Metatranscriptomics is a state-of-the-art technique to elucidate the functional activities of microbial communities, but its application is still limited to marine microbial assemblages. In my PhD project, we established the complete approach of soil metatranscriptomics, involving RNA extraction, cDNA library preparation by random priming, 454-pyrosequencing, and bioinformatic data analysis. The approach was tested on microbial communities in the oxic surface layer and the anoxic bulk soil of flooded rice paddy soil microcosms. Total RNA was recovered in high integrity and purity by low-pH phenol extraction (pH 5.0) followed by Q-Sepharose column chromatography. We were able to enrich mRNA sequences up to 50-70% in the metatranscriptome libraries using Ribo-Zero™ rRNA removal kit (Meta-Bacteria). All the 454 reads obtained were preprocessed prior to data analysis to minimize sequence ambiguities. A total of 10,000 SSU-ribotags (total RNA) were analyzed to elucidate community composition in the oxic and anoxic zones at three different incubation time points (25, 45 and 90 days after transplantation of rice seedlings). Additionally, about 45,000 and 12,000 mRNA-tags (enriched mRNA) were obtained for the analysis of functional activities in, respectively, the oxic and anoxic zone of 90-day-old rice microcosms. SSU-ribotag data analysis revealed no major temporal changes in community composition except for Geobacter, Clostridia and methanogens in the anoxic bulk soil. However, the taxonomic composition of microbial communities was clearly distinct between the oxic and anoxic zones, with cyanobacteria being the dominant group in the surface layer. Although mRNA-tags related to basic cellular functions were most abundant in both mRNA datasets, the expression of specific functions in response to different oxygen conditions was observed such as, for example, methane oxidation in the oxic zone and methanogenesis in the anoxic zone. Our metatranscriptomic approach provides a means to analyze the composition and functional gene expression of complex soil microbial communities while avoiding the limitations of PCR-based approaches

    Adaptive genomic structural variation in the grape powdery mildew pathogen, Erysiphe necator.

    Get PDF
    BackgroundPowdery mildew, caused by the obligate biotrophic fungus Erysiphe necator, is an economically important disease of grapevines worldwide. Large quantities of fungicides are used for its control, accelerating the incidence of fungicide-resistance. Copy number variations (CNVs) are unbalanced changes in the structure of the genome that have been associated with complex traits. In addition to providing the first description of the large and highly repetitive genome of E. necator, this study describes the impact of genomic structural variation on fungicide resistance in Erysiphe necator.ResultsA shotgun approach was applied to sequence and assemble the genome of five E. necator isolates, and RNA-seq and comparative genomics were used to predict and annotate protein-coding genes. Our results show that the E. necator genome is exceptionally large and repetitive and suggest that transposable elements are responsible for genome expansion. Frequent structural variations were found between isolates and included copy number variation in EnCYP51, the target of the commonly used sterol demethylase inhibitor (DMI) fungicides. A panel of 89 additional E. necator isolates collected from diverse vineyard sites was screened for copy number variation in the EnCYP51 gene and for presence/absence of a point mutation (Y136F) known to result in higher fungicide tolerance. We show that an increase in EnCYP51 copy number is significantly more likely to be detected in isolates collected from fungicide-treated vineyards. Increased EnCYP51 copy numbers were detected with the Y136F allele, suggesting that an increase in copy number becomes advantageous only after the fungicide-tolerant allele is acquired. We also show that EnCYP51 copy number influences expression in a gene-dose dependent manner and correlates with fungal growth in the presence of a DMI fungicide.ConclusionsTaken together our results show that CNV can be adaptive in the development of resistance to fungicides by providing increasing quantitative protection in a gene-dosage dependent manner. The results of this work not only demonstrate the effectiveness of using genomics to dissect complex traits in organisms with very limited molecular information, but also may have broader implications for understanding genomic dynamics in response to strong selective pressure in other pathogens with similar genome architectures

    INVESTIGATION INTO THE GENETIC BASIS OF CAPSAICIN PRODUCTION IN PEPPERS USING NEXT GENERATION RNA SEQUENCING AND SYNTHETIC BIOLOGY APPROACHES

    Get PDF
    Capsaicin, a molecule synthesized by plants in the Capsicum genus, is popular for its ability to produce a sensation of burning in any tissue it encounters. The synthesis of capsaicin molecules is achieved through the capsaicin biosynthesis pathway. In this dual study, our goal was to insert two crucial genes, pun1 and pAMT, into a strain of Saccharomyces cerevisiae to allow capsaicin synthesis and perform Illumina RNA sequencing on seven pepper species of increasing pungency to identify other key or novel genes needed or related to capsaicin synthesis. We implemented a golden gate cloning strategy to insert our genes of interest into bacteria to then be cloned into yeast. We believe that successful insertion into our yeast strain was achieved for one of the genes, pAMT, but the other, pun1, appears to not be inserted. We hypothesize that correct insertion and expression of pun1 would achieve capsaicin synthesis alongside expression of pAMT, as these two genes would complete the missing parts of the capsaicin pathway. We identified five possible new gene candidates with unknown functions grouped together with similar expression to known genes present in the capsaicin pathway. These novel genes were identified as follows: CA01g11020, CA12g21630, CA09g00520, CA03g28900, CA09g15570. We also identified five regions of interest that showed similar trends in expression patterns that could contain new promising genes that are not known to participate in the capsaicin biosynthesis pathway

    Diversity in parasitic nematode genomes: the microRNAs of Brugia pahangi and Haemonchus contortus are largely novel

    Get PDF
    <b>BACKGROUND:</b> MicroRNAs (miRNAs) play key roles in regulating post-transcriptional gene expression and are essential for development in the free-living nematode Caenorhabditis elegans and in higher organisms. Whether microRNAs are involved in regulating developmental programs of parasitic nematodes is currently unknown. Here we describe the the miRNA repertoire of two important parasitic nematodes as an essential first step in addressing this question. <b>RESULTS:</b> The small RNAs from larval and adult stages of two parasitic species, Brugia pahangi and Haemonchus contortus, were identified using deep-sequencing and bioinformatic approaches. Comparative analysis to known miRNA sequences reveals that the majority of these miRNAs are novel. Some novel miRNAs are abundantly expressed and display developmental regulation, suggesting important functional roles. Despite the lack of conservation in the miRNA repertoire, genomic positioning of certain miRNAs within or close to specific coding genes is remarkably conserved across diverse species, indicating selection for these associations. Endogenous small-interfering RNAs and Piwi-interacting (pi)RNAs, which regulate gene and transposon expression, were also identified. piRNAs are expressed in adult stage H. contortus, supporting a conserved role in germline maintenance in some parasitic nematodes. <b>CONCLUSIONS:</b> This in-depth comparative analysis of nematode miRNAs reveals the high level of divergence across species and identifies novel sequences potentially involved in development. Expression of novel miRNAs may reflect adaptations to different environments and lifestyles. Our findings provide a detailed foundation for further study of the evolution and function of miRNAs within nematodes and for identifying potential targets for intervention

    Identification and Functional Analyses of 11 769 Full-length Human cDNAs Focused on Alternative Splicing

    Get PDF
    We analyzed diversity of mRNA produced as a result of alternative splicing in order to evaluate gene function. First, we predicted the number of human genes transcribed into protein-coding mRNAs by using the sequence information of full-length cDNAs and 5′-ESTs and obtained 23 241 of such human genes. Next, using these genes, we analyzed the mRNA diversity and consequently sequenced and identified 11 769 human full-length cDNAs whose predicted open reading frames were different from other known full-length cDNAs. Especially, 30% of the cDNAs we identified contained variation in the transcription start site (TSS). Our analysis, which particularly focused on multiple variable first exons (FEVs) formed due to the alternative utilization of TSSs, led to the identification of 261 FEVs expressed in the tissue-specific manner. Quantification of the expression profiles of 13 genes by real-time PCR analysis further confirmed the tissue-specific expression of FEVs, e.g. OXR1 had specific TSS in brain and tumor tissues, and so on. Finally, based on the results of our mRNA diversity analysis, we have created the FLJ Human cDNA Database. From our result, it has been understood mechanisms that one gene produces suitable protein-coding transcripts responding to the situation and the environment

    Whole transcriptome profiling of Late-Onset Alzheimer's Disease patients provides insights into the molecular changes involved in the disease

    Get PDF
    Alzheimer's Disease (AD) is the most common cause of dementia affecting the elderly population worldwide. We have performed a comprehensive transcriptome profiling of Late-Onset AD (LOAD) patients using second generation sequencing technologies, identifying 2,064 genes, 47 lncRNAs and 4 miRNAs whose expression is specifically deregulated in the hippocampal region of LOAD patients. Moreover, analyzing the hippocampal, temporal and frontal regions from the same LOAD patients, we identify specific sets of deregulated miRNAs for each region, and we confirm that the miR-132/212 cluster is deregulated in each of these regions in LOAD patients, consistent with these miRNAs playing a role in AD pathogenesis. Notably, a luciferase assay indicates that miR-184 is able to target the 3'UTR NR4A2 - which is known to be involved in cognitive functions and long-term memory and whose expression levels are inversely correlated with those of miR-184 in the hippocampus. Finally, RNA editing analysis  reveals a general RNA editing decrease in LOAD hippocampus, with 14 recoding sites significantly and differentially edited in 11 genes. Our data underline specific transcriptional changes in LOAD brain and provide an important source of information for understanding the molecular changes characterizing LOAD progression

    Next-generation sequencing and its new possibilities in medicine

    Get PDF
    Next-Generation Sequencing (NGS) originally refers to high-throughput, massively parallel sequencing methods that allow the sequencing of up to billions of small (50-1000 bp), amplified DNA fragments at the same time but nowadays, there are NGS techniques that determine the sequence of long (up to 50 kbp) single molecules. Over the past years, NGS technologies become widely available with increasing throughput and decreasing sequencing costs per base making them more cost effective than the previously used capillary sequencing methods based on Sanger biochemistry. Nowadays, high-throughput DNA sequencing is routinely used on a wide range of important fields of biology and medicine enabling large-scale sequencing projects like analysis of complete genomes, disease association studies, whole transcriptomes, methylomes and provide new insights into complex biological systems. In addition, more and more NGS-based diagnostic tools are being introduced into the clinical practice, for example, on the fields of oncology, inherited and infectious diseases or pre-implantation and prenatal genetic screenings
    corecore