28 research outputs found

    Defensin-like peptides in wheat analyzed by whole-transcriptome sequencing: a focus on structural diversity and role in induced resistance

    Get PDF
    Antimicrobial peptides (AMPs) are the main components of the plant innate immune system. Defensins represent the most important AMP family involved in defense and non-defense functions. In this work, global RNA sequencing and de novo transcriptome assembly were performed to explore the diversity of defensin-like (DEFL) genes in the wheat Triticum kiharae and to study their role in induced resistance (IR) mediated by the elicitor metabolites of a non-pathogenic strain FS-94 of Fusarium sambucinum. Using a combination of two pipelines for DEFL mining in transcriptome data sets, as many as 143 DEFL genes were identified in T. kiharae, the vast majority of them represent novel genes. According to the number of cysteine residues and the cysteine motif, wheat DEFLs were classified into ten groups. Classical defensins with a characteristic 8-Cys motif assigned to group 1 DEFLs represent the most abundant group comprising 52 family members. DEFLs with a characteristic 4-Cys motif CX{3,5}CX{8,17}CX{4,6}C named group 4 DEFLs previously found only in legumes were discovered in wheat. Within DEFL groups, subgroups of similar sequences originated by duplication events were isolated. Variation among DEFLs within subgroups is due to amino acid substitutions and insertions/deletions of amino acid sequences. To identify IR-related DEFL genes, transcriptional changes in DEFL gene expression during elicitor-mediated IR were monitored. Transcriptional diversity of DEFL genes in wheat seedlings in response to the fungus Fusarium oxysporum, FS-94 elicitors, and the combination of both (elicitors + fungus) was demonstrated, with specific sets of up- and down-regulated DEFL genes. DEFL expression profiling allowed us to gain insight into the mode of action of the elicitors from F. sambucinum. We discovered that the elicitors up-regulated a set of 24 DEFL genes. After challenge inoculation with F. oxysporum, another set of 22 DEFLs showed enhanced expression in IR-displaying seedlings. These DEFLs, in concert with other defense molecules, are suggested to determine enhanced resistance of elicitor-pretreated wheat seedlings. In addition to providing a better understanding of the mode of action of the elicitors from FS-94 in controlling diseases, up-regulated IR-specific DEFL genes represent novel candidates for genetic transformation of plants and development of pathogen-resistant crops

    Employing toxin-antitoxin genome markers for identification of Bifidobacterium and Lactobacillus strains in human metagenomes

    Get PDF
    Recent research has indicated that in addition to the unique genotype each individual may also have a unique microbiota composition. Difference in microbiota composition may emerge from both its species and strain constituents. It is important to know the precise composition especially for the gut microbiota (GM), since it can contribute to the health assessment, personalized treatment, and disease prevention for individuals and groups (cohorts). The existing methods for species and strain composition in microbiota are not always precise and usually not so easy to use. Probiotic bacteria of the genus Bifidobacterium and Lactobacillus make an essential component of human GM. Previously we have shown that in certain Bifidobacterium and Lactobacillus species the RelBE and MazEF superfamily of toxin-antitoxin (TA) systems may be used as functional biomarkers to differentiate these groups of bacteria at the species and strain levels. We have composed a database of TA genes of these superfamily specific for all lactobacilli and bifidobacteria species with complete genome sequence and confirmed that in all Lactobacillus and Bifidobacterium species TA gene composition is species and strain specific. To analyze composition of species and strains of two bacteria genera, Bifidobacterium and Lactobacillus, in human GM we developed TAGMA (toxin antitoxin genes for metagenomes analyses) software based on polymorphism in TA genes. TAGMA was tested on gut metagenomic samples. The results of our analysis have shown that TAGMA can be used to characterize species and strains of Lactobacillus and Bifidobacterium in metagenomes

    De novo sequencing and characterization of floral transcriptome in two species of buckwheat (Fagopyrum)

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Transcriptome sequencing data has become an integral component of modern genetics, genomics and evolutionary biology. However, despite advances in the technologies of DNA sequencing, such data are lacking for many groups of living organisms, in particular, many plant taxa. We present here the results of transcriptome sequencing for two closely related plant species. These species, <it>Fagopyrum esculentum </it>and <it>F. tataricum</it>, belong to the order Caryophyllales - a large group of flowering plants with uncertain evolutionary relationships. <it>F. esculentum </it>(common buckwheat) is also an important food crop. Despite these practical and evolutionary considerations <it>Fagopyrum </it>species have not been the subject of large-scale sequencing projects.</p> <p>Results</p> <p>Normalized cDNA corresponding to genes expressed in flowers and inflorescences of <it>F. esculentum </it>and <it>F. tataricum </it>was sequenced using the 454 pyrosequencing technology. This resulted in 267 (for <it>F. esculentum</it>) and 229 (<it>F. tataricum</it>) thousands of reads with average length of 341-349 nucleotides. <it>De novo </it>assembly of the reads produced about 25 thousands of contigs for each species, with 7.5-8.2× coverage. Comparative analysis of two transcriptomes demonstrated their overall similarity but also revealed genes that are presumably differentially expressed. Among them are retrotransposon genes and genes involved in sugar biosynthesis and metabolism. Thirteen single-copy genes were used for phylogenetic analysis; the resulting trees are largely consistent with those inferred from multigenic plastid datasets. The sister relationships of the Caryophyllales and asterids now gained high support from nuclear gene sequences.</p> <p>Conclusions</p> <p>454 transcriptome sequencing and <it>de novo </it>assembly was performed for two congeneric flowering plant species, <it>F. esculentum </it>and <it>F. tataricum</it>. As a result, a large set of cDNA sequences that represent orthologs of known plant genes as well as potential new genes was generated.</p

    The Constrained Maximal Expression Level Owing to Haploidy Shapes Gene Content on the Mammalian X Chromosome.

    Get PDF
    X chromosomes are unusual in many regards, not least of which is their nonrandom gene content. The causes of this bias are commonly discussed in the context of sexual antagonism and the avoidance of activity in the male germline. Here, we examine the notion that, at least in some taxa, functionally biased gene content may more profoundly be shaped by limits imposed on gene expression owing to haploid expression of the X chromosome. Notably, if the X, as in primates, is transcribed at rates comparable to the ancestral rate (per promoter) prior to the X chromosome formation, then the X is not a tolerable environment for genes with very high maximal net levels of expression, owing to transcriptional traffic jams. We test this hypothesis using The Encyclopedia of DNA Elements (ENCODE) and data from the Functional Annotation of the Mammalian Genome (FANTOM5) project. As predicted, the maximal expression of human X-linked genes is much lower than that of genes on autosomes: on average, maximal expression is three times lower on the X chromosome than on autosomes. Similarly, autosome-to-X retroposition events are associated with lower maximal expression of retrogenes on the X than seen for X-to-autosome retrogenes on autosomes. Also as expected, X-linked genes have a lesser degree of increase in gene expression than autosomal ones (compared to the human/Chimpanzee common ancestor) if highly expressed, but not if lowly expressed. The traffic jam model also explains the known lower breadth of expression for genes on the X (and the Z of birds), as genes with broad expression are, on average, those with high maximal expression. As then further predicted, highly expressed tissue-specific genes are also rare on the X and broadly expressed genes on the X tend to be lowly expressed, both indicating that the trend is shaped by the maximal expression level not the breadth of expression per se. Importantly, a limit to the maximal expression level explains biased tissue of expression profiles of X-linked genes. Tissues whose tissue-specific genes are very highly expressed (e.g., secretory tissues, tissues abundant in structural proteins) are also tissues in which gene expression is relatively rare on the X chromosome. These trends cannot be fully accounted for in terms of alternative models of biased expression. In conclusion, the notion that it is hard for genes on the Therian X to be highly expressed, owing to transcriptional traffic jams, provides a simple yet robustly supported rationale of many peculiar features of X's gene content, gene expression, and evolution

    Comparative Analysis of Developmental Transcriptome Maps of Arabidopsis thaliana and Solanum lycopersicum

    No full text
    The knowledge of gene functions in model organisms is the starting point for the analysis of gene function in non-model species, including economically important ones. Usually, the assignment of gene functions is based on sequence similarity. In plants, due to a highly intricate gene landscape, this approach has some limitations. It is often impossible to directly match gene sets from one plant species to another species based only on their sequences. Thus, it is necessary to use additional information to identify functionally similar genes. Expression patterns have great potential to serve as a source of such information. An important prerequisite for the comparative analysis of transcriptomes is the existence of high-resolution expression maps consisting of comparable samples. Here, we present a transcriptome atlas of tomato (Solanum lycopersicum) consisting of 30 samples of different organs and developmental stages. The samples were selected in a way that allowed for side-by-side comparison with the Arabidopsis thaliana transcriptome map. Newly obtained data are integrated in the TraVA database and are available online, together with tools for their analysis. In this paper, we demonstrate the potential of comparing transcriptome maps for inferring shifts in the expression of paralogous genes

    Mitochondrial Genome of Fagopyrum esculentum and the Genetic Diversity of Extranuclear Genomes in Buckwheat

    No full text
    Fagopyrum esculentum (common buckwheat) is an important agricultural non-cereal grain plant. Despite extensive genetic studies, the information on its mitochondrial genome is still lacking. Using long reads generated by single-molecule real-time technology coupled with circular consensus sequencing (CCS) protocol, we assembled the buckwheat mitochondrial genome and detected that its prevalent form consists of 10 circular chromosomes with a total length of 404 Kb. In order to confirm the presence of a multipartite structure, we developed a new targeted assembly tool capable of processing long reads. The mitogenome contains all genes typical for plant mitochondrial genomes and long inserts of plastid origin (~6.4% of the total mitogenome length). Using this new information, we characterized the genetic diversity of mitochondrial and plastid genomes in 11 buckwheat cultivars compared with the ancestral subspecies, F. esculentum ssp. ancestrale. We found it to be surprisingly low within cultivars: Only three to six variations in the mitogenome and one to two in the plastid genome. In contrast, the divergence with F. esculentum ssp. ancestrale is much higher: 220 positions differ in the mitochondrial genome and 159 in the plastid genome. The SNPs in the plastid genome are enriched in non-synonymous substitutions, in particular in the genes involved in photosynthesis: psbA, psbC, and psbH. This presumably reflects the selection for the increased photosynthesis efficiency as a part of the buckwheat breeding program

    An update to database TraVA: organ-specific cold stress response in Arabidopsis thaliana

    No full text
    Abstract Background Transcriptome map is a powerful tool for a variety of biological studies; transcriptome maps that include different organs, tissues, cells and stages of development are currently available for at least 30 plants. Some of them include samples treated by environmental or biotic stresses. However, most studies explore only limited set of organs and developmental stages (leaves or seedlings). In order to provide broader view of organ-specific strategies of cold stress response we studied expression changes that follow exposure to cold (+ 4 °C) in different aerial parts of plant: cotyledons, hypocotyl, leaves, young flowers, mature flowers and seeds using RNA-seq. Results The results on differential expression in leaves are congruent with current knowledge on stress response pathways, in particular, the role of CBF genes. In other organs, both essence and dynamics of gene expression changes are different. We show the involvement of genes that are confined to narrow expression patterns in non-stress conditions into stress response. In particular, the genes that control cell wall modification in pollen, are activated in leaves. In seeds, predominant pattern is the change of lipid metabolism. Conclusions Stress response is highly organ-specific; different pathways are involved in this process in each type of organs. The results were integrated with previously published transcriptome map of Arabidopsis thaliana and used for an update of a public database TraVa: http://travadb.org/browse/Species=AthStress

    Effect of method of deduplication on estimation of differential gene expression using RNA-seq

    No full text
    Background RNA-seq is a useful tool for analysis of gene expression. However, its robustness is greatly affected by a number of artifacts. One of them is the presence of duplicated reads. Results To infer the influence of different methods of removal of duplicated reads on estimation of gene expression in cancer genomics, we analyzed paired samples of hepatocellular carcinoma (HCC) and non-tumor liver tissue. Four protocols of data analysis were applied to each sample: processing without deduplication, deduplication using a method implemented in SAMtools, and deduplication based on one or two molecular indices (MI). We also analyzed the influence of sequencing layout (single read or paired end) and read length. We found that deduplication without MI greatly affects estimated expression values; this effect is the most pronounced for highly expressed genes. Conclusion The use of unique molecular identifiers greatly improves accuracy of RNA-seq analysis, especially for highly expressed genes. We developed a set of scripts that enable handling of MI and their incorporation into RNA-seq analysis pipelines. Deduplication without MI affects results of differential gene expression analysis, producing a high proportion of false negative results. The absence of duplicate read removal is biased towards false positives. In those cases where using MI is not possible, we recommend using paired-end sequencing layout

    Assembly and Analysis of the Complete Mitochondrial Genome of Capsella bursa-pastoris

    No full text
    Shepherd&rsquo;s purse (Capsella bursa-pastoris) is a cosmopolitan annual weed and a promising model plant for studying allopolyploidization in the evolution of angiosperms. Though plant mitochondrial genomes are a valuable source of genetic information, they are hard to assemble. At present, only the complete mitogenome of C. rubella is available out of all species of the genus Capsella. In this work, we have assembled the complete mitogenome of C. bursa-pastoris using high-precision PacBio SMRT third-generation sequencing technology. It is 287,799 bp long and contains 32 protein-coding genes, 3 rRNAs, 25 tRNAs corresponding to 15 amino acids, and 8 open reading frames (ORFs) supported by RNAseq data. Though many repeat regions have been found, none of them is longer than 1 kbp, and the most frequent structural variant originated from these repeats is present in only 4% of the mitogenome copies. The mitochondrial DNA sequence of C. bursa-pastoris differs from C. rubella, but not from C. orientalis, by two long inversions, suggesting that C. orientalis could be its maternal progenitor species. In total, 377 C to U RNA editing sites have been detected. All genes except cox1 and atp8 contain RNA editing sites, and most of them lead to non-synonymous changes of amino acids. Most of the identified RNA editing sites are identical to corresponding RNA editing sites in A. thaliana

    Origin and diversity of Capsella bursa-pastoris from the genomic point of view

    No full text
    Abstract Background Capsella bursa-pastoris, a cosmopolitan weed of hybrid origin, is an emerging model object for the study of early consequences of polyploidy, being a fast growing annual and a close relative of Arabidopsis thaliana. The development of this model is hampered by the absence of a reference genome sequence. Results We present here a subgenome-resolved chromosome-scale assembly and a genetic map of the genome of Capsella bursa-pastoris. It shows that the subgenomes are mostly colinear, with no massive deletions, insertions, or rearrangements in any of them. A subgenome-aware annotation reveals the lack of genome dominance—both subgenomes carry similar number of genes. While most chromosomes can be unambiguously recognized as derived from either paternal or maternal parent, we also found homeologous exchange between two chromosomes. It led to an emergence of two hybrid chromosomes; this event is shared between distant populations of C. bursa-pastoris. The whole-genome analysis of 119 samples belonging to C. bursa-pastoris and its parental species C. grandiflora/rubella and C. orientalis reveals introgression from C. orientalis but not from C. grandiflora/rubella. Conclusions C. bursa-pastoris does not show genome dominance. In the earliest stages of evolution of this species, a homeologous exchange occurred; its presence in all present-day populations of C. bursa-pastoris indicates on a single origin of this species. The evidence coming from whole-genome analysis challenges the current view that C. grandiflora/rubella was a direct progenitor of C. bursa-pastoris; we hypothesize that it was an extinct (or undiscovered) species sister to C. grandiflora/rubella
    corecore