15 research outputs found

    The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features

    Get PDF
    Background Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. Findings Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20-30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed inPeer reviewe

    The haplotype-resolved chromosome pairs of a heterozygous diploid African cassava cultivar reveal novel pan-genome and allele-specific transcriptome features

    Full text link
    Background: Cassava (Manihot esculenta) is an important clonally propagated food crop in tropical and subtropical regions worldwide. Genetic gain by molecular breeding has been limited, partially because cassava is a highly heterozygous crop with a repetitive and difficult-to-assemble genome. Findings: Here we demonstrate that Pacific Biosciences high-fidelity (HiFi) sequencing reads, in combination with the assembler hifiasm, produced genome assemblies at near complete haplotype resolution with higher continuity and accuracy compared to conventional long sequencing reads. We present 2 chromosome-scale haploid genomes phased with Hi-C technology for the diploid African cassava variety TME204. With consensus accuracy >QV46, contig N50 >18 Mb, BUSCO completeness of 99%, and 35k phased gene loci, it is the most accurate, continuous, complete, and haplotype-resolved cassava genome assembly so far. Ab initio gene prediction with RNA-seq data and Iso-Seq transcripts identified abundant novel gene loci, with enriched functionality related to chromatin organization, meristem development, and cell responses. During tissue development, differentially expressed transcripts of different haplotype origins were enriched for different functionality. In each tissue, 20-30% of transcripts showed allele-specific expression (ASE) differences. ASE bias was often tissue specific and inconsistent across different tissues. Direction-shifting was observed in <2% of the ASE transcripts. Despite high gene synteny, the HiFi genome assembly revealed extensive chromosome rearrangements and abundant intra-genomic and inter-genomic divergent sequences, with large structural variations mostly related to LTR retrotransposons. We use the reference-quality assemblies to build a cassava pan-genome and demonstrate its importance in representing the genetic diversity of cassava for downstream reference-guided omics analysis and breeding. Conclusions: The phased and annotated chromosome pairs allow a systematic view of the heterozygous diploid genome organization in cassava with improved accuracy, completeness, and haplotype resolution. They will be a valuable resource for cassava breeding and research. Our study may also provide insights into developing cost-effective and efficient strategies for resolving complex genomes with high resolution, accuracy, and continuity

    De novo expression of gastrokines in pancreatic precursor lesions impede the development of pancreatic cancer

    Full text link
    Molecular events occurring in stepwise progression from pre-malignant lesions (pancreatic intraepithelial neoplasia; PanIN) to the development of pancreatic ductal adenocarcinoma (PDAC) are poorly understood. Thus, characterization of early PanIN lesions may reveal markers that can help in diagnosing PDAC at an early stage and allow understanding the pathology of the disease. We performed the molecular and histological assessment of patient-derived PanINs, tumor tissues and pancreas from mouse models with PDAC (KC mice that harbor K-RAS mutation in pancreatic tissue), where we noted marked upregulation of gastrokine (GKN) proteins. To further understand the role of gastrokine proteins in PDAC development, GKN-deficient KC mice were developed by intercrossing gastrokine-deficient mice with KC mice. Panc-02 (pancreatic cancer cells of mouse origin) were genetically modified to express GKN1 for further in vitro and in vivo analysis. Our results show that gastrokine proteins were absent in healthy pancreas and invasive cancer, while its expression was prominent in low-grade PanINs. We could detect these proteins in pancreatic juice and serum of KC mice. Furthermore, accelerated PanIN and tumor development were noted in gastrokine deficient KC mice. Loss of gastrokine 1 protein delayed apoptosis during carcinogenesis leading to the development of desmoplastic stroma while loss of gastrokine 2 increased the proliferation rate in precursor lesions. In summary, we identified gastrokine proteins in early pancreatic precursor lesions, where gastrokine proteins delay pancreatic carcinogenesis

    A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes

    Full text link
    A platform for highly parallel direct sequencing of native RNA strands was recently described by Oxford Nanopore Technologies, but despite initial efforts it remains crucial to further investigate the technology for quantification of complex transcriptomes. Here we undertake native RNA sequencing of polyA + RNA from two human cell lines, analysing ~5.2 million aligned native RNA reads. To enable informative comparisons, we also perform relevant ONT direct cDNA- and Illumina-sequencing. We find that while native RNA sequencing does enable some of the anticipated advantages, key unexpected aspects currently hamper its performance, most notably the quite frequent inability to obtain full-length transcripts from single reads, as well as difficulties to unambiguously infer their true transcript of origin. While characterising issues that need to be addressed when investigating more complex transcriptomes, our study highlights that with some defined improvements, native RNA sequencing could be an important addition to the mammalian transcriptomics toolbox

    The RNA-binding protein Musashi controls axon compartment-specific synaptic connectivity through ptp69D mRNA poly(A)-tailing

    No full text
    Synaptic targeting with subcellular specificity is essential for neural circuit assembly. Developing neurons use mechanisms to curb promiscuous synaptic connections and to direct synapse formation to defined subcellular compartments. How this selectivity is achieved molecularly remains enigmatic. Here, we discover a link between mRNA poly(A)-tailing and axon collateral branch-specific synaptic connectivity within the CNS. We reveal that the RNA-binding protein Musashi binds to the mRNA encoding the receptor protein tyrosine phosphatase Ptp69D, thereby increasing poly(A) tail length and Ptp69D protein levels. This regulation specifically promotes synaptic connectivity in one axon collateral characterized by a high degree of arborization and strong synaptogenic potential. In a different compartment of the same axon, Musashi prevents ectopic synaptogenesis, revealing antagonistic, compartment-specific functions. Moreover, Musashi-dependent Ptp69D regulation controls synaptic connectivity in the olfactory circuit. Thus, Musashi differentially shapes synaptic connectivity at the level of individual subcellular compartments and within different developmental and neuron type-specific contexts.ISSN:2666-3864ISSN:2211-124

    The transcriptome response of the ruminal methanogen Methanobrevibacter ruminantium strain M1 to the inhibitor lauric acid

    Get PDF
    OBJECTIVE: Lauric acid (C12) is a medium-chain fatty acid that inhibits growth and production of the greenhouse gas methane by rumen methanogens such as Methanobrevibacter ruminantium. To understand the inhibitory mechanism of C12, a transcriptome analysis was performed in M. ruminantium strain M1 (DSM 1093) using RNA-Seq. RESULTS: Pure cell cultures in the exponential growth phase were treated with 0.4 mg/ml C12, dissolved in dimethyl sulfoxide (DMSO), for 1 h and transcriptomic changes were compared to DMSO-only treated cells (final DMSO concentration 0.2%). Exposure to C12 resulted in differential expression of 163 of the 2280 genes in the M1 genome (maximum log2-fold change 6.6). Remarkably, C12 hardly affected the expression of genes involved in methanogenesis. Instead, most affected genes encode cell-surface associated proteins (adhesion-like proteins, membrane-associated transporters and hydrogenases), and proteins involved in detoxification or DNA-repair processes. Enrichment analysis on the genes regulated in the C12-treated group showed a significant enrichment for categories 'cell surface' and 'mobile elements' (activated by C12), and for the categories 'regulation' and 'protein fate' (represssed). These results are useful to generate and test specific hypotheses on the mechanism how C12 affects rumen methanogens

    The transcriptome response of the ruminal methanogen Methanobrevibacter ruminantium strain M1 to the inhibitor lauric acid

    No full text
    Abstract Objective Lauric acid (C12) is a medium-chain fatty acid that inhibits growth and production of the greenhouse gas methane by rumen methanogens such as Methanobrevibacter ruminantium. To understand the inhibitory mechanism of C12, a transcriptome analysis was performed in M. ruminantium strain M1 (DSM 1093) using RNA-Seq. Results Pure cell cultures in the exponential growth phase were treated with 0.4 mg/ml C12, dissolved in dimethyl sulfoxide (DMSO), for 1 h and transcriptomic changes were compared to DMSO-only treated cells (final DMSO concentration 0.2%). Exposure to C12 resulted in differential expression of 163 of the 2280 genes in the M1 genome (maximum log2-fold change 6.6). Remarkably, C12 hardly affected the expression of genes involved in methanogenesis. Instead, most affected genes encode cell-surface associated proteins (adhesion-like proteins, membrane-associated transporters and hydrogenases), and proteins involved in detoxification or DNA-repair processes. Enrichment analysis on the genes regulated in the C12-treated group showed a significant enrichment for categories ‘cell surface’ and ‘mobile elements’ (activated by C12), and for the categories ‘regulation’ and ‘protein fate’ (represssed). These results are useful to generate and test specific hypotheses on the mechanism how C12 affects rumen methanogens

    A comprehensive examination of Nanopore native RNA sequencing for characterization of complex transcriptomes

    No full text
    A platform for highly parallel direct sequencing of native RNA strands was recently described by Oxford Nanopore Technologies, but despite initial efforts it remains crucial to further investigate the technology for quantification of complex transcriptomes. Here we undertake native RNA sequencing of polyA + RNA from two human cell lines, analysing ~5.2 million aligned native RNA reads. To enable informative comparisons, we also perform relevant ONT direct cDNA- and Illumina-sequencing. We find that while native RNA sequencing does enable some of the anticipated advantages, key unexpected aspects currently hamper its performance, most notably the quite frequent inability to obtain full-length transcripts from single reads, as well as difficulties to unambiguously infer their true transcript of origin. While characterising issues that need to be addressed when investigating more complex transcriptomes, our study highlights that with some defined improvements, native RNA sequencing could be an important addition to the mammalian transcriptomics toolbox.ISSN:2041-172

    MOESM3 of The transcriptome response of the ruminal methanogen Methanobrevibacter ruminantium strain M1 to the inhibitor lauric acid

    No full text
    Additional file 3: Figure S1. Venn diagram indicates the number of differentially expressed genes between the experimental groups and the common overlapping differentially expressed genes. TC: treatment (C12 + DMSO) vs. control (DMSO); TB: treatment (C12 + DMSO) vs. untreated blank; CB: control (DMSO) vs. untreated blank. It should be kept in mind that it is not possible to distinguish between the DMSO and the C12 effect in the dataset comparing the treatment and the blank samples, and that the C12 effect is much better studied in the TC comparison (C12 + DMSO vs DMSO). The DMSO effect can be partial quenched by the C12 effect, so genes regulated in CB and TC are not necessarily regulated in the TB. The 26 common genes differentially expressed in M. ruminantium exposed to DMSO or DMSO + C12 compared to the untreated blank control are outlined in the tables on the right side. The 35 overlapping differentially expressed genes of the TC and CB comparisons are outlined in the table on the left side. The diagram was generated using the online tool at bioinformatics.psb.ugent.be/webtools/Venn/

    MOESM2 of The transcriptome response of the ruminal methanogen Methanobrevibacter ruminantium strain M1 to the inhibitor lauric acid

    No full text
    Additional file 2: Table S2. M. ruminantium M1 genes with significantly changed expression of genes in the cultures exposed to C12 + DMSO as compared to the blank group (log2-fold change < 1 and > 1, false discovery rate < 0.05). The list does not include the 15 regulated hypothetical proteins. The M. ruminantium (mru) open reading frame (ORF) codes are adopted from the Kyoto Encyclopedia of Genes and Genomes
    corecore