30 research outputs found
MSIsensor-ct: Microsatellite instability detection using cfDNA sequencing data
MOTIVATION: Microsatellite instability (MSI) is a promising biomarker for cancer prognosis and chemosensitivity. Techniques are rapidly evolving for the detection of MSI from tumor-normal paired or tumor-only sequencing data. However, tumor tissues are often insufficient, unavailable, or otherwise difficult to procure. Increasing clinical evidence indicates the enormous potential of plasma circulating cell-free DNA (cfNDA) technology as a noninvasive MSI detection approach.
RESULTS: We developed MSIsensor-ct, a bioinformatics tool based on a machine learning protocol, dedicated to detecting MSI status using cfDNA sequencing data with a potential stable MSIscore threshold of 20%. Evaluation of MSIsensor-ct on independent testing datasets with various levels of circulating tumor DNA (ctDNA) and sequencing depth showed 100% accuracy within the limit of detection (LOD) of 0.05% ctDNA content. MSIsensor-ct requires only BAM files as input, rendering it user-friendly and readily integrated into next generation sequencing (NGS) analysis pipelines.
AVAILABILITY: MSIsensor-ct is freely available at https://github.com/niu-lab/MSIsensor-ct.
SUPPLEMENTARY INFORMATION: Supplementary data are available at Briefings in Bioinformatics online
OsbZIP18, a Positive Regulator of Serotonin Biosynthesis, Negatively Controls the UV-B Tolerance in Rice
Serotonin (5-hydroxytryptamine) plays an important role in many developmental processes and biotic/abiotic stress responses in plants. Although serotonin biosynthetic pathways in plants have been uncovered, knowledge of the mechanisms of serotonin accumulation is still limited, and no regulators have been identified to date. Here, we identified the basic leucine zipper transcription factor OsbZIP18 as a positive regulator of serotonin biosynthesis in rice. Overexpression of OsbZIP18 strongly induced the levels of serotonin and its early precursors (tryptophan and tryptamine), resulting in stunted growth and dark-brown phenotypes. A function analysis showed that OsbZIP18 activated serotonin biosynthesis genes (including tryptophan decarboxylase 1 (OsTDC1), tryptophan decarboxylase 3 (OsTDC3), and tryptamine 5-hydroxylase (OsT5H)) by directly binding to the ACE-containing or G-box cis-elements in their promoters. Furthermore, we demonstrated that OsbZIP18 is induced by UV-B stress, and experiments using UV-B radiation showed that transgenic plants overexpressing OsbZIP18 exhibited UV-B stress-sensitive phenotypes. Besides, exogenous serotonin significantly exacerbates UV-B stress of OsbZIP18_OE plants, suggesting that the excessive accumulation of serotonin may be responsible for the sensitivity of OsbZIP18_OE plants to UV-B stress. Overall, we identified a positive regulator of serotonin biosynthesis and demonstrated that UV-B-stress induced serotonin accumulation, partly in an OsbZIP18-dependent manner
Empirical vs Bayesian approach for estimating haplotypes from genotypes of unrelated individuals
BACKGROUND: The completion of the HapMap project has stimulated further development of haplotype-based methodologies for disease associations. A key aspect of such development is the statistical inference of individual diplotypes from unphased genotypes. Several methodologies for inferring haplotypes have been developed, but they have not been evaluated extensively to determine which method not only performs well, but also can be easily incorporated in downstream haplotype-based association analyses. In this paper, we attempt to do so. Our evaluation was carried out by comparing the two leading Bayesian methods, implemented in PHASE and HAPLOTYPER, and the two leading empirical methods, implemented in PL-EM and HPlus. We used these methods to analyze real data, namely the dense genotypes on X-chromosome of 30 European and 30 African trios provided by the International HapMap Project, and simulated genotype data. Our conclusions are based on these analyses. RESULTS: All programs performed very well on X-chromosome data, with an average similarity index of 0.99 and an average prediction rate of 0.99 for both European and African trios. On simulated data with approximation of coalescence, PHASE implementing the Bayesian method based on the coalescence approximation outperformed other programs on small sample sizes. When the sample size increased, other programs performed as well as PHASE. PL-EM and HPlus implementing empirical methods required much less running time than the programs implementing the Bayesian methods. They required only one hundredth or thousandth of the running time required by PHASE, particularly when analyzing large sample sizes and large umber of SNPs. CONCLUSION: For large sample sizes (hundreds or more), which most association studies require, the two empirical methods might be used since they infer the haplotypes as accurately as any Bayesian methods and can be incorporated easily into downstream haplotype-based analyses such as haplotype-association analyses
Accurate authentication of Dendrobium officinale and its closely related species by comparative analysis of complete plastomes
Owing to its great medicinal and ornamental values, Dendrobium officinale is frequently adulterated with other Dendrobium species on the market. Unfortunately, the utilization of the common DNA markers ITS, ITS2, and matK+rbcL is unable to distinguish D. officinale from 5 closely related species of it (D. tosaense, D. shixingense, D. flexicaule, D. scoriarum and D. aduncum). Here, we compared 63 Dendrobium plastomes comprising 40 newly sequenced plastomes of the 6 species and 23 previously published plastomes. The plastomes of D. officinale and its closely related species were shown to have conserved genome structure and gene content. Comparative analyses revealed that small single copy region contained higher variation than large single copy and inverted repeat regions, which was mainly attributed to the loss/retention of ndh genes. Furthermore, the intraspecific sequence variability among different Dendrobium species was shown to be diversified, which necessitates a cautious evaluation of genetic markers specific for different Dendrobium species. By evaluating the maximum likelihood trees inferred from different datasets, we found that the complete plastome sequence dataset had the highest discriminatory power for D. officinale and its closely related species, indicating that complete plastome sequences can be used to accurately authenticate Dendrobium species. KEY WORDS: Authentication, Complete plastome sequence, Dendrobium officinale, Plastomic comparison, Genetic marke
CRISPR/Cas9-induced β-carotene hydroxylase mutation in Dunaliella salina CCAP19/18
Abstract Dunaliella salina (D. salina) has been exploited as a novel expression system for the field of genetic engineering. However, owing to the low or inconsistent expression of target proteins, it has been greatly restricted to practical production of recombinant proteins. Since the accurate gene editing function of clustered regularly interspaced short palindromic repeat (CRISPR)/Cas system, β-carotene hydroxylase gene was chosen as an example to explore D. salina application with the purpose of improving expression level of foreign genes. In this paper, based on pKSE401 backbone, three CRISPR/Cas9 binary vectors were constructed to targeting exon 1 and 3 of the β-carotene hydroxylase of D. salina CCAP19/18 (Dschyb). D. salina mutants were obtained by salt gradient transformation method, and the expression of Dschyb gene were identified through real-time fluorescent quantitative PCR. Moreover, carotenoids content was analyzed by high-performance liquid chromatography at different time points after high intensity treatment. Compared with wild type strains, the β-carotene levels of mutants showed a significant increase, nearly up to 1.4 μg/ml, and the levels of zeaxanthin decreased to various degrees in mutants. All the results provide a compelling evidence for targeted gene editing in D. salina. This study gave a first successful gene editing of D. salina which has a very important practical significance for increasing carotene yield and meeting realistic industry demand. Furthermore, it provides an approach to overcome the current obstacles of D. salina, and then gives a strong tool to facilitates the development and application of D. salina system
The Complete Plastome Sequences of Four Orchid Species: Insights into the Evolution of the Orchidaceae and the Utility of Plastomic Mutational Hotspots
Orchidaceae (orchids) is the largest family in the monocots, including about 25,000 species in 880 genera and five subfamilies. Many orchids are highly valued for their beautiful and long-lasting flowers. However, the phylogenetic relationships among the five orchid subfamilies remain unresolved. The major dispute centers on whether the three one-stamened subfamilies, Epidendroideae, Orchidoideae, and Vanilloideae, are monophyletic or paraphyletic. Moreover, structural changes in the plastid genome (plastome) and the effective genetic loci at the species-level phylogenetics of orchids have rarely been documented. In this study, we compared 53 orchid plastomes, including four newly sequenced ones, that represent four remote genera: Dendrobium, Goodyera, Paphiopedilum, and Vanilla. These differ from one another not only in their lengths of inverted repeats and small single copy regions but also in their retention of ndh genes. Comparative analyses of the plastomes revealed that the expansion of inverted repeats in Paphiopedilum and Vanilla is associated with a loss of ndh genes. In orchid plastomes, mutational hotspots are genus specific. After having carefully examined the data, we propose that the three loci 5′trnK-rps16, trnS-trnG, and rps16-trnQ might be powerful markers for genera within Epidendroideae, and clpP-psbB and rps16-trnQ might be markers for genera within Cypripedioideae. After analyses of a partitioned dataset, we found that our plastid phylogenomic trees were congruent in a topology where two one-stamened subfamilies (i.e., Epidendroideae and Orchidoideae) were sisters to a multi-stamened subfamily (i.e., Cypripedioideae) rather than to the other one-stamened subfamily (Vanilloideae), suggesting that the living one-stamened orchids are paraphyletic
Plastome-wide comparison reveals new SNV resources for the authentication of Dendrobium huoshanense and its corresponding medicinal slice (Huoshan Fengdou)
Dendrobium species and their corresponding medicinal slices have been extensively used as traditional Chinese medicine (TCM) in many Asian countries. However, it is extremely difficult to identify Dendrobium species based on their morphological and chemical features. In this study, the plastomes of D. huoshanense were used as a model system to investigate the hypothesis that plastomic mutational hotspot regions could provide a useful single nucleotide variants (SNVs) resource for authentication studies. We surveyed the plastomes of 17 Dendrobium species, including the newly sequenced plastome of D. huoshanense. A total of 19 SNVs that could be used for the authentication of D. huoshanense were detected. On the basis of this comprehensive comparison, we identified the four most informative hotspot regions in the Dendrobium plastome that encompass ccsA to ndhF, matK to 3′trnG, rpoB to psbD, and trnT to rbcL. Furthermore, to established a simple and accurate method for the authentication of D. huoshanense and its medicinal slices, a total of 127 samples from 20 Dendrobium species including their corresponding medicinal slices (Fengdous) were used in this study. Our results suggest that D. huoshanense and its medicinal slices can be rapidly and unequivocally identified using this method that combines real-time PCR with the amplification refractory mutation system (ARMS). KEY WORDS: Dendrobium huoshanense, Plastome, Slices, Single nucleotide variants, RT-ARMS, Authenticatio
Mutational Biases and GC-Biased Gene Conversion Affect GC Content in the Plastomes of Dendrobium Genus
The variation of GC content is a key genome feature because it is associated with fundamental elements of genome organization. However, the reason for this variation is still an open question. Different kinds of hypotheses have been proposed to explain the variation of GC content during genome evolution. However, these hypotheses have not been explicitly investigated in whole plastome sequences. Dendrobium is one of the largest genera in the orchid species. Evolutionary studies of the plastomic organization and base composition are limited in this genus. In this study, we obtained the high-quality plastome sequences of D. loddigesii and D. devonianum. The comparison results showed a nearly identical organization in Dendrobium plastomes, indicating that the plastomic organization is highly conserved in Dendrobium genus. Furthermore, the impact of three evolutionary forces—selection, mutational biases, and GC-biased gene conversion (gBGC)—on the variation of GC content in Dendrobium plastomes was evaluated. Our results revealed: (1) consistent GC content evolution trends and mutational biases in single-copy (SC) and inverted repeats (IRs) regions; and (2) that gBGC has influenced the plastome-wide GC content evolution. These results suggest that both mutational biases and gBGC affect GC content in the plastomes of Dendrobium genus
Comparative Analysis of the Complete Plastomes of Apostasia wallichii and Neuwiedia singapureana (Apostasioideae) Reveals Different Evolutionary Dynamics of IR/SSC Boundary among Photosynthetic Orchids
Apostasioideae, consists of only two genera, Apostasia and Neuwiedia, which are mainly distributed in Southeast Asia and northern Australia. The floral structure, taxonomy, biogeography, and genome variation of Apostasioideae have been intensively studied. However, detailed analyses of plastome composition and structure and comparisons with those of other orchid subfamilies have not yet been conducted. Here, the complete plastome sequences of Apostasia wallichii and Neuwiedia singapureana were sequenced and compared with 43 previously published photosynthetic orchid plastomes to characterize the plastome structure and evolution in the orchids. Unlike many orchid plastomes (e.g., Paphiopedilum and Vanilla), the plastomes of Apostasioideae contain a full set of 11 functional NADH dehydrogenase (ndh) genes. The distribution of repeat sequences and simple sequence repeat elements enhanced the view that the mutation rate of non-coding regions was higher than that of coding regions. The 10 loci—ndhA intron, matK-5′trnK, clpP-psbB, rps8-rpl14, trnT-trnL, 3′trnK-matK, clpP intron, psbK-trnK, trnS-psbC, and ndhF-rpl32—that had the highest degrees of sequence variability were identified as mutational hotspots for the Apostasia plastome. Furthermore, our results revealed that plastid genes exhibited a variable evolution rate within and among different orchid genus. Considering the diversified evolution of both coding and non-coding regions, we suggested that the plastome-wide evolution of orchid species was disproportional. Additionally, the sequences flanking the inverted repeat/small single copy (IR/SSC) junctions of photosynthetic orchid plastomes were categorized into three types according to the presence/absence of ndh genes. Different evolutionary dynamics for each of the three IR/SSC types of photosynthetic orchid plastomes were also proposed