Divide and Conquer (DC) BLAST: fast and easy BLAST execution within HPC environments
Bioinformatics currently faces very large data sets whose computational jobs, especially sequence similarity searches, can take an extremely long time to run. For example, the National Center for Biotechnology Information (NCBI) Basic Local Alignment Search Tool (BLAST and BLAST+) suite, which is by far the most widely used tool for rapid similarity searching among nucleic acid or amino acid sequences, is highly central processing unit (CPU) intensive. Although the BLAST suite performs searches very rapidly, it can be accelerated further. In recent years, distributed computing environments have become more widely accessible and used because of the increasing availability of high-performance computing (HPC) systems. Simple solutions for data parallelization are therefore needed to expedite BLAST and other sequence analysis tools. However, existing software for parallel sequence similarity searches often requires extensive computational experience and skill on the part of the user. To accelerate BLAST and other sequence analysis tools, Divide and Conquer BLAST (DCBLAST) was developed to perform NCBI BLAST searches within a cluster, grid, or HPC environment by using a query sequence distribution approach. Scaling from 1 to 256 CPU cores resulted in significant improvements in processing speed. Thus, DCBLAST dramatically accelerates the execution of BLAST searches using a simple, accessible, robust, and parallel approach. DCBLAST works across multiple nodes automatically and overcomes the speed limitation of single-node BLAST programs. DCBLAST can be used on any HPC system, can take advantage of hundreds of nodes, and has no output limitations. 
This freely available tool simplifies distributed computation pipelines to facilitate the rapid discovery of sequence similarities between very large data sets.
This work was supported by the Department of Energy (DOE), Office of Science, Genomic Science Program [DE-SC0008834 to JCC]. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
The authors would like to thank the Information Technology Department at the University of Nevada, Reno for the use of computing time on the High-Performance Computing Cluster (http://www.unr.edu/it/research-resources/the-grid) and Mary Ann Cushman and Pradeep Yerramsetty for providing helpful and clarifying comments on the manuscript.
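The query sequence distribution approach named in the DCBLAST abstract can be illustrated with a minimal sketch. This is not DCBLAST's own code: the function names `read_fasta` and `split_queries`, the round-robin assignment, and the example sequences are all assumptions made for illustration. The idea shown is only that a multi-sequence FASTA query is divided into independent chunks, each of which could be searched as a separate job.

```python
from typing import Iterable, Iterator, List, Tuple

Record = Tuple[str, str]  # (header without ">", sequence)


def read_fasta(lines: Iterable[str]) -> Iterator[Record]:
    """Yield (header, sequence) pairs from FASTA-formatted lines."""
    header = None
    seq_parts: List[str] = []
    for raw in lines:
        line = raw.strip()
        if not line:
            continue  # skip blank lines
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(seq_parts)
            header, seq_parts = line[1:], []
        elif header is not None:
            seq_parts.append(line)
    if header is not None:
        yield header, "".join(seq_parts)


def split_queries(records: Iterable[Record], n_chunks: int) -> List[List[Record]]:
    """Distribute query records round-robin over at most n_chunks chunks.

    Hypothetical helper: the real tool may balance chunks differently
    (for example by sequence length rather than by record count).
    """
    if n_chunks < 1:
        raise ValueError("n_chunks must be at least 1")
    chunks: List[List[Record]] = [[] for _ in range(n_chunks)]
    for i, record in enumerate(records):
        chunks[i % n_chunks].append(record)
    return [chunk for chunk in chunks if chunk]  # drop empty chunks


# Made-up example input: five short query sequences.
example = """>q1
ACGT
>q2
GGCC
TTAA
>q3
ATAT
>q4
CCCC
>q5
GATTACA
""".splitlines()

chunks = split_queries(read_fasta(example), n_chunks=2)
for idx, chunk in enumerate(chunks):
    print(idx, [header for header, _ in chunk])
```

Under this sketch, each chunk would be written to its own query file, searched by a separate BLAST job on the cluster, and the per-chunk results concatenated afterwards. Because each query sequence is searched independently of the others, the partitioning itself does not change which hits are found.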
Laying the Foundation for Crassulacean Acid Metabolism (CAM) Biodesign: Expression of the C4 Metabolism Cycle Genes of CAM in Arabidopsis
Crassulacean acid metabolism (CAM) is a specialized mode of photosynthesis that exploits a temporal CO2 pump with nocturnal CO2 uptake and concentration to reduce photorespiration, improve water-use efficiency (WUE), and optimize the adaptability of plants to hotter and drier climates. Introducing the CAM photosynthetic machinery into C3 (or C4) photosynthesis plants (CAM Biodesign) represents a potentially breakthrough strategy for improving WUE while maintaining high productivity. To optimize the success of CAM Biodesign approaches, the functional analysis of individual C4-metabolism cycle genes is necessary to identify the essential genes for robust CAM pathway introduction. Here, we isolated and analyzed the subcellular localizations of 13 enzymes and regulatory proteins of the C4-metabolism cycle of CAM from the common ice plant in stably transformed Arabidopsis thaliana. Six components of the carboxylation module were analyzed, including beta-carbonic anhydrase (McBCA2), phosphoenolpyruvate carboxylase (McPEPC1), phosphoenolpyruvate carboxylase kinase (McPPCK1), NAD-dependent malate dehydrogenase (McNAD-MDH1, McNAD-MDH2), and NADP-dependent malate dehydrogenase (McNADP-MDH1). In addition, seven components of the decarboxylation module were analyzed, including NAD-dependent malic enzyme (McNAD-ME1, McNAD-ME2), NADP-dependent malic enzyme (McNADP-ME1, McNADP-ME2), pyruvate, orthophosphate dikinase (McPPDK), pyruvate, orthophosphate dikinase-regulatory protein (McPPDK-RP), and phosphoenolpyruvate carboxykinase (McPEPCK). Ectopic overexpression of most C4-metabolism cycle components, except for McNADP-MDH1, McPPDK-RP, and McPEPCK, resulted in increased rosette diameter, leaf area, and leaf fresh weight of A. thaliana. Overexpression of most carboxylation module components resulted in increased stomatal conductance and dawn/dusk titratable acidity (TA) as an indirect measure of organic acid (mainly malate) accumulation in A. thaliana. 
In contrast, overexpression of the decarboxylating malic enzymes reduced stomatal conductance and TA. This comprehensive study provides fundamental insights into the relative functional contributions of each of the individual components of the core C4-metabolism cycle of CAM and represents a critical first step in laying the foundation for CAM Biodesign.
Sporobolus stapfianus: Insights into desiccation tolerance in the resurrection grasses from linking transcriptomics to metabolomics
Predominant clusters of SDATs that share distinct patterns of abundance during dehydration: A. Predominant patterns of abundance for transcripts in clusters that exhibited increased abundance during dehydration. B. Predominant patterns of abundance for transcripts in clusters that exhibited decreased abundance during dehydration. (PDF 226 kb)
Identification of Genes Encoding Enzymes Catalyzing the Early Steps of Carrot Polyacetylene Biosynthesis
Polyacetylenic lipids accumulate in various Apiaceae species after pathogen attack, suggesting that these compounds are naturally occurring pesticides and potentially valuable resources for crop improvement. These compounds also promote human health and slow tumor growth. Even though polyacetylenic lipids were discovered decades ago, the biosynthetic pathway underlying their production is largely unknown. To begin filling this gap and ultimately enable polyacetylene engineering, we studied polyacetylenes and their biosynthesis in the major Apiaceae crop carrot (Daucus carota subsp. sativus). Using gas chromatography and mass spectrometry, we identified three known polyacetylenes and assigned provisional structures to two novel polyacetylenes. We also quantified these compounds in carrot leaf, petiole, root xylem, root phloem, and root periderm extracts. Falcarindiol and falcarinol predominated and accumulated primarily in the root periderm. Since the multiple double and triple carbon-carbon bonds that distinguish polyacetylenes from ubiquitous fatty acids are often introduced by Δ12 oleic acid desaturase (FAD2)-type enzymes, we mined the carrot genome for FAD2 genes. We identified a FAD2 family with an unprecedented 24 members and analyzed public, tissue-specific carrot RNA-Seq data to identify coexpressed members with root periderm-enhanced expression. Six candidate genes were heterologously expressed individually and in combination in yeast and Arabidopsis (Arabidopsis thaliana), resulting in the identification of one canonical FAD2 that converts oleic to linoleic acid, three divergent FAD2-like acetylenases that convert linoleic into crepenynic acid, and two bifunctional FAD2s with Δ12 and Δ14 desaturase activity that convert crepenynic into the further desaturated dehydrocrepenynic acid, a polyacetylene pathway intermediate. 
These genes can now be used as a basis for discovering other steps of falcarin-type polyacetylene biosynthesis, to modulate polyacetylene levels in plants, and to test the in planta function of these molecules.
Crassulacean Acid Metabolism Abiotic Stress-Responsive Transcription Factors: a Potential Genetic Engineering Approach for Improving Crop Tolerance to Abiotic Stress
This perspective paper explores the utilization of abiotic stress-responsive transcription factors (TFs) from crassulacean acid metabolism (CAM) plants to improve abiotic stress tolerance in crop plants. CAM is a specialized type of photosynthetic adaptation that enhances water-use efficiency (WUE) by shifting CO2 uptake to all or part of the nighttime, when evaporative water losses are minimal. Recent studies have shown that TF-based genetic engineering could be a useful approach for improving plant abiotic stress tolerance because of the role of TFs as master regulators of clusters of stress-responsive genes. Here, we explore the use of abiotic stress-responsive TFs from CAM plants to improve abiotic stress tolerance and WUE in crops by controlling the expression of gene cohorts that mediate drought-responsive adaptations. Recent research has revealed several TF families, including AP2/ERF, MYB, WRKY, NAC, NF-Y, and bZIP, that might regulate water-deficit stress responses and CAM in the inducible CAM plant Mesembryanthemum crystallinum under water-deficit stress-induced CAM and in the obligate CAM plant Kalanchoe fedtschenkoi. In some instances, overexpression of genes from these families in Arabidopsis thaliana improves its abiotic stress tolerance. Therefore, we propose that TF-based genetic engineering with a small number of CAM abiotic stress-responsive TFs will be a promising strategy for improving abiotic stress tolerance and WUE in crop plants in the hotter and drier landscape projected for the 21st century and beyond.
Evolution of l-DOPA 4,5-dioxygenase activity allows for recurrent specialisation to betalain pigmentation in Caryophyllales
The evolution of l-DOPA 4,5-dioxygenase activity, encoded by the gene DODA, was a key step in the origin of betalain biosynthesis in Caryophyllales. We previously proposed that l-DOPA 4,5-dioxygenase activity evolved via a single Caryophyllales-specific neofunctionalisation event within the DODA gene lineage. However, this neofunctionalisation event has not been confirmed and the DODA gene lineage exhibits numerous gene duplication events, whose evolutionary significance is unclear. To address this, we functionally characterised 23 distinct DODA proteins for l-DOPA 4,5-dioxygenase activity, from four betalain-pigmented and five anthocyanin-pigmented species, representing key evolutionary transitions across Caryophyllales. By mapping these functional data to an updated DODA phylogeny, we then explored the evolution of l-DOPA 4,5-dioxygenase activity. We find that low l-DOPA 4,5-dioxygenase activity is distributed across the DODA gene lineage. In this context, repeated gene duplication events within the DODA gene lineage give rise to polyphyletic occurrences of elevated l-DOPA 4,5-dioxygenase activity, accompanied by convergent shifts in key functional residues and distinct genomic patterns of micro-synteny. In the context of an updated organismal phylogeny and newly inferred pigment reconstructions, we argue that repeated convergent acquisition of elevated l-DOPA 4,5-dioxygenase activity is consistent with recurrent specialisation to betalain synthesis in Caryophyllales.
Keywords: Caryophyllales; anthocyanins; betalains; convergent evolution; gene duplication; l-DOPA 4,5-dioxygenase (DODA); metabolic operon; plant pigments; specialised metabolism
The bracteatus pineapple genome and domestication of clonally propagated crops
Domestication of clonally propagated crops such as pineapple from South America was hypothesized to be a 'one-step operation'. We sequenced the genome of Ananas comosus var. bracteatus CB5 and assembled 513 Mb into 25 chromosomes with 29,412 genes. Comparison of the genomes of CB5, F153 and MD2 elucidated the genomic basis of fiber production, color formation, sugar accumulation and fruit maturation. We also resequenced 89 Ananas genomes. Cultivars 'Smooth Cayenne' and 'Queen' exhibited ancient and recent admixture, while 'Singapore Spanish' supported a one-step operation of domestication. We identified 25 selective sweeps, including a strong sweep containing a pair of tandemly duplicated bromelain inhibitors. Four candidate genes for self-incompatibility were linked in F153, but were not functional in self-compatible CB5. Our findings support the coexistence of sexual recombination and a one-step operation in the domestication of clonally propagated crops. This work guides the exploration of sexual and asexual domestication trajectories in other clonally propagated crops.
Multidrug resistant pulmonary tuberculosis treatment regimens and patient outcomes: an individual patient data meta-analysis of 9,153 patients.
Treatment of multidrug resistant tuberculosis (MDR-TB) is lengthy, toxic, expensive, and has generally poor outcomes. We undertook an individual patient data meta-analysis to assess the impact on outcomes of the type, number, and duration of drugs used to treat MDR-TB.
Detection of primary sites in unknown primary tumors using FDG-PET or FDG-PET/CT
Background: Carcinoma of unknown primary tumors (CUP) is present in 0.5%-9% of all patients with malignant neoplasms; only 20%-27% of primary sites are identified before the patients die. Currently, 18F-fluorodeoxy-glucose positron-emission tomography (18F-FDG PET) or PET combined with computed tomography (PET/CT) is widely used for the diagnosis of CUP. However, the diagnostic yield of the primary site varies. The aim of this study was to determine whether PET or PET/CT has additional advantages over the conventional diagnostic workup in detecting the primary origin of CUP.
Findings: Twenty patients with unknown primary tumors that underwent PET or PET/CT were included in this study. For all patients, the conventional diagnostic workup was unsuccessful in detecting the primary sites. Among 20 patients, 11 had PET scans. The remaining nine patients had PET/CT. In all 20 patients, neither the PET nor PET/CT identified the primary site of the tumor, including six cases with cervical lymph node metastases. The PET and PET/CT revealed sites of FDG uptake other than those associated with known metastases in seven patients, but these findings did not influence patient management or therapy. Two patients had unnecessary invasive diagnostic procedures due to false positive results on the PET or PET/CT.
Conclusions: Although it is inconclusive because of small sample size of the study, the additional value of PET or PET/CT for the detection of primary sites in patients with CUP might be less than expected; especially in patients that have already had extensive conventional diagnostic workups. Further study is needed to confirm this finding.
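The sample-size caveat in the preceding abstract can be made concrete with a short calculation. The abstract does not report this calculation. It is an illustrative sketch that assumes the 20 patients can be treated as independent trials with a common detection probability, and the function name `upper_bound_zero_events` is hypothetical. With 0 detections in 20 patients, the one-sided exact (Clopper-Pearson) upper confidence bound on the detection rate p solves (1 - p)^n = alpha.

```python
def upper_bound_zero_events(n: int, confidence: float = 0.95) -> float:
    """One-sided exact upper bound on a proportion when 0 of n trials succeed.

    With zero successes, the Clopper-Pearson bound has the closed form
    p_upper = 1 - alpha ** (1 / n), where alpha = 1 - confidence.
    """
    if n < 1:
        raise ValueError("n must be at least 1")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must be strictly between 0 and 1")
    alpha = 1.0 - confidence
    return 1.0 - alpha ** (1.0 / n)


# 0 primary sites identified among 20 patients, as reported in the abstract.
bound = upper_bound_zero_events(20)
print(f"{bound:.3f}")  # prints 0.139
```

Under these assumptions, a true detection rate of up to roughly 14% would still be compatible with observing no detections in 20 patients, which is one way to read the authors' statement that the result is inconclusive.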