183 research outputs found

    Pollution at Lake Quinsigamond

    Get PDF
    https://digitalcommons.wpi.edu/gps-posters/1613/thumbnail.jp

    Incorporating expression data in metabolic modeling: a case study of lactate dehydrogenase

    Full text link
    Integrating biological information from different sources to understand cellular processes is an important problem in systems biology. We use data from mRNA expression arrays and chemical kinetics to formulate a metabolic model relevant to K562 erythroleukemia cells. MAP kinase pathway activation alters the expression of metabolic enzymes in K562 cells. Our array data show changes in expression of lactate dehydrogenase (LDH) isoforms after treatment with phorbol 12-myristate 13-acetate (PMA), which activates MAP kinase signaling. We model the change in lactate production which occurs when the MAP kinase pathway is activated, using a non-equilibrium, chemical-kinetic model of homolactic fermentation. In particular, we examine the role of LDH isoforms, which catalyze the conversion of pyruvate to lactate. Changes in the isoform ratio are not the primary determinant of the production of lactate. Rather, the total concentration of LDH controls the lactate concentration.Comment: In press, Journal of Theoretical Biology. 27 pages, 9 figure

    Power and limitations of electrophoretic separations in proteomics strategies

    Get PDF
    Proteomics can be defined as the large-scale analysis of proteins. Due to the complexity of biological systems, it is required to concatenate various separation techniques prior to mass spectrometry. These techniques, dealing with proteins or peptides, can rely on chromatography or electrophoresis. In this review, the electrophoretic techniques are under scrutiny. Their principles are recalled, and their applications for peptide and protein separations are presented and critically discussed. In addition, the features that are specific to gel electrophoresis and that interplay with mass spectrometry (i.e., protein detection after electrophoresis, and the process leading from a gel piece to a solution of peptides) are also discussed

    Addressing statistical biases in nucleotide-derived protein databases for proteogenomic search strategies

    Get PDF
    [Image: see text] Proteogenomics has the potential to advance genome annotation through high quality peptide identifications derived from mass spectrometry experiments, which demonstrate a given gene or isoform is expressed and translated at the protein level. This can advance our understanding of genome function, discovering novel genes and gene structure that have not yet been identified or validated. Because of the high-throughput shotgun nature of most proteomics experiments, it is essential to carefully control for false positives and prevent any potential misannotation. A number of statistical procedures to deal with this are in wide use in proteomics, calculating false discovery rate (FDR) and posterior error probability (PEP) values for groups and individual peptide spectrum matches (PSMs). These methods control for multiple testing and exploit decoy databases to estimate statistical significance. Here, we show that database choice has a major effect on these confidence estimates leading to significant differences in the number of PSMs reported. We note that standard target:decoy approaches using six-frame translations of nucleotide sequences, such as assembled transcriptome data, apparently underestimate the confidence assigned to the PSMs. The source of this error stems from the inflated and unusual nature of the six-frame database, where for every target sequence there exists five ā€œincorrectā€ targets that are unlikely to code for protein. The attendant FDR and PEP estimates lead to fewer accepted PSMs at fixed thresholds, and we show that this effect is a product of the database and statistical modeling and not the search engine. A variety of approaches to limit database size and remove noncoding target sequences are examined and discussed in terms of the altered statistical estimates generated and PSMs reported. These results are of importance to groups carrying out proteogenomics, aiming to maximize the validation and discovery of gene structure in sequenced genomes, while still controlling for false positives

    Development of an amplicon-based sequencing approach in response to the global emergence of mpox

    Get PDF
    The 2022 multicountry mpox outbreak concurrent with the ongoing Coronavirus Disease 2019 (COVID-19) pandemic further highlighted the need for genomic surveillance and rapid pathogen whole-genome sequencing. While metagenomic sequencing approaches have been used to sequence many of the early mpox infections, these methods are resource intensive and require samples with high viral DNA concentrations. Given the atypical clinical presentation of cases associated with the outbreak and uncertainty regarding viral load across both the course of infection and anatomical body sites, there was an urgent need for a more sensitive and broadly applicable sequencing approach. Highly multiplexed amplicon-based sequencing (PrimalSeq) was initially developed for sequencing of Zika virus, and later adapted as the main sequencing approach for Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2). Here, we used PrimalScheme to develop a primer scheme for human monkeypox virus that can be used with many sequencing and bioinformatics pipelines implemented in public health laboratories during the COVID-19 pandemic. We sequenced clinical specimens that tested presumptively positive for human monkeypox virus with amplicon-based and metagenomic sequencing approaches. We found notably higher genome coverage across the virus genome, with minimal amplicon drop-outs, in using the amplicon-based sequencing approach, particularly in higher PCR cycle threshold (Ct) (lower DNA titer) samples. Further testing demonstrated that Ct value correlated with the number of sequencing reads and influenced the percent genome coverage. To maximize genome coverage when resources are limited, we recommend selecting samples with a PCR Ct below 31 Ct and generating 1 million sequencing reads per sample. To support national and international public health genomic surveillance efforts, we sent out primer pool aliquots to 10 laboratories across the United States, United Kingdom, Brazil, and Portugal. These public health laboratories successfully implemented the human monkeypox virus primer scheme in various amplicon sequencing workflows and with different sample types across a range of Ct values. Thus, we show that amplicon-based sequencing can provide a rapidly deployable, cost-effective, and flexible approach to pathogen whole-genome sequencing in response to newly emerging pathogens. Importantly, through the implementation of our primer scheme into existing SARS-CoV-2 workflows and across a range of sample types and sequencing platforms, we further demonstrate the potential of this approach for rapid outbreak response.This publication was made possible by CTSA Grant Number UL1 TR001863 from the National Center for Advancing Translational Science (NCATS), a component of the National Institutes of Health (NIH) awarded to CBFV. INSA was partially funded by the HERA project (Grant/ 2021/PHF/23776) supported by the European Commission through the European Centre for Disease Control (to VB).info:eu-repo/semantics/publishedVersio

    The possible functions of duplicated ets (GGAA) motifs located near transcription start sites of various human genes

    Get PDF
    Transcription is one of the most fundamental nuclear functions and is an enzyme complex-mediated reaction that converts DNA sequences into mRNA. Analyzing DNA sequences of 5ā€²-flanking regions of several human genes that respond to 12-O-tetradecanoyl-phorbol-13-acetate (TPA) in HL-60 cells, we have identified that the ets (GGAA) motifs are duplicated, overlapped, or clustered within a 500-bp distance from the most 5ā€²-upstream region of the cDNA. Multiple protein factors including Ets family proteins are known to recognize and bind to the GGAA containing sequences. In addition, it has been reported that the ets motifs play important roles in regulation of various promoters. Here, we propose a molecular mechanism, defined by the presence of duplication and multiplication of the GGAA motifs, that is responsible for the initiation of transcription of several genes and for the recruitment of binding proteins to the transcription start site (TSS) of TATA-less promoters
    • ā€¦
    corecore