30 research outputs found

    The Size of the Human Proteome: The Width and Depth

    Get PDF
    This work discusses bioinformatics and experimental approaches to explore the human proteome, a constellation of proteins expressed in different tissues and organs. As the human proteome is not a static entity, it seems necessary to estimate the number of different protein species (proteoforms) and measure the number of copies of the same protein in a specific tissue. Here, meta-analysis of neXtProt knowledge base is proposed for theoretical prediction of the number of different proteoforms that arise from alternative splicing (AS), single amino acid polymorphisms (SAPs), and posttranslational modifications (PTMs). Three possible cases are considered: (1) PTMs and SAPs appear exclusively in the canonical sequences of proteins, but not in splice variants; (2) PTMs and SAPs can occur in both proteins encoded by canonical sequences and in splice variants; (3) all modification types (AS, SAP, and PTM) occur as independent events. Experimental validation of proteoforms is limited by the analytical sensitivity of proteomic technology. A bell-shaped distribution histogram was generated for proteins encoded by a single chromosome, with the estimation of copy numbers in plasma, liver, and HepG2 cell line. The proposed metabioinformatics approaches can be used for estimation of the number of different proteoforms for any group of protein-coding genes

    The Curious Case of the HepG2 Cell Line: 40 Years of Expertise

    No full text
    Liver cancer is the third leading cause of cancer death worldwide. Representing such a dramatic impact on our lives, liver cancer is a significant public health concern. Sustainable and reliable methods for preventing and treating liver cancer require fundamental research on its molecular mechanisms. Cell lines are treated as in vitro equivalents of tumor tissues, making them a must-have for basic research on the nature of cancer. According to recent discoveries, certified cell lines retain most genetic properties of the original tumor and mimic its microenvironment. On the other hand, modern technologies allowing the deepest level of detail in omics landscapes have shown significant differences even between samples of the same cell line due to cross- and mycoplasma infection. This and other observations suggest that, in some cases, cell cultures are not suitable as cancer models, with limited predictive value for the effectiveness of new treatments. HepG2 is a popular hepatic cell line. It is used in a wide range of studies, from the oncogenesis to the cytotoxicity of substances on the liver. In this regard, we set out to collect up-to-date information on the HepG2 cell line to assess whether the level of heterogeneity of the cell line allows in vitro biomedical studies as a model with guaranteed production and quality

    The Gene-Centric Content Management System and Its Application for Cognitive Proteomics

    No full text
    The Human Proteome Project is moving into the next phase of creating and/or reconsidering the functional annotations of proteins using the chromosome-centric paradigm. This challenge cannot be solved exclusively using automated means, but rather requires human intelligence for interpreting the combined data. To foster the integration between human cognition and post-genome array a number of specific tools were recently developed, among them CAPER, GenomewidePDB, and The Proteome Browser (TPB). For the purpose of tackling the task of protein functional annotating the Gene-Centric Content Management System (GenoCMS) was expanded with new features. The goal was to enable bioinformaticans to develop self-made applications and to position these applets within the generalized informational canvas supported by GenoCMS. We report the results of GenoCMS-enabled integration of the concordant informational flows in the chromosome-centric framework of the human chromosome 18 project. The workflow described in the article can be scaled to other human chromosomes, and also supplemented with new tracks created by the user. The GenoCMS is an example of a project-oriented informational system, which are important for public data sharing

    The Expectation and Reality of the HepG2 Core Metabolic Profile

    No full text
    To represent the composition of small molecules circulating in HepG2 cells and the formation of the “core” of characteristic metabolites that often attract researchers’ attention, we conducted a meta-analysis of 56 datasets obtained through metabolomic profiling via mass spectrometry and NMR. We highlighted the 288 most commonly studied compounds of diverse chemical nature and analyzed metabolic processes involving these small molecules. Building a complete map of the metabolome of a cell, which encompasses the diversity of possible impacts on it, is a severe challenge for the scientific community, which is faced not only with natural limitations of experimental technologies, but also with the absence of transparent and widely accepted standards for processing and presenting the obtained metabolomic data. Formulating our research design, we aimed to reveal metabolites crucial to the Hepg2 cell line, regardless of all chemical and/or physical impact factors. Unfortunately, the existing paradigm of data policy leads to a streetlight effect. When analyzing and reporting only target metabolites of interest, the community ignores the changes in the metabolomic landscape that hide many molecular secrets

    Estimating Total Quantitative Protein Content in <i>Escherichia coli</i>, <i>Saccharomyces cerevisiae</i>, and HeLa Cells

    No full text
    The continuous improvement of proteomic techniques, most notably mass spectrometry, has generated quantified proteomes of many organisms with unprecedented depth and accuracy. However, there is still a significant discrepancy in the reported numbers of total protein molecules per specific cell type. In this article, we explore the results of proteomic studies of Escherichia coli, Saccharomyces cerevisiae, and HeLa cells in terms of total protein copy numbers per cell. We observe up to a ten-fold difference between reported values. Investigating possible reasons for this discrepancy, we conclude that neither an unmeasured fraction of the proteome nor biases in the quantification of individual proteins can explain the observed discrepancy. We normalize protein copy numbers in each study using a total protein amount per cell as reported in the literature and create integrated proteome maps of the selected model organisms. Our results indicate that cells contain from one to three million protein molecules per µm3 and that protein copy density decreases with increasing organism complexity

    Epitranscriptome: Review of Top 25 Most-Studied RNA Modifications

    No full text
    The alphabet of building blocks for RNA molecules is much larger than the standard four nucleotides. The diversity is achieved by the post-transcriptional biochemical modification of these nucleotides into distinct chemical entities that are structurally and functionally different from their unmodified counterparts. Some of these modifications are constituent and critical for RNA functions, while others serve as dynamic markings to regulate the fate of specific RNA molecules. Together, these modifications form the epitranscriptome, an essential layer of cellular biochemistry. As of the time of writing this review, more than 300 distinct RNA modifications from all three life domains have been identified. However, only a few of the most well-established modifications are included in most reviews on this topic. To provide a complete overview of the current state of research on the epitranscriptome, we analyzed the extent of the available information for all known RNA modifications. We selected 25 modifications to describe in detail. Summarizing our findings, we describe the current status of research on most RNA modifications and identify further developments in this field

    Exploring Dynamic Metabolome of the HepG2 Cell Line: Rise and Fall

    No full text
    Both biological and technical variations can discredit the reliability of obtained data in omics studies. In this technical note, we investigated the effect of prolonged cultivation of the HepG2 hepatoma cell line on its metabolomic profile. Using the GC &times; GC-MS approach, we determined the degree of metabolic variability across HepG2 cells cultured in uniform conditions for 0, 5, 10, 15, and 20 days. Post-processing of obtained data revealed substantial changes in relative abundances of 110 metabolites among HepG2 samples under investigation. Our findings have implications for interpreting metabolomic results obtained from immortal cells, especially in longitudinal studies. There are still plenty of unanswered questions regarding metabolomics variability and many potential areas for future targeted and panoramic research. However, we suggest that the metabolome of cell lines is unstable and may undergo significant transformation over time, even if the culture conditions remain the same. Considering metabolomics variability on a relatively long-term basis, careful experimentation with particular attention to control samples is required to ensure reproducibility and relevance of the research results when testing both fundamentally and practically significant hypotheses

    Blood Plasma Proteome: A Meta-Analysis of the Results of Protein Quantification in Human Blood by Targeted Mass Spectrometry

    No full text
    A meta-analysis of the results of targeted quantitative screening of human blood plasma was performed to generate a reference standard kit that can be used for health analytics. The panel included 53 of the 296 proteins that form a “stable” part of the proteome of a healthy individual; these proteins were found in at least 70% of samples and were characterized by an interindividual coefficient of variation −10–10−3 M and enrichment analysis revealed their association with rare familial diseases. The concentration of ceruloplasmin was reduced by approximately three orders of magnitude in patients with neurological disorders compared to healthy volunteers, and those of gelsolin isoform 1 and complement factor H were abruptly reduced in patients with lung adenocarcinoma. Absolute quantitative data of the individual proteome of a healthy and diseased individual can be used as the basis for personalized medicine and health monitoring. Storage over time allows us to identify individual biomarkers in the molecular landscape and prevent pathological conditions

    Does Proteomic Mirror Reflect Clinical Characteristics of Obesity?

    No full text
    Obesity is a frightening chronic disease, which has tripled since 1975. It is not expected to slow down staying one of the leading cases of preventable death and resulting in an increased clinical and economic burden. Poor lifestyle choices and excessive intake of “cheap calories” are major contributors to obesity, triggering type 2 diabetes, cardiovascular diseases, and other comorbidities. Understanding the molecular mechanisms responsible for development of obesity is essential as it might result in the introducing of anti-obesity targets and early-stage obesity biomarkers, allowing the distinction between metabolic syndromes. The complex nature of this disease, coupled with the phenomenon of metabolically healthy obesity, inspired us to perform data-centric, hypothesis-generating pilot research, aimed to find correlations between parameters of classic clinical blood tests and proteomic profiles of 104 lean and obese subjects. As the result, we assembled patterns of proteins, which presence or absence allows predicting the weight of the patient fairly well. We believe that such proteomic patterns with high prediction power should facilitate the translation of potential candidates into biomarkers of clinical use for early-stage stratification of obesity therapy

    Does Proteomic Mirror Reflect Clinical Characteristics of Obesity?

    No full text
    Obesity is a frightening chronic disease, which has tripled since 1975. It is not expected to slow down staying one of the leading cases of preventable death and resulting in an increased clinical and economic burden. Poor lifestyle choices and excessive intake of &ldquo;cheap calories&rdquo; are major contributors to obesity, triggering type 2 diabetes, cardiovascular diseases, and other comorbidities. Understanding the molecular mechanisms responsible for development of obesity is essential as it might result in the introducing of anti-obesity targets and early-stage obesity biomarkers, allowing the distinction between metabolic syndromes. The complex nature of this disease, coupled with the phenomenon of metabolically healthy obesity, inspired us to perform data-centric, hypothesis-generating pilot research, aimed to find correlations between parameters of classic clinical blood tests and proteomic profiles of 104 lean and obese subjects. As the result, we assembled patterns of proteins, which presence or absence allows predicting the weight of the patient fairly well. We believe that such proteomic patterns with high prediction power should facilitate the translation of potential candidates into biomarkers of clinical use for early-stage stratification of obesity therapy
    corecore