33 research outputs found

    The Size of the Human Proteome: The Width and Depth

    Get PDF
    This work discusses bioinformatics and experimental approaches to explore the human proteome, a constellation of proteins expressed in different tissues and organs. As the human proteome is not a static entity, it seems necessary to estimate the number of different protein species (proteoforms) and measure the number of copies of the same protein in a specific tissue. Here, meta-analysis of neXtProt knowledge base is proposed for theoretical prediction of the number of different proteoforms that arise from alternative splicing (AS), single amino acid polymorphisms (SAPs), and posttranslational modifications (PTMs). Three possible cases are considered: (1) PTMs and SAPs appear exclusively in the canonical sequences of proteins, but not in splice variants; (2) PTMs and SAPs can occur in both proteins encoded by canonical sequences and in splice variants; (3) all modification types (AS, SAP, and PTM) occur as independent events. Experimental validation of proteoforms is limited by the analytical sensitivity of proteomic technology. A bell-shaped distribution histogram was generated for proteins encoded by a single chromosome, with the estimation of copy numbers in plasma, liver, and HepG2 cell line. The proposed metabioinformatics approaches can be used for estimation of the number of different proteoforms for any group of protein-coding genes

    Gene-centric coverage of the human liver transcriptome: QPCR, Illumina, and Oxford Nanopore RNA-Seq

    Get PDF
    It has been shown that the best coverage of the HepG2 cell line transcriptome encoded by genes of a single chromosome, chromosome 18, is achieved by a combination of two sequencing platforms, Illumina RNA-Seq and Oxford Nanopore Technologies (ONT), using cut-off levels of FPKM > 0 and TPM > 0, respectively. In this study, we investigated the extent to which the combination of these transcriptomic analysis methods makes it possible to achieve a high coverage of the transcriptome encoded by the genes of other human chromosomes. A comparative analysis of transcriptome coverage for various types of biological material was carried out, and the HepG2 cell line transcriptome was compared with the transcriptome of liver tissue cells. In addition, the contribution of variability in the coverage of expressed genes in human transcriptomes to the creation of a draft human transcriptome was evaluated. For human liver tissues, ONT makes an extremely insignificant contribution to the overall coverage of the transcriptome. Thus, to ensure maximum coverage of the liver tissue transcriptome, it is sufficient to apply only one technology: Illumina RNA-Seq (FPKM > 0)

    Dataset of target mass spectromic proteome profiling for human chromosome 18

    Get PDF
    Proteome profiling is a type of quantitative analysis that reveals level of protein expression in the sample. Proteome profiling by using selected reaction monitoring is an approach for the Chromosome-centric Human Proteome Project (C-HPP). Here we describe dataset generated in the course of the pilot phase of Russian part of C-HPP, which was focused on human Chr 18 proteins. Proteome profiling was performed using stable isotope-labeled standards (SRM/SIS) for plasma, liver tissue and HepG2 cells. Dataset includes both positive and negative results of protein detection.These data were partly discussed in recent publications, “Chromosome 18 Transcriptome Profiling and Targeted Proteome Mapping in Depleted Plasma, Liver Tissue and HepG2 Cells” [1] and “Chromosome 18 transcriptoproteome of liver tissue and HepG2 Cells and targeted proteome mapping in depleted plasma: Update 2013” [2], supporting the accompanying publication “State of the Chromosome 18-centric HPP in 2016: Transcriptome and Proteome Profiling of Liver Tissue and HepG2 Cells” [3], and are deposited at the ProteomeXchange via the PASSEL repository with the dataset identifier PASSEL: PASS00697 for liver and HepG2 cell line

    The Expectation and Reality of the HepG2 Core Metabolic Profile

    No full text
    To represent the composition of small molecules circulating in HepG2 cells and the formation of the “core” of characteristic metabolites that often attract researchers’ attention, we conducted a meta-analysis of 56 datasets obtained through metabolomic profiling via mass spectrometry and NMR. We highlighted the 288 most commonly studied compounds of diverse chemical nature and analyzed metabolic processes involving these small molecules. Building a complete map of the metabolome of a cell, which encompasses the diversity of possible impacts on it, is a severe challenge for the scientific community, which is faced not only with natural limitations of experimental technologies, but also with the absence of transparent and widely accepted standards for processing and presenting the obtained metabolomic data. Formulating our research design, we aimed to reveal metabolites crucial to the Hepg2 cell line, regardless of all chemical and/or physical impact factors. Unfortunately, the existing paradigm of data policy leads to a streetlight effect. When analyzing and reporting only target metabolites of interest, the community ignores the changes in the metabolomic landscape that hide many molecular secrets

    Evolution of Protein Functional Annotation: Text Mining Study

    No full text
    Within the Human Proteome Project initiative framework for creating functional annotations of uPE1 proteins, the neXt-CP50 Challenge was launched in 2018. In analogy with the missing-protein challenge, each command deciphers the functional features of the proteins in the chromosome-centric mode. However, the neXt-CP50 Challenge is more complicated than the missing-protein challenge: the approaches and methods for solving the problem are clear, but neither the concept of protein function nor specific experimental and/or bioinformatics protocols have been standardized to address it. We proposed using a retrospective analysis of the key HPP repository, the neXtProt database, to identify the most frequently used experimental and bioinformatic methods for analyzing protein functions, and the dynamics of accumulation of functional annotations. It has been shown that the dynamics of the increase in the number of proteins with known functions are greater than the progress made in the experimental confirmation of the existence of questionable proteins in the framework of the missing-protein challenge. At the same time, the functional annotation is based on the guilty-by-association postulate, according to which, based on large-scale experiments on API-MS and Y2H, proteins with unknown functions are most likely mapped through “handshakes” to biochemical processes

    Analytical Solutions for Geodesic Acoustic Eigenmodes in Tokamak Plasmas

    No full text
    The analytical solutions for geodesic acoustic eigenmodes in tokamak plasmas with circular concentric magnetic surfaces are found. In the frame of ideal magnetohydrodynamics the dispersion relation taking into account the toroidal coupling between electrostatic perturbations and electromagnetic perturbations with poloidal mode number |m| = 2 is derived. In the absence of such a coupling the dispersion relation gives the standard continuous spectrum of geodesic acoustic modes. The analysis of the existence of global eigenmodes for plasma equilibria with both off-axis and on-axis maximum of the local geodesic acoustic frequency is performed

    Analytical Solutions for Geodesic Acoustic Eigenmodes in Tokamak Plasmas

    No full text
    The analytical solutions for geodesic acoustic eigenmodes in tokamak plasmas with circular concentric magnetic surfaces are found. In the frame of ideal magnetohydrodynamics the dispersion relation taking into account the toroidal coupling between electrostatic perturbations and electromagnetic perturbations with poloidal mode number |m| = 2 is derived. In the absence of such a coupling the dispersion relation gives the standard continuous spectrum of geodesic acoustic modes. The analysis of the existence of global eigenmodes for plasma equilibria with both off-axis and on-axis maximum of the local geodesic acoustic frequency is performed

    Analytical Solutions for Geodesic Acoustic Eigenmodes in Tokamak Plasmas

    No full text
    The analytical solutions for geodesic acoustic eigenmodes in tokamak plasmas with circular concentric magnetic surfaces are found. In the frame of ideal magnetohydrodynamics the dispersion relation taking into account the toroidal coupling between electrostatic perturbations and electromagnetic perturbations with poloidal mode number |m| = 2 is derived. In the absence of such a coupling the dispersion relation gives the standard continuous spectrum of geodesic acoustic modes. The analysis of the existence of global eigenmodes for plasma equilibria with both off-axis and on-axis maximum of the local geodesic acoustic frequency is performed

    Exploring Dynamic Metabolome of the HepG2 Cell Line: Rise and Fall

    No full text
    Both biological and technical variations can discredit the reliability of obtained data in omics studies. In this technical note, we investigated the effect of prolonged cultivation of the HepG2 hepatoma cell line on its metabolomic profile. Using the GC × GC-MS approach, we determined the degree of metabolic variability across HepG2 cells cultured in uniform conditions for 0, 5, 10, 15, and 20 days. Post-processing of obtained data revealed substantial changes in relative abundances of 110 metabolites among HepG2 samples under investigation. Our findings have implications for interpreting metabolomic results obtained from immortal cells, especially in longitudinal studies. There are still plenty of unanswered questions regarding metabolomics variability and many potential areas for future targeted and panoramic research. However, we suggest that the metabolome of cell lines is unstable and may undergo significant transformation over time, even if the culture conditions remain the same. Considering metabolomics variability on a relatively long-term basis, careful experimentation with particular attention to control samples is required to ensure reproducibility and relevance of the research results when testing both fundamentally and practically significant hypotheses

    Blood Plasma Proteome: A Meta-Analysis of the Results of Protein Quantification in Human Blood by Targeted Mass Spectrometry

    No full text
    A meta-analysis of the results of targeted quantitative screening of human blood plasma was performed to generate a reference standard kit that can be used for health analytics. The panel included 53 of the 296 proteins that form a “stable” part of the proteome of a healthy individual; these proteins were found in at least 70% of samples and were characterized by an interindividual coefficient of variation −10–10−3 M and enrichment analysis revealed their association with rare familial diseases. The concentration of ceruloplasmin was reduced by approximately three orders of magnitude in patients with neurological disorders compared to healthy volunteers, and those of gelsolin isoform 1 and complement factor H were abruptly reduced in patients with lung adenocarcinoma. Absolute quantitative data of the individual proteome of a healthy and diseased individual can be used as the basis for personalized medicine and health monitoring. Storage over time allows us to identify individual biomarkers in the molecular landscape and prevent pathological conditions
    corecore