628 research outputs found

    Granzyme G is expressed in the two-cell stage mouse embryo and is required for the maternal-zygotic transition

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Detailed knowledge of the molecular and cellular mechanisms that direct spatial and temporal gene expression in pre-implantation embryos is critical for understanding the control of the maternal-zygotic transition and cell differentiation in early embryonic development. In this study, twenty-three clones, expressed at different stages of early mouse development, were identified using differential display reverse transcription polymerase chain reaction (DDRT-PCR). One of these clones, which is expressed in 2-cell stage embryos at 48 hr post-hCG injection, shows a perfect sequence homology to the gene encoding the granzyme G protein. The granzyme family members are serine proteases that are present in the secretory granules of cytolytic T lymphocytes. However, the pattern of granzyme G expression and its function in early mouse embryos are entirely unknown.</p> <p>Results</p> <p>Upon the introduction of an antisense morpholino (2 mM) against granzyme G to knock-down endogenous gene function, all embryos were arrested at the 2- to 4-cell stages of egg cleavage, and the <it>de novo </it>synthesis of zygotic RNAs was decreased. The embryonic survival rate was dramatically decreased at the late 2-cell stage when serine protease-specific inhibitors, 0.1 mM 3,4-dichloroisocoumarin (3,4-DCI), and 2 mM phenyl methanesulphonyl fluoride (PMSF), were added to the <it>in vitro </it>embryonic culture medium. Survival was not affected by the addition of 0.5 mM EDTA, a metalloproteinase inhibitor.</p> <p>Conclusion</p> <p>We characterized for the first time the expression and function of <it>granzyme G </it>during early stage embryogenesis. Our data suggest that granzyme G is an important factor in early mouse embryonic development and may play a novel role in the elimination of maternal proteins and the triggering of zygotic gene expression during the maternal-zygotic transition.</p

    Gene ontology based transfer learning for protein subcellular localization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Prediction of protein subcellular localization generally involves many complex factors, and using only one or two aspects of data information may not tell the true story. For this reason, some recent predictive models are deliberately designed to integrate multiple heterogeneous data sources for exploiting multi-aspect protein feature information. Gene ontology, hereinafter referred to as <it>GO</it>, uses a controlled vocabulary to depict biological molecules or gene products in terms of biological process, molecular function and cellular component. With the rapid expansion of annotated protein sequences, gene ontology has become a general protein feature that can be used to construct predictive models in computational biology. Existing models generally either concatenated the <it>GO </it>terms into a flat binary vector or applied majority-vote based ensemble learning for protein subcellular localization, both of which can not estimate the individual discriminative abilities of the three aspects of gene ontology.</p> <p>Results</p> <p>In this paper, we propose a Gene Ontology Based Transfer Learning Model (<it>GO-TLM</it>) for large-scale protein subcellular localization. The model transfers the signature-based homologous <it>GO </it>terms to the target proteins, and further constructs a reliable learning system to reduce the adverse affect of the potential false <it>GO </it>terms that are resulted from evolutionary divergence. We derive three <it>GO </it>kernels from the three aspects of gene ontology to measure the <it>GO </it>similarity of two proteins, and derive two other spectrum kernels to measure the similarity of two protein sequences. We use simple non-parametric cross validation to explicitly weigh the discriminative abilities of the five kernels, such that the time & space computational complexities are greatly reduced when compared to the complicated semi-definite programming and semi-indefinite linear programming. The five kernels are then linearly merged into one single kernel for protein subcellular localization. We evaluate <it>GO-TLM </it>performance against three baseline models: <it>MultiLoc, MultiLoc-GO </it>and <it>Euk-mPLoc </it>on the benchmark datasets the baseline models adopted. 5-fold cross validation experiments show that <it>GO-TLM </it>achieves substantial accuracy improvement against the baseline models: 80.38% against model <it>Euk-mPLoc </it>67.40% with <it>12.98% </it>substantial increase; 96.65% and 96.27% against model <it>MultiLoc-GO </it>89.60% and 89.60%, with <it>7.05% </it>and <it>6.67% </it>accuracy increase on dataset <it>MultiLoc plant </it>and dataset <it>MultiLoc animal</it>, respectively; 97.14%, 95.90% and 96.85% against model <it>MultiLoc-GO </it>83.70%, 90.10% and 85.70%, with accuracy increase <it>13.44%</it>, <it>5.8% </it>and <it>11.15% </it>on dataset <it>BaCelLoc plant</it>, dataset <it>BaCelLoc fungi </it>and dataset <it>BaCelLoc animal </it>respectively. For <it>BaCelLoc </it>independent sets, <it>GO-TLM </it>achieves 81.25%, 80.45% and 79.46% on dataset <it>BaCelLoc plant holdout</it>, dataset <it>BaCelLoc plant holdout </it>and dataset <it>BaCelLoc animal holdout</it>, respectively, as compared against baseline model <it>MultiLoc-GO </it>76%, 60.00% and 73.00%, with accuracy increase <it>5.25%</it>, <it>20.45% </it>and <it>6.46%</it>, respectively.</p> <p>Conclusions</p> <p>Since direct homology-based <it>GO </it>term transfer may be prone to introducing noise and outliers to the target protein, we design an explicitly weighted kernel learning system (called Gene Ontology Based Transfer Learning Model, <it>GO-TLM</it>) to transfer to the target protein the known knowledge about related homologous proteins, which can reduce the risk of outliers and share knowledge between homologous proteins, and thus achieve better predictive performance for protein subcellular localization. Cross validation and independent test experimental results show that the homology-based <it>GO </it>term transfer and explicitly weighing the <it>GO </it>kernels substantially improve the prediction performance.</p

    Predicting multiplex subcellular localization of proteins using protein-protein interaction network: a comparative study

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Proteins that interact in vivo tend to reside within the same or "adjacent" subcellular compartments. This observation provides opportunities to reveal protein subcellular localization in the context of the protein-protein interaction (PPI) network. However, so far, only a few efforts based on heuristic rules have been made in this regard.</p> <p>Results</p> <p>We systematically and quantitatively validate the hypothesis that proteins physically interacting with each other probably share at least one common subcellular localization. With the result, for the first time, four graph-based semi-supervised learning algorithms, Majority, <it>χ</it><sup>2</sup>-score, GenMultiCut and FunFlow originally proposed for protein function prediction, are introduced to assign "multiplex localization" to proteins. We analyze these approaches by performing a large-scale cross validation on a <it>Saccharomyces cerevisiae </it>proteome compiled from BioGRID and comparing their predictions for 22 protein subcellular localizations. Furthermore, we build an ensemble classifier to associate 529 unlabeled and 137 ambiguously-annotated proteins with subcellular localizations, most of which have been verified in the previous experimental studies.</p> <p>Conclusions</p> <p>Physical interaction of proteins has actually provided an essential clue for their co-localization. Compared to the local approaches, the global algorithms consistently achieve a superior performance.</p

    Semi-supervised protein subcellular localization

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Protein subcellular localization is concerned with predicting the location of a protein within a cell using computational method. The location information can indicate key functionalities of proteins. Accurate predictions of subcellular localizations of protein can aid the prediction of protein function and genome annotation, as well as the identification of drug targets. Computational methods based on machine learning, such as support vector machine approaches, have already been widely used in the prediction of protein subcellular localization. However, a major drawback of these machine learning-based approaches is that a large amount of data should be labeled in order to let the prediction system learn a classifier of good generalization ability. However, in real world cases, it is laborious, expensive and time-consuming to experimentally determine the subcellular localization of a protein and prepare instances of labeled data.</p> <p>Results</p> <p>In this paper, we present an approach based on a new learning framework, semi-supervised learning, which can use much fewer labeled instances to construct a high quality prediction model. We construct an initial classifier using a small set of labeled examples first, and then use unlabeled instances to refine the classifier for future predictions.</p> <p>Conclusion</p> <p>Experimental results show that our methods can effectively reduce the workload for labeling data using the unlabeled data. Our method is shown to enhance the state-of-the-art prediction results of SVM classifiers by more than 10%.</p

    Where to deliver? Analysis of choice of delivery location from a national survey in India

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>In order to reduce maternal mortality, the Indian government has increased its commitment to institutional deliveries. We assess the determinants of home, private and public sector utilization for a delivery in a Western state.</p> <p>Methods</p> <p>Cross sectional analyses of the National Family Health Survey – 2 dataset.</p> <p>Setting</p> <p>Maharashtra state. The dataset had a sample size of 5391 ever-married females between the ages of 15 to 49 years. Data were abstracted for the most recent birth (n = 1510) and these were used in the analyses. Conceptual framework was the Andersen Behavioral Model. Multinomial logistic regression analyses was conducted to assess the association of predisposing, enabling and need factors on use of home, public or private sector for delivery.</p> <p>Results</p> <p>A majority delivered at home (n = 559, 37%); with private and public facility deliveries accounting for 32% (n = 493) and 31% (n = 454) respectively. For the choice set of home delivery versus public facility, women with higher birth order and those living in rural areas had greater odds of delivering at home, while increasing maternal age, greater media exposure, and more then three antenatal visits were associated with greater odds of delivery in a public facility. Maternal and paternal education, scheduled caste/tribe status, and media exposure were statistically significant predictors of the choice of public versus private facility delivery.</p> <p>Conclusion</p> <p>As India's economy continues to grow, the private sector will continue to expand. Given the high household expenditures on health, the government needs to facilitate insurance schemes or provide grants to prevent impoverishment. It also needs to strengthen the public sector so that it can return to its mission of being the safety net.</p

    Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>DNA repair is the general term for the collection of critical mechanisms which repair many forms of DNA damage such as methylation or ionizing radiation. DNA repair has mainly been studied in experimental and clinical situations, and relatively few information-based approaches to new extracting DNA repair knowledge exist. As a first step, automatic detection of DNA repair proteins in genomes via informatics techniques is desirable; however, there are many forms of DNA repair and it is not a straightforward process to identify and classify repair proteins with a single optimal method. We perform a study of the ability of homology and machine learning-based methods to identify and classify DNA repair proteins, as well as scan vertebrate genomes for the presence of novel repair proteins. Combinations of primary sequence polypeptide frequency, secondary structure, and homology information are used as feature information for input to a Support Vector Machine (SVM).</p> <p>Results</p> <p>We identify that SVM techniques are capable of identifying portions of DNA repair protein datasets without admitting false positives; at low levels of false positive tolerance, homology can also identify and classify proteins with good performance. Secondary structure information provides improved performance compared to using primary structure alone. Furthermore, we observe that machine learning methods incorporating homology information perform best when data is filtered by some clustering technique. Analysis by applying these methodologies to the scanning of multiple vertebrate genomes confirms a positive correlation between the size of a genome and the number of DNA repair protein transcripts it is likely to contain, and simultaneously suggests that all organisms have a non-zero minimum number of repair genes. In addition, the scan result clusters several organisms' repair abilities in an evolutionarily consistent fashion. Analysis also identifies several functionally unconfirmed proteins that are highly likely to be involved in the repair process. A new web service, INTREPED, has been made available for the immediate search and annotation of DNA repair proteins in newly sequenced genomes.</p> <p>Conclusion</p> <p>Despite complexity due to a multitude of repair pathways, combinations of sequence, structure, and homology with Support Vector Machines offer good methods in addition to existing homology searches for DNA repair protein identification and functional annotation. Most importantly, this study has uncovered relationships between the size of a genome and a genome's available repair repetoire, and offers a number of new predictions as well as a prediction service, both which reduce the search time and cost for novel repair genes and proteins.</p

    Rac1 and Rac3 isoform activation is involved in the invasive and metastatic phenotype of human breast cancer cells

    Get PDF
    INTRODUCTION: The metastatic progression of cancer is a direct result of the disregulation of numerous cellular signaling pathways, including those associated with adhesion, migration, and invasion. Members of the Rac family of small GTPases are known to act as regulators of actin cytoskeletal structures and strongly influence the cellular processes of integrin-mediated adhesion and migration. Even though hyperactivated Rac proteins have been shown to influence metastatic processes, these proteins have never been directly linked to metastatic progression. METHODS: To investigate a role for Rac and Cdc42 in metastatic breast cancer cell invasion and migration, relative endogenous Rac or Cdc42 activity was determined in a panel of metastatic variants of the MDA-MB-435 metastatic human breast cancer cell line using a p21-binding domain-PAK pull down assay. To investigate the migratory and invasive potential of the Rac isoforms in human breast cancer, namely Rac1 and the subsequently cloned Rac3, we stably expressed either dominant active Rac1 or dominant active Rac3 into the least metastatic cell variant. Dominant negative Rac1 or dominant negative Rac3 were stably expressed in the most metastatic cell variant. Cell lines expressing mutant Rac1 or Rac3 were analyzed using in vitro adhesion, migration and invasion assays. RESULTS: We show that increased activation of Rac proteins directly correlates with increasing metastatic potential in a panel of cell variants derived from a single metastatic breast cancer cell line (MDA-MB-435). The same correlation could not be found with activated Cdc42. Expression of a dominant active Rac1 or a dominant active Rac3 resulted in a more invasive and motile phenotype. Moreover, expression of either dominant negative Rac1 or dominant negative Rac3 into the most metastatic cell variant resulted in decreased invasive and motile properties. CONCLUSION: This study correlates endogenous Rac activity with high metastatic potential and implicates Rac in the regulation of cell migration and invasion in metastatic breast cancer cells. Taken together, these results suggest a role for both the Rac1 and Rac3 GTPases in human breast cancer progression

    Enhancement of the activity of phenoxodiol by cisplatin in prostate cancer cells

    Get PDF
    Phenoxodiol is a novel isoflav-3-ene, currently undergoing clinical trials, that has a broad in vitro activity against a number of human cancer cell lines. Phenoxodiol alone inhibited DU145 and PC3 in a dose- and time-dependent manner with IC50 values of 8±1 and 38±9 μM, respectively. The combination of phenoxodiol and cisplatin was synergistic in DU145, and additive in PC3, as assessed by the Chou–Talalay method. Carboplatin was also synergistic in combination with phenoxodiol in DU145 cells. The activity of the phenoxodiol and cisplatin combination was confirmed in vivo using a DU145 xenograft model in nude mice. Pharmacokinetic data from these mice suggest that the mechanism of synergy may occur through a pharmacodynamic mechanism. An intracellular cisplatin accumulation assay showed a 35% (P<0.05) increase in the uptake of cisplatin when it was combined in a ratio of 1 μM: 5 μM phenoxodiol, resulting in a 300% (P<0.05) increase in DNA adducts. Taken together, our results suggest that phenoxodiol has interesting properties that make combination therapy with cisplatin or carboplatin appealing

    A domain-based approach to predict protein-protein interactions

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>Knowing which proteins exist in a certain organism or cell type and how these proteins interact with each other are necessary for the understanding of biological processes at the whole cell level. The determination of the protein-protein interaction (PPI) networks has been the subject of extensive research. Despite the development of reasonably successful methods, serious technical difficulties still exist. In this paper we present DomainGA, a quantitative computational approach that uses the information about the domain-domain interactions to predict the interactions between proteins.</p> <p>Results</p> <p>DomainGA is a multi-parameter optimization method in which the available PPI information is used to derive a quantitative scoring scheme for the domain-domain pairs. Obtained domain interaction scores are then used to predict whether a pair of proteins interacts. Using the yeast PPI data and a series of tests, we show the robustness and insensitivity of the DomainGA method to the selection of the parameter sets, score ranges, and detection rules. Our DomainGA method achieves very high explanation ratios for the positive and negative PPIs in yeast. Based on our cross-verification tests on human PPIs, comparison of the optimized scores with the structurally observed domain interactions obtained from the iPFAM database, and sensitivity and specificity analysis; we conclude that our DomainGA method shows great promise to be applicable across multiple organisms.</p> <p>Conclusion</p> <p>We envision the DomainGA as a first step of a multiple tier approach to constructing organism specific PPIs. As it is based on fundamental structural information, the DomainGA approach can be used to create potential PPIs and the accuracy of the constructed interaction template can be further improved using complementary methods. Explanation ratios obtained in the reported test case studies clearly show that the false prediction rates of the template networks constructed using the DomainGA scores are reasonably low, and the erroneous predictions can be filtered further using supplementary approaches such as those based on literature search or other prediction methods.</p
    • …
    corecore