69 research outputs found
Fuzzy clustering of CPP family in plants with evolution and interaction analyses
Background: Transcription factors have been studied intensively because they play an important role in gene expression regulation. However, the transcription factors in the CPP family (cystein-rich polycomb-like protein), compared with other transcription factor families, have not received sufficient attention, despite their wide prevalence in a broad spectrum of species, from plants to animals. The total number of known CPP transcription factors in plants is 111 from 16 plants, but only 2 of them have been studied so far, namely TSO1 and CPP1 in Arabidopsis thaliana and soybean, respectively.
Methods: In this work, to study their functions, we applied the fuzzy clustering method to all plant CPP transcription factors. The feature vector of each protein sequence for the fuzzy clustering method is encoded by the short length peptides and the combination of functional domain models.
Results and conclusions: With the fuzzy clustering method, all plant CPP transcription factors are grouped into two subfamilies. A systems approach, including Expressed Sequence Tag analysis, evolutionary analysis, proteinprotein interaction network analysis and co-expression analysis, is employed to validate the clustering results, the results of which also indicates that the transcription factors from different subfamilies show uncorrelated responses
New methods to measure residues coevolution in proteins
<p>Abstract</p> <p>Background</p> <p>The covariation of two sites in a protein is often used as the degree of their coevolution. To quantify the covariation many methods have been developed and most of them are based on residues position-specific frequencies by using the mutual information (MI) model.</p> <p>Results</p> <p>In the paper, we proposed several new measures to incorporate new biological constraints in quantifying the covariation. The first measure is the mutual information with the amino acid background distribution (MIB), which incorporates the amino acid background distribution into the marginal distribution of the MI model. The modification is made to remove the effect of amino acid evolutionary pressure in measuring covariation. The second measure is the mutual information of residues physicochemical properties (MIP), which is used to measure the covariation of physicochemical properties of two sites. The third measure called MIBP is proposed by applying residues physicochemical properties into the MIB model. Moreover, scores of our new measures are applied to a robust indicator <it>conn(k) </it>in finding the covariation signal of each site.</p> <p>Conclusions</p> <p>We find that incorporating amino acid background distribution is effective in removing the effect of evolutionary pressure of amino acids. Thus the MIB measure describes more biological background information for the coevolution of residues. Besides, our analysis also reveals that the covariation of physicochemical properties is a new aspect of coevolution information.</p
Hard-Aware Point-to-Set Deep Metric for Person Re-identification
Person re-identification (re-ID) is a highly challenging task due to large
variations of pose, viewpoint, illumination, and occlusion. Deep metric
learning provides a satisfactory solution to person re-ID by training a deep
network under supervision of metric loss, e.g., triplet loss. However, the
performance of deep metric learning is greatly limited by traditional sampling
methods. To solve this problem, we propose a Hard-Aware Point-to-Set (HAP2S)
loss with a soft hard-mining scheme. Based on the point-to-set triplet loss
framework, the HAP2S loss adaptively assigns greater weights to harder samples.
Several advantageous properties are observed when compared with other
state-of-the-art loss functions: 1) Accuracy: HAP2S loss consistently achieves
higher re-ID accuracies than other alternatives on three large-scale benchmark
datasets; 2) Robustness: HAP2S loss is more robust to outliers than other
losses; 3) Flexibility: HAP2S loss does not rely on a specific weight function,
i.e., different instantiations of HAP2S loss are equally effective. 4)
Generality: In addition to person re-ID, we apply the proposed method to
generic deep metric learning benchmarks including CUB-200-2011 and Cars196, and
also achieve state-of-the-art results.Comment: Accepted to ECCV 201
Synergistic and Independent Actions of Multiple Terminal Nucleotidyl Transferases in the 3’ Tailing of Small RNAs in Arabidopsis
All types of small RNAs in plants, piwi-interacting RNAs (piRNAs) in animals and a subset of siRNAs in Drosophila and C. elegans are subject to HEN1 mediated 3’ terminal 2’-Omethylation. This modification plays a pivotal role in protecting small RNAs from 3’ uridylation, trimming and degradation. In Arabidopsis, HESO1 is a major enzyme that uridylates small RNAs to trigger their degradation. However, U-tail is still present in null hen1 heso1 mutants, suggesting the existence of (an) enzymatic activities redundant with HESO1. Here, we report that UTP: RNA uridylyltransferase (URT1) is a functional paralog of HESO1. URT1 interacts with AGO1 and plays a predominant role in miRNA uridylation when HESO1 is absent. Uridylation of miRNA is globally abolished in a hen1 heso1 urt1 triple mutant, accompanied by an extensive increase of 3’-to-5’ trimming. In contrast, disruption of URT1 appears not to affect the heterochromatic siRNA uridylation. This indicates the involvement of additional nucleotidyl transferases in the siRNA pathway. Analysis of miRNA tailings in the hen1 heso1 urt1 triple mutant also reveals the existence of previously unknown enzymatic activities that can add non-uridine nucleotides. Importantly, we show HESO1 may also act redundantly with URT1 in miRNA uridylation when HEN1 is fully competent. Taken together, our data not only reveal a synergistic action of HESO1 and URT1 in the 3’ uridylation of miRNAs, but also independent activities of multiple terminal nucleotidyl transferases in the 3’ tailing of small RNAs and an antagonistic relationship between uridylation and trimming. Our results may provide further insight into the mechanisms of small RNA 3’ end modification and stability control
Proteogenomic insights suggest druggable pathways in endometrial carcinoma
We characterized a prospective endometrial carcinoma (EC) cohort containing 138 tumors and 20 enriched normal tissues using 10 different omics platforms. Targeted quantitation of two peptides can predict antigen processing and presentation machinery activity, and may inform patient selection for immunotherapy. Association analysis between MYC activity and metformin treatment in both patients and cell lines suggests a potential role for metformin treatment in non-diabetic patients with elevated MYC activity. PIK3R1 in-frame indels are associated with elevated AKT phosphorylation and increased sensitivity to AKT inhibitors. CTNNB1 hotspot mutations are concentrated near phosphorylation sites mediating pS45-induced degradation of β-catenin, which may render Wnt-FZD antagonists ineffective. Deep learning accurately predicts EC subtypes and mutations from histopathology images, which may be useful for rapid diagnosis. Overall, this study identified molecular and imaging markers that can be further investigated to guide patient stratification for more precise treatment of EC
Proteogenomic characterization of endometrial carcinoma
We undertook a comprehensive proteogenomic characterization of 95 prospectively collected endometrial carcinomas, comprising 83 endometrioid and 12 serous tumors. This analysis revealed possible new consequences of perturbations to the p53 and Wnt/β-catenin pathways, identified a potential role for circRNAs in the epithelial-mesenchymal transition, and provided new information about proteomic markers of clinical and genomic tumor subgroups, including relationships to known druggable pathways. An extensive genome-wide acetylation survey yielded insights into regulatory mechanisms linking Wnt signaling and histone acetylation. We also characterized aspects of the tumor immune landscape, including immunogenic alterations, neoantigens, common cancer/testis antigens, and the immune microenvironment, all of which can inform immunotherapy decisions. Collectively, our multi-omic analyses provide a valuable resource for researchers and clinicians, identify new molecular associations of potential mechanistic significance in the development of endometrial cancers, and suggest novel approaches for identifying potential therapeutic targets
Efficient Cancer Modeling Through CRISPR-Cas9/HDR-Based Somatic Precision Gene Editing in Mice
CRISPR-Cas9 has been used successfully to introduce indels in somatic cells of rodents; however, precise editing of single nucleotides has been hampered by limitations of flexibility and efficiency. Here, we report technological modifications to the CRISPR-Cas9 vector system that now allows homology-directed repair-mediated precise editing of any proto-oncogene in murine somatic tissues to generate tumor models with high flexibility and efficiency. Somatic editing of eithe
L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier
To understand enzyme functions, identifying the catalytic residues is a usual first step. Moreover, knowledge about catalytic residues is also useful for protein engineering and drug-design. However, to experimentally identify catalytic residues remains challenging for reasons of time and cost. Therefore, computational methods have been explored to predict catalytic residues. Here, we developed a new algorithm, L1pred, for catalytic residue prediction, by using the L1-logreg classifier to integrate eight sequence-based scoring functions. We tested L1pred and compared it against several existing sequence-based methods on carefully designed datasets Data604 and Data63. With ten-fold cross-validation, L1pred showed the area under precision-recall curve (AUPR) and the area under ROC curve (AUC) of 0.2198 and 0.9494 on the training dataset, Data604, respectively. In addition, on the independent test dataset, Data63, it showed the AUPR and AUC values of 0.2636 and 0.9375, respectively. Compared with other sequence-based methods, L1pred showed the best performance on both datasets. We also analyzed the importance of each attribute in the algorithm, and found that all the scores contributed more or less equally to the L1pred performance
Proteogenomic Approaches for the Identification of NF1/Neurofibromin-depleted Estrogen Receptor-positive Breast Cancers for Targeted Treatment
NF1 is a key tumor suppressor that represses both RAS and estrogen receptor-α (ER) signaling in breast cancer. Blocking both pathways by fulvestrant (F), a selective ER degrader, together with binimetinib (B), a MEK inhibitor, promotes tumor regression in NF1-depleted ER+ models. We aimed to establish approaches to determine how NF1 protein levels impact B+F treatment response to improve our ability to identify B+F sensitive tumors. We examined a panel of ER+ patient-derived xenograft (PDX) models by DNA and mRNA sequencing and found that more than half of these models carried an NF1 shallow deletion and generally have low mRNA levels. Consistent with RAS and ER activation, RET and MEK levels in NF1-depleted tumors were elevated when profiled by mass spectrometry (MS) after kinase inhibitor bead pulldown. MS showed that NF1 can also directly and selectively bind to palbociclib-conjugated beads, aiding quantification. An IHC assay was also established to measure NF1, but the MS-based approach was more quantitative. Combined IHC and MS analysis defined a threshold of NF1 protein loss in ER+ breast PDX, below which tumors regressed upon treatment with B+F. These results suggest that we now have a MS-verified NF1 IHC assay that can be used for patient selection as a complement to somatic genomic analysis
- …