Search CORE

65 research outputs found

Fuzzy clustering of CPP family in plants with evolution and interaction analyses

Author: Dou Yongchao
Lu Tao
Zhang Chi
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/10/2012
Field of study

Background: Transcription factors have been studied intensively because they play an important role in gene expression regulation. However, the transcription factors in the CPP family (cystein-rich polycomb-like protein), compared with other transcription factor families, have not received sufficient attention, despite their wide prevalence in a broad spectrum of species, from plants to animals. The total number of known CPP transcription factors in plants is 111 from 16 plants, but only 2 of them have been studied so far, namely TSO1 and CPP1 in Arabidopsis thaliana and soybean, respectively. Methods: In this work, to study their functions, we applied the fuzzy clustering method to all plant CPP transcription factors. The feature vector of each protein sequence for the fuzzy clustering method is encoded by the short length peptides and the combination of functional domain models. Results and conclusions: With the fuzzy clustering method, all plant CPP transcription factors are grouped into two subfamilies. A systems approach, including Expressed Sequence Tag analysis, evolutionary analysis, proteinprotein interaction network analysis and co-expression analysis, is employed to validate the clustering results, the results of which also indicates that the transcription factors from different subfamilies show uncorrelated responses

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

PubMed Central

New methods to measure residues coevolution in proteins

Author: Dou Yongchao
Gao Hongyun
Wang Jun
Yang Jialiang
Publication venue: BioMed Central
Publication date: 01/01/2011
Field of study

Abstract Background The covariation of two sites in a protein is often used as the degree of their coevolution. To quantify the covariation many methods have been developed and most of them are based on residues position-specific frequencies by using the mutual information (MI) model. Results In the paper, we proposed several new measures to incorporate new biological constraints in quantifying the covariation. The first measure is the mutual information with the amino acid background distribution (MIB), which incorporates the amino acid background distribution into the marginal distribution of the MI model. The modification is made to remove the effect of amino acid evolutionary pressure in measuring covariation. The second measure is the mutual information of residues physicochemical properties (MIP), which is used to measure the covariation of physicochemical properties of two sites. The third measure called MIBP is proposed by applying residues physicochemical properties into the MIB model. Moreover, scores of our new measures are applied to a robust indicator <it>conn(k) </it>in finding the covariation signal of each site. Conclusions We find that incorporating amino acid background distribution is effective in removing the effect of evolutionary pressure of amino acids. Thus the MIB measure describes more biological background information for the coevolution of residues. Besides, our analysis also reveals that the covariation of physicochemical properties is a new aspect of coevolution information.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Hard-Aware Point-to-Set Deep Metric for Person Re-identification

Author: Bai Song
Bai Xiang
Dou Zhiyong
Xu Yongchao
Yu Rui
Zhang Zhaoxiang
Publication venue
Publication date: 30/07/2018
Field of study

Person re-identification (re-ID) is a highly challenging task due to large variations of pose, viewpoint, illumination, and occlusion. Deep metric learning provides a satisfactory solution to person re-ID by training a deep network under supervision of metric loss, e.g., triplet loss. However, the performance of deep metric learning is greatly limited by traditional sampling methods. To solve this problem, we propose a Hard-Aware Point-to-Set (HAP2S) loss with a soft hard-mining scheme. Based on the point-to-set triplet loss framework, the HAP2S loss adaptively assigns greater weights to harder samples. Several advantageous properties are observed when compared with other state-of-the-art loss functions: 1) Accuracy: HAP2S loss consistently achieves higher re-ID accuracies than other alternatives on three large-scale benchmark datasets; 2) Robustness: HAP2S loss is more robust to outliers than other losses; 3) Flexibility: HAP2S loss does not rely on a specific weight function, i.e., different instantiations of HAP2S loss are equally effective. 4) Generality: In addition to person re-ID, we apply the proposed method to generic deep metric learning benchmarks including CUB-200-2011 and Cars196, and also achieve state-of-the-art results.Comment: Accepted to ECCV 201

arXiv.org e-Print Archive

Crossref

Synergistic and Independent Actions of Multiple Terminal Nucleotidyl Transferases in the 3’ Tailing of Small RNAs in Arabidopsis

Author: Chen Xuemei
Dou Yongchao
Ren Guodong
Wang Xiaoyan
Yu Bin
Zhang Chi
Zhang Shuxin
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2015
Field of study

All types of small RNAs in plants, piwi-interacting RNAs (piRNAs) in animals and a subset of siRNAs in Drosophila and C. elegans are subject to HEN1 mediated 3’ terminal 2’-Omethylation. This modification plays a pivotal role in protecting small RNAs from 3’ uridylation, trimming and degradation. In Arabidopsis, HESO1 is a major enzyme that uridylates small RNAs to trigger their degradation. However, U-tail is still present in null hen1 heso1 mutants, suggesting the existence of (an) enzymatic activities redundant with HESO1. Here, we report that UTP: RNA uridylyltransferase (URT1) is a functional paralog of HESO1. URT1 interacts with AGO1 and plays a predominant role in miRNA uridylation when HESO1 is absent. Uridylation of miRNA is globally abolished in a hen1 heso1 urt1 triple mutant, accompanied by an extensive increase of 3’-to-5’ trimming. In contrast, disruption of URT1 appears not to affect the heterochromatic siRNA uridylation. This indicates the involvement of additional nucleotidyl transferases in the siRNA pathway. Analysis of miRNA tailings in the hen1 heso1 urt1 triple mutant also reveals the existence of previously unknown enzymatic activities that can add non-uridine nucleotides. Importantly, we show HESO1 may also act redundantly with URT1 in miRNA uridylation when HEN1 is fully competent. Taken together, our data not only reveal a synergistic action of HESO1 and URT1 in the 3’ uridylation of miRNAs, but also independent activities of multiple terminal nucleotidyl transferases in the 3’ tailing of small RNAs and an antagonistic relationship between uridylation and trimming. Our results may provide further insight into the mechanisms of small RNA 3’ end modification and stability control

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central

eScholarship - University of California

FigShare

Proteogenomic insights suggest druggable pathways in endometrial carcinoma

Author: Ding Li
Dou Yongchao
et al.
Lu Rita Jui-Hsien
Mutch David
Wu Yige
Wyczalkowski Matthew A
Publication venue: Digital Commons@Becker
Publication date: 11/09/2023
Field of study

We characterized a prospective endometrial carcinoma (EC) cohort containing 138 tumors and 20 enriched normal tissues using 10 different omics platforms. Targeted quantitation of two peptides can predict antigen processing and presentation machinery activity, and may inform patient selection for immunotherapy. Association analysis between MYC activity and metformin treatment in both patients and cell lines suggests a potential role for metformin treatment in non-diabetic patients with elevated MYC activity. PIK3R1 in-frame indels are associated with elevated AKT phosphorylation and increased sensitivity to AKT inhibitors. CTNNB1 hotspot mutations are concentrated near phosphorylation sites mediating pS45-induced degradation of β-catenin, which may render Wnt-FZD antagonists ineffective. Deep learning accurately predicts EC subtypes and mutations from histopathology images, which may be useful for rapid diagnosis. Overall, this study identified molecular and imaging markers that can be further investigated to guide patient stratification for more precise treatment of EC

Digital Commons@Becker

Proteogenomic characterization of endometrial carcinoma

Author: Cao Song
Cui Zhou Daniel
Ding Li
Dou Yongchao
et al
Fuh Katherine
Gao Qingsong
Karpova Alla
Mutch David
Sethuraman Sunantha
Wu Yige
Wyczalkowski Matthew A
Publication venue: Digital Commons@Becker
Publication date: 20/02/2020
Field of study

We undertook a comprehensive proteogenomic characterization of 95 prospectively collected endometrial carcinomas, comprising 83 endometrioid and 12 serous tumors. This analysis revealed possible new consequences of perturbations to the p53 and Wnt/β-catenin pathways, identified a potential role for circRNAs in the epithelial-mesenchymal transition, and provided new information about proteomic markers of clinical and genomic tumor subgroups, including relationships to known druggable pathways. An extensive genome-wide acetylation survey yielded insights into regulatory mechanisms linking Wnt signaling and histone acetylation. We also characterized aspects of the tumor immune landscape, including immunogenic alterations, neoantigens, common cancer/testis antigens, and the immune microenvironment, all of which can inform immunotherapy decisions. Collectively, our multi-omic analyses provide a valuable resource for researchers and clinicians, identify new molecular associations of potential mechanistic significance in the development of endometrial cancers, and suggest novel approaches for identifying potential therapeutic targets

Digital Commons@Becker

L1pred: A Sequence-Based Prediction Tool for Catalytic Residues in Enzymes with the L1-logreg Classifier

Author: A Armon
A del Sol Mesa
A Gutteridge
AR Panchenko
B Sterner
C Berezin
C Marino Buslje
C Porter
CA Innis
Chi Zhang
D La
DR Caffrey
E Chea
E Cilia
E Greenshtein
E Youn
F Glaser
G Lopez
GJ Bartlett
HM Berman
I Mayrose
I Mihalek
IA Vergara
Iddo Friedberg
J Capra
J Pei
JD Fischer
Jialiang Yang
Jun Wang
K Koh
K Wang
K Ye
KC Bahadur Dukka
L Mirny
LJ McGuffin
M Brylinski
M Landau
N Petrova
P Zhao
R Alterovitz
RM Sweet
RM Williamson
S Ahmad
S Gong
S Pande
S Sankararaman
S Sankararaman
SA van de Geer
SF Altschul
SW Zhang
T Kato
T Zhang
W Taylor
W Tong
W Valdar
XS Liu
YC Dou
YC Dou
YC Dou
Yongchao Dou
YR Tang
ZP Liu
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

To understand enzyme functions, identifying the catalytic residues is a usual first step. Moreover, knowledge about catalytic residues is also useful for protein engineering and drug-design. However, to experimentally identify catalytic residues remains challenging for reasons of time and cost. Therefore, computational methods have been explored to predict catalytic residues. Here, we developed a new algorithm, L1pred, for catalytic residue prediction, by using the L1-logreg classifier to integrate eight sequence-based scoring functions. We tested L1pred and compared it against several existing sequence-based methods on carefully designed datasets Data604 and Data63. With ten-fold cross-validation, L1pred showed the area under precision-recall curve (AUPR) and the area under ROC curve (AUC) of 0.2198 and 0.9494 on the training dataset, Data604, respectively. In addition, on the independent test dataset, Data63, it showed the AUPR and AUC values of 0.2636 and 0.9375, respectively. Compared with other sequence-based methods, L1pred showed the best performance on both datasets. We also analyzed the importance of each attribute in the algorithm, and found that all the scores contributed more or less equally to the L1pred performance

CiteSeerX

Public Library of Science (PLOS)

Crossref

DigitalCommons@University of Nebraska

Directory of Open Access Journals

PubMed Central

Frozen tissue coring and layered histological analysis improves cell type-specific proteogenomic characterization of pancreatic adenocarcinoma

Author: Bathe Oliver F.
Chen Lijun
Dou Yongchao
Hostetter Galen
Jewell Scott
Li Qing K.
Newton Chelsea
Omenn Gilbert S.
Robles Ana I.
Savage Sara R.
Thiagarajan Mathangi
Wang Yuefan
Zhang Bing
Zhang Hui
Publication venue
Publication date: 04/02/2024
Field of study

Abstract Background Omics characterization of pancreatic adenocarcinoma tissue is complicated by the highly heterogeneous and mixed populations of cells. We evaluate the feasibility and potential benefit of using a coring method to enrich specific regions from bulk tissue and then perform proteogenomic analyses. Methods We used the Biopsy Trifecta Extraction (BioTExt) technique to isolate cores of epithelial-enriched and stroma-enriched tissue from pancreatic tumor and adjacent tissue blocks. Histology was assessed at multiple depths throughout each core. DNA sequencing, RNA sequencing, and proteomics were performed on the cored and bulk tissue samples. Supervised and unsupervised analyses were performed based on integrated molecular and histology data. Results Tissue cores had mixed cell composition at varying depths throughout. Average cell type percentages assessed by histology throughout the core were better associated with KRAS variant allele frequencies than standard histology assessment of the cut surface. Clustering based on serial histology data separated the cores into three groups with enrichment of neoplastic epithelium, stroma, and acinar cells, respectively. Using this classification, tumor overexpressed proteins identified in bulk tissue analysis were assigned into epithelial- or stroma-specific categories, which revealed novel epithelial-specific tumor overexpressed proteins. Conclusions Our study demonstrates the feasibility of multi-omics data generation from tissue cores, the necessity of interval H&E stains in serial histology sections, and the utility of coring to improve analysis over bulk tissue data

PRISM: University of Calgary Digital Repository

Identification of RNA silencing components in soybean and sorghum

Author: Dou Yongchao
Liu Xiang
Lu Tao
Yu Bin
Zhang Chi
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/01/2014
Field of study

Background: RNA silencing is a process triggered by 21–24 small RNAs to repress gene expression. Many organisms including plants use RNA silencing to regulate development and physiology, and to maintain genome stability. Plants possess two classes of small RNAs: microRNAs (miRNAs) and small interfering RNAs (siRNAs). The frameworks of miRNA and siRNA pathways have been established in the model plant, Arabidopsis thaliana (Arabidopsis). Results: Here we report the identification of putative genes that are required for the generation and function of miRNAs and siRNAs in soybean and sorghum, based on knowledge obtained from Arabidopsis. The gene families, including DCL, HEN1, SE, HYL1, HST, RDR, NRPD1, NRPD2/NRPE2, NRPE1, and AGO, were analyzed for gene structures, phylogenetic relationships, and protein motifs. The gene expression was validated using RNA-seq, expressed sequence tags (EST), and reverse transcription PCR (RT-PCR). Conclusions: The identification of these components could provide not only insight into RNA silencing mechanism in soybean and sorghum but also basis for further investigation. All data are available at http://sysbio.unl.edu/

Crossref

DigitalCommons@University of Nebraska

Springer - Publisher Connector

PubMed Central