20 research outputs found
Systems biology approaches to a rational drug discovery paradigm
The published manuscript is available at EurekaSelect via http://www.eurekaselect.com/openurl/content.php?genre=article&doi=10.2174/1568026615666150826114524.Prathipati P., Mizuguchi K.. Systems biology approaches to a rational drug discovery paradigm. Current Topics in Medicinal Chemistry, 16, 9, 1009. https://doi.org/10.2174/1568026615666150826114524
Improved pose and affinity predictions using different protocols tailored on the basis of data availability
This is a post-peer-review, pre-copyedit version of an article published in Journal of Computer-Aided Molecular Design. The final authenticated version is available online at: http://dx.doi.org/10.1007/s10822-016-9982-4.Prathipati, P., Nagao, C., Ahmad, S. et al. Improved pose and affinity predictions using different protocols tailored on the basis of data availability. J Comput Aided Mol Des 30, 817–828 (2016). https://doi.org/10.1007/s10822-016-9982-
SAP30, a Novel Oncogenic Transcription Factor in High-Risk Neuroblastoma: Clinical Significance and Role in Tumor-Progression, Survival, and Drug Resistance
Neuroblastoma is the most common devastating extracranial solid malignancy in children, accounting for 15% of childhood cancer-related mortality. Despite an intense treatment regimen, approximately 50% of children treated for high-risk neuroblastoma have more aggressive tumor relapse with less than 20% five-year overall survival. Amplification of the oncogene MYCN is associated with a high risk of relapse. However, only 25% of high-risk neuroblastomas are MYCN-amplified, indicating that the rest are driven by factors other than MYCN. Therefore, it is essential to identify novel driver transcription factors but not passenger genes that improve prediction efficacy of therapy response and association with high-risk, progression, stage 4, and survival in neuroblastoma patients. We used three neuroblastoma patient datasets (n=1252 patients) and applied robust bioinformatic data mining tools such as Weighted Gene Co-expression Network Analysis (WGCNA), cisTarget, and Single-Cell Regulatory Network Inference and Clustering (SCENIC) to identify driver transcription factors (regulon) that associate with high-risk, progression, stage, and survival in neuroblastoma patients. Based on the regulon specificity score, we derived a 10-transcription factor signature and prioritized Sin3A Associated Protein 30 (SAP30), given its highest regulon specificity score, especially in high-risk and aggressive stage cohorts. Higher SAP30 expression was found in high-risk neuroblastoma patients and progression-specific patient-derived xenograft tumors than their respective controls. The advanced pharmacogenomic analysis and CRISPR-Cas9 screens indicated that SAP30 essentiality correlated with Cisplatin resistance and further validated in Cisplatin resistant patient-derived xenograft tumor-derived cell lines. SAP30 silencing inhibited cell proliferation, slowed growth and induced cell death in vitro, and reduced tumor burden and size in vivo. Overall, our results indicate that SAP30 is a better prognostic and Cisplatin resistant marker associated with high-risk, stage 4 progression, and poor survival in neuroblastoma patients.https://digitalcommons.unmc.edu/chri_forum/1057/thumbnail.jp
A robust tool for discriminative analysis and feature selection in paired samples impacts the identification of the genes essential for reprogramming lung tissue to adenocarcinoma
<p>Abstract</p> <p>Background</p> <p>Lung cancer is the leading cause of cancer deaths in the world. The most common type of lung cancer is lung adenocarcinoma (AC). The genetic mechanisms of the early stages and lung AC progression steps are poorly understood. There is currently no clinically applicable gene test for the early diagnosis and AC aggressiveness. Among the major reasons for the lack of reliable diagnostic biomarkers are the extraordinary heterogeneity of the cancer cells, complex and poorly understudied interactions of the AC cells with adjacent tissue and immune system, gene variation across patient cohorts, measurement variability, small sample sizes and sub-optimal analytical methods. We suggest that gene expression profiling of the primary tumours and adjacent tissues (PT-AT) handled with a rational statistical and bioinformatics strategy of biomarker prediction and validation could provide significant progress in the identification of clinical biomarkers of AC. To minimise sample-to-sample variability, repeated multivariate measurements in the same object (organ or tissue, e.g. PT-AT in lung) across patients should be designed, but prediction and validation on the genome scale with small sample size is a great methodical challenge.</p> <p>Results</p> <p>To analyse PT-AT relationships efficiently in the statistical modelling, we propose an Extreme Class Discrimination (ECD) feature selection method that identifies a sub-set of the most discriminative variables (e.g. expressed genes). Our method consists of a paired Cross-normalization (CN) step followed by a modified sign Wilcoxon test with multivariate adjustment carried out for each variable. Using an Affymetrix U133A microarray paired dataset of 27 AC patients, we reviewed the global reprogramming of the transcriptome in human lung AC tissue versus normal lung tissue, which is associated with about 2,300 genes discriminating the tissues with 100% accuracy. Cluster analysis applied to these genes resulted in four distinct gene groups which we classified as associated with (i) up-regulated genes in the mitotic cell cycle lung AC, (ii) silenced/suppressed gene specific for normal lung tissue, (iii) cell communication and cell motility and (iv) the immune system features. The genes related to mutagenesis, specific lung cancers, early stage of AC development, tumour aggressiveness and metabolic pathway alterations and adaptations of cancer cells are strongly enriched in the AC PT-AT discriminative gene set. Two AC diagnostic biomarkers SPP1 and CENPA were successfully validated on RT-RCR tissue array. ECD method was systematically compared to several alternative methods and proved to be of better performance and as well as it was validated by comparison of the predicted gene set with literature meta-signature.</p> <p>Conclusions</p> <p>We developed a method that identifies and selects highly discriminative variables from high dimensional data spaces of potential biomarkers based on a statistical analysis of paired samples when the number of samples is small. This method provides superior selection in comparison to conventional methods and can be widely used in different applications. Our method revealed at least 23 hundreds patho-biologically essential genes associated with the global transcriptional reprogramming of human lung epithelium cells and lung AC aggressiveness. This gene set includes many previously published AC biomarkers reflecting inherent disease complexity and specifies the mechanisms of carcinogenesis in the lung AC. SPP1, CENPA and many other PT-AT discriminative genes could be considered as the prospective diagnostic and prognostic biomarkers of lung AC.</p
A prospective compound screening contest identified broader inhibitors for Sirtuin 1
Potential inhibitors of a target biomolecule, NAD-dependent deacetylase Sirtuin 1, were identified by a contest-based approach, in which participants were asked to propose a prioritized list of 400 compounds from a designated compound library containing 2.5 million compounds using in silico methods and scoring. Our aim was to identify target enzyme inhibitors and to benchmark computer-aided drug discovery methods under the same experimental conditions. Collecting compound lists derived from various methods is advantageous for aggregating compounds with structurally diversified properties compared with the use of a single method. The inhibitory action on Sirtuin 1 of approximately half of the proposed compounds was experimentally accessed. Ultimately, seven structurally diverse compounds were identified
Global bayesian models for the prioritization of antitubercular agents.
To aid the creation of novel antituberculosis (antiTB) compounds, Bayesian models were derived and validated on a data set of 3779 compounds which have been measured for minimum inhibitory concentration (MIC) in the Mycobacterium tuberculosis H37Rv strain. The model development and validation involved exploring six different training sets and 15 fingerprint types which resulted in a total of 90 models, with active compounds defined as those with MIC < 5 muM. The best model was derived using Extended Class Fingerprints of maximum diameter 12 (ECFP_12) and a few global descriptors on a training set derived using Functional Class Fingerprints of maximum diameter 4 (FCFP_4). This model demonstrated very good discriminant ability in general, with excellent discriminant statistics for the training set (total accuracy: 0.968; positive recall: 0.967) and a good predictive ability for the test set (total accuracy: 0.869; positive recall: 0.789). The good predictive ability was maintained when the model was applied to a well-separated test set of 2880 compounds derived from a commercial database (total accuracy: 0.73; positive recall: 0.72). The model revealed several conserved substructures present in the active and inactive compounds which are believed to have incremental and detrimental effects on the MIC, respectively. Strategies for enhancing the repertoire of antiTB compounds with the model, including virtual screening of large databases and combinatorial library design, are proposed