36 research outputs found

    Prediction of Pharmacological and Xenobiotic Responses to Drugs Based on Time Course Gene Expression Profiles

    More and more people are concerned by the risk of unexpected side effects observed in the later steps of the development of new drugs, either in late clinical development or after marketing approval. In order to reduce the risk of side effects, it is important to look out for possible xenobiotic responses at an early stage. We attempt this through prediction, assuming that similarities in microarray profiles indicate shared mechanisms of action and/or toxicological responses among the chemicals being compared. A large time course microarray database derived from livers of compound-treated rats with thirty-four distinct pharmacological and toxicological responses was studied. The mRMR (Minimum-Redundancy-Maximum-Relevance) method and IFS (Incremental Feature Selection) were used to select a compact feature set (141 features) to reduce the feature dimension and improve prediction performance. With these 141 features, the leave-one-out cross-validation prediction accuracy of the first order response using NNA (Nearest Neighbor Algorithm) was 63.9%. Our method can be used to predict the pharmacological and xenobiotic responses of new compounds and to accelerate drug development.
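
The IFS step described in this abstract can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes a feature ranking is already available (for example from mRMR), uses Euclidean distance for the nearest-neighbor classifier (the abstract does not state the distance measure), and runs on a made-up four-sample data set. All function and variable names are hypothetical.

```python
import math

def nearest_neighbor_predict(train_X, train_y, x):
    """Return the label of the training sample closest to x (Euclidean distance)."""
    best = min(range(len(train_X)), key=lambda i: math.dist(train_X[i], x))
    return train_y[best]

def loocv_accuracy(X, y, feature_idx):
    """Leave-one-out accuracy of 1-NN using only the columns in feature_idx."""
    Xs = [[row[j] for j in feature_idx] for row in X]
    correct = 0
    for i in range(len(Xs)):
        # Hold out sample i and train on all the others.
        train_X = Xs[:i] + Xs[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if nearest_neighbor_predict(train_X, train_y, Xs[i]) == y[i]:
            correct += 1
    return correct / len(Xs)

def incremental_feature_selection(X, y, ranked_features):
    """Score the top-k ranked features for k = 1..N and keep the best k.

    Returns (best_k, best_accuracy, accuracy_curve); ties go to the smaller k.
    """
    curve = [loocv_accuracy(X, y, ranked_features[:k])
             for k in range(1, len(ranked_features) + 1)]
    best_k = max(range(len(curve)), key=lambda k: curve[k]) + 1
    return best_k, curve[best_k - 1], curve

# Toy data (hypothetical): feature 0 separates the classes, feature 1 is noise.
X_toy = [[0.0, 5.0], [0.1, 1.0], [1.0, 4.9], [1.1, 1.1]]
y_toy = ["a", "a", "b", "b"]
ifs_result = incremental_feature_selection(X_toy, y_toy, [0, 1])
print(ifs_result)
```

On the toy data, adding the noisy second feature lowers the leave-one-out accuracy, so the procedure keeps only the first ranked feature. The same logic, applied to a much longer ranking, is what yields a compact subset such as the 141 features reported above.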

    Analysis and Identification of Aptamer-Compound Interactions with a Maximum Relevance Minimum Redundancy and Nearest Neighbor Algorithm

    The development of biochemistry and molecular biology has revealed an increasingly important role of compounds in several biological processes. Like the aptamer-protein interaction, the aptamer-compound interaction attracts increasing attention. However, it is time-consuming to select proper aptamers against compounds using traditional methods, such as exponential enrichment. Thus, there is an urgent need to design effective computational methods for finding aptamers against compounds. This study attempted to extract important features for aptamer-compound interactions using feature selection methods, such as Maximum Relevance Minimum Redundancy, as well as incremental feature selection. Each aptamer-compound pair was represented by properties derived from the aptamer and compound, including frequencies of single nucleotides and dinucleotides for the aptamer, as well as the constitutional, electrostatic, quantum-chemical, and space conformational descriptors of the compounds. As a result, some important features were obtained. To confirm the importance of the obtained features, we further discussed the associations between them and aptamer-compound interactions. In addition, an optimal prediction model based on the nearest neighbor algorithm was built to identify aptamer-compound interactions, which has the potential to be a useful tool for the identification of novel aptamer-compound interactions. The program is available upon request.

    Potential Tumor Suppressor NESG1 as an Unfavorable Prognosis Factor in Nasopharyngeal Carcinoma

    BACKGROUND: Recently we identified nasopharyngeal epithelium specific protein 1 (NESG1) as a potential tumor suppressor in nasopharyngeal carcinoma (NPC). The purpose of this study is to investigate the involvement of NESG1 in tumor progression and prognosis of human NPC. METHODOLOGY/PRINCIPAL FINDINGS: NESG1 protein expression in NPC was examined. Survival analysis was performed using the Kaplan-Meier method. The effects of NESG1 on cell proliferation, migration, and invasion were also investigated. RESULTS: NESG1 expression was downregulated in atypical hyperplasia and NPC samples compared to normal and squamous nasopharynx tissues. Reduced protein expression was negatively associated with the status of NPC progression. Patients with lower NESG1 expression had a shorter overall survival and disease-free time than did patients with higher NESG1 expression. Multivariate analysis suggested NESG1 expression as an independent prognostic indicator for NPC patient survival. Proliferation, migration, and invasion ability were significantly increased in cell lines following lentiviral-mediated shRNA suppression of NESG1 expression. Microarray analysis indicated that NESG1 participated in multiple pathways, including MAPK signaling and cell cycle regulation. Finally, DNA methylation microarray examination revealed a lack of hypermethylation at the NESG1 promoter, suggesting other mechanisms are involved in suppressing NESG1 expression in NPC. CONCLUSION: Our studies are the first to demonstrate that decreased NESG1 expression is an unfavorable prognostic factor for NPC.

    Prediction of Deleterious Non-Synonymous SNPs Based on Protein Interaction Network and Hybrid Properties

    Non-synonymous SNPs (nsSNPs), also known as Single Amino acid Polymorphisms (SAPs), account for the majority of human inherited diseases. It is important to distinguish deleterious SAPs from neutral ones. Most traditional computational methods to classify SAPs are based on sequential or structural features. However, these features cannot fully explain the association between a SAP and the observed pathophysiological phenotype. We believe a better rationale for deleterious SAP prediction is this: if a SAP lies in a protein with important functions and severely changes the protein sequence and structure, it is more likely to be related to disease. We therefore established a method to predict deleterious SAPs based on both the protein interaction network and traditional hybrid properties. Each SAP is represented by 472 features that include sequential features, structural features and network features. The Maximum Relevance Minimum Redundancy (mRMR) method and Incremental Feature Selection (IFS) were applied to obtain the optimal feature set, and the prediction model was the Nearest Neighbor Algorithm (NNA). In jackknife cross-validation, 83.27% of SAPs were correctly predicted when the optimized 263 features were used. The optimized predictor with 263 features was also tested on an independent dataset, where the accuracy was still 80.00%. In contrast, SIFT, a widely used predictor of deleterious SAPs based on sequential features, has a prediction accuracy of 71.05% on the same dataset. In our study, network features were found to be most important for accurate prediction and can significantly improve the prediction performance. Our results suggest that the protein interaction context could provide important clues to help better illustrate a SAP's functional association. This research will facilitate post genome-wide association studies.
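
The mRMR ranking mentioned in this abstract can be illustrated with a small sketch. It is a simplified version under stated assumptions, not the authors' program: features and labels are discrete, mutual information is estimated from raw counts, and the score is the "difference" form (relevance minus mean redundancy with the features already chosen). The names `mutual_information` and `mrmr_rank` and the toy data are hypothetical.

```python
import math
from collections import Counter

def mutual_information(a, b):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(a)
    count_a, count_b, count_ab = Counter(a), Counter(b), Counter(zip(a, b))
    return sum(
        (c / n) * math.log((c / n) / ((count_a[x] / n) * (count_b[y] / n)))
        for (x, y), c in count_ab.items()
    )

def mrmr_rank(columns, target):
    """Greedy mRMR: order feature indices by relevance minus mean redundancy."""
    relevance = [mutual_information(col, target) for col in columns]
    remaining = list(range(len(columns)))
    selected = []

    def score(j):
        if not selected:
            return relevance[j]
        redundancy = sum(
            mutual_information(columns[j], columns[s]) for s in selected
        ) / len(selected)
        return relevance[j] - redundancy

    while remaining:
        best = max(remaining, key=score)  # ties go to the lowest index
        selected.append(best)
        remaining.remove(best)
    return selected

# Toy data (hypothetical): feature 1 duplicates feature 0, while feature 2 is
# weakly relevant but independent of feature 0.
target = [0, 0, 0, 1, 1, 1, 1, 1]
columns = [
    [0, 0, 0, 0, 1, 1, 1, 1],
    [0, 0, 0, 0, 1, 1, 1, 1],
    [0, 1, 0, 1, 0, 1, 0, 1],
]
ranking = mrmr_rank(columns, target)
print(ranking)
```

A ranking by relevance alone would place the duplicated feature second; the redundancy penalty pushes it to the end, which is the behavior that lets a later IFS pass settle on a compact subset.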

    Cooperativity among Short Amyloid Stretches in Long Amyloidogenic Sequences

    Amyloid fibrillar aggregates of polypeptides are associated with many neurodegenerative diseases. Short peptide segments in protein sequences may trigger aggregation. Identifying these stretches and examining their behavior in longer protein segments is critical for understanding these diseases and obtaining potential therapies. In this study, we combined machine learning and structure-based energy evaluation to examine and predict amyloidogenic segments. Our feature selection method discovered that windows consisting of long amino acid segments of ∼30 residues, instead of the commonly used short hexapeptides, provided the highest accuracy. Weighted contributions of an amino acid at each position in a 27-residue window revealed three cooperative regions of short stretches, resembling the β-strand-turn-β-strand motif in A-β peptide amyloid and the β-solenoid structure of HET-s(218–289) prion (C). Using an in-house energy evaluation algorithm, the interaction energy between two short stretches in a long segment is computed and incorporated as an additional feature. The algorithm successfully predicted and classified amyloid segments with an overall accuracy of 75%. Our study revealed that genome-wide amyloid segments are not only dependent on short high propensity stretches, but also on nearby residues.

    Prediction of nucleosome positioning based on transcription factor binding sites.

    Background: The DNA of all eukaryotic organisms is packaged into nucleosomes, the basic repeating units of chromatin. The nucleosome consists of a histone octamer around which a DNA core is wrapped and the linker histone H1, which is associated with linker DNA. By altering the accessibility of DNA sequences, the nucleosome has profound effects on all DNA-dependent processes. Understanding the factors that influence nucleosome positioning is of great importance for the study of genomic control mechanisms. Transcription factors (TFs) have been suggested to play a role in nucleosome positioning in vivo. Principal findings: Here, the minimum redundancy maximum relevance (mRMR) feature selection algorithm, the nearest neighbor algorithm (NNA), and the incremental feature selection (IFS) method were used to identify the most important TFs that either favor or inhibit nucleosome positioning by analyzing the numbers of transcription factor binding sites (TFBSs) in 53,021 nucleosomal DNA sequences and 50,299 linker DNA sequences. A total of nine important families of TFs were extracted from 35 families, and the overall prediction accuracy was 87.4% as evaluated by the jackknife cross-validation test. Conclusions: Our results are consistent with the notion that TFs are more likely to bind linker DNA sequences than the sequences in the nucleosomes. In addition, our results imply that there may be some TFs that are important for nucleosome positioning but that play an insignificant role in discriminating nucleosome-forming DNA sequences from nucleosome-inhibiting DNA sequences. The hypothesis that TFs play a role in nucleosome positioning is, thus, confirmed by the results of this study.

    Stress state measured at ~7 km depth in the Tarim Basin, NW China

    Stress acting on the crust at 7 km depth measured in a drilling survey of the Tarim Basin. Kyoto University press release. 2017-07-04. The in-situ stress state in the Tarim Basin, Northwest China, down to 7 km depth is constrained using the anelastic strain recovery (ASR) method and wellbore failure analysis. Results are consistent between the two methods, and indicate that the maximum principal stress (σ1) is close to vertical and the intermediate and minimum principal stresses (σ2 and σ3) are approximately horizontal. The state of stress at the studied wellbore is in the normal faulting stress regime within the Tarim Basin rather than in the compressional tectonic stress regime found at the periphery of the Tarim Basin, which explains the presence of the normal faults interpreted in 3-D seismic profiles collected from adjacent areas. Our results demonstrate that the ASR method can be used on rocks recovered from depths as great as 7 km to obtain reliable stress state information. The in-situ stress measurement results presented in this paper will help future development of petroleum resources and kinematics studies in the Tarim Basin.
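
The link between a near-vertical σ1 and a normal faulting regime follows the standard Andersonian classification, which compares the vertical stress with the two horizontal principal stresses. The sketch below only illustrates that rule; the function name and the stress magnitudes are hypothetical and are not values from the paper.

```python
def anderson_regime(s_v, s_hmax, s_hmin):
    """Classify the faulting regime from principal stress magnitudes.

    s_v is the vertical stress; s_hmax and s_hmin are the maximum and
    minimum horizontal stresses, all in the same units (e.g. MPa).
    """
    if s_hmax < s_hmin:
        raise ValueError("s_hmax must be greater than or equal to s_hmin")
    if s_v >= s_hmax:
        # Vertical stress is sigma-1: extension, normal faulting.
        return "normal faulting"
    if s_v >= s_hmin:
        # Vertical stress is sigma-2: strike-slip faulting.
        return "strike-slip faulting"
    # Vertical stress is sigma-3: compression, reverse (thrust) faulting.
    return "reverse faulting"

# Hypothetical magnitudes in MPa, chosen only to exercise each branch.
regimes = [
    anderson_regime(180.0, 150.0, 120.0),
    anderson_regime(150.0, 180.0, 120.0),
    anderson_regime(120.0, 180.0, 150.0),
]
print(regimes)
```

With the vertical stress as the largest of the three, the function returns the normal faulting regime, which is the case the abstract reports for the studied wellbore.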
