281 research outputs found

    Ward's Hierarchical Clustering Method: Clustering Criterion and Agglomerative Algorithm

    Full text link
    The Ward error sum of squares hierarchical clustering method has been very widely used since its first description by Ward in a 1963 publication. It has also been generalized in various ways. However there are different interpretations in the literature and there are different implementations of the Ward agglomerative algorithm in commonly used software systems, including differing expressions of the agglomerative criterion. Our survey work and case studies will be useful for all those involved in developing software for data analysis using Ward's hierarchical clustering method.Comment: 20 pages, 21 citations, 4 figure

    Local and global modes of drug action in biochemical networks

    Get PDF
    It becomes increasingly accepted that a shift is needed from the traditional target-based approach of drug development to an integrated perspective of drug action in biochemical systems. We here present an integrative analysis of the interactions between drugs and metabolism based on the concept of drug scope. The drug scope represents the set of metabolic compounds and reactions that are potentially affected by a drug. We constructed and analyzed the scopes of all US approved drugs having metabolic targets. Our analysis shows that the distribution of drug scopes is highly uneven, and that drugs can be classified into several categories based on their scopes. Some of them have small scopes corresponding to localized action, while others have large scopes corresponding to potential large-scale systemic action. These groups are well conserved throughout different topologies of the underlying metabolic network. They can furthermore be associated to specific drug therapeutic properties

    PREDICT: a new UK prognostic model that predicts survival following surgery for invasive breast cancer

    Get PDF
    RIGHTS : This article is licensed under the BioMed Central licence at http://www.biomedcentral.com/about/license which is similar to the 'Creative Commons Attribution Licence'. In brief you may : copy, distribute, and display the work; make derivative works; or make commercial use of the work - under the following conditions: the original author must be given credit; for any reuse or distribution, it must be made clear to others what the license terms of this work are.Abstract Introduction The aim of this study was to develop and validate a prognostication model to predict overall and breast cancer specific survival for women treated for early breast cancer in the UK. Methods Using the Eastern Cancer Registration and Information Centre (ECRIC) dataset, information was collated for 5,694 women who had surgery for invasive breast cancer in East Anglia from 1999 to 2003. Breast cancer mortality models for oestrogen receptor (ER) positive and ER negative tumours were derived from these data using Cox proportional hazards, adjusting for prognostic factors and mode of cancer detection (symptomatic versus screen-detected). An external dataset of 5,468 patients from the West Midlands Cancer Intelligence Unit (WMCIU) was used for validation. Results Differences in overall actual and predicted mortality were <1% at eight years for ECRIC (18.9% vs. 19.0%) and WMCIU (17.5% vs. 18.3%) with area under receiver-operator-characteristic curves (AUC) of 0.81 and 0.79 respectively. Differences in breast cancer specific actual and predicted mortality were <1% at eight years for ECRIC (12.9% vs. 13.5%) and <1.5% at eight years for WMCIU (12.2% vs. 13.6%) with AUC of 0.84 and 0.82 respectively. Model calibration was good for both ER positive and negative models although the ER positive model provided better discrimination (AUC 0.82) than ER negative (AUC 0.75). Conclusions We have developed a prognostication model for early breast cancer based on UK cancer registry data that predicts breast cancer survival following surgery for invasive breast cancer and includes mode of detection for the first time. The model is well calibrated, provides a high degree of discrimination and has been validated in a second UK patient cohort

    The Wasteland of Random Supergravities

    Full text link
    We show that in a general \cal{N} = 1 supergravity with N \gg 1 scalar fields, an exponentially small fraction of the de Sitter critical points are metastable vacua. Taking the superpotential and Kahler potential to be random functions, we construct a random matrix model for the Hessian matrix, which is well-approximated by the sum of a Wigner matrix and two Wishart matrices. We compute the eigenvalue spectrum analytically from the free convolution of the constituent spectra and find that in typical configurations, a significant fraction of the eigenvalues are negative. Building on the Tracy-Widom law governing fluctuations of extreme eigenvalues, we determine the probability P of a large fluctuation in which all the eigenvalues become positive. Strong eigenvalue repulsion makes this extremely unlikely: we find P \propto exp(-c N^p), with c, p being constants. For generic critical points we find p \approx 1.5, while for approximately-supersymmetric critical points, p \approx 1.3. Our results have significant implications for the counting of de Sitter vacua in string theory, but the number of vacua remains vast.Comment: 39 pages, 9 figures; v2: fixed typos, added refs and clarification

    Screen-detected vs symptomatic breast cancer: is improved survival due to stage migration alone?

    Get PDF
    This paper examines whether screen-detected breast cancer confers additional prognostic benefit to the patient, over and above that expected by any shift in stage at presentation. In all, 5604 women (aged 50–70 years) diagnosed with invasive breast cancer between 1998 and 2003 were identified by the Eastern Cancer Registration and Information Centre (ECRIC) and mammographic screening status was determined. Using proportional hazards regression, we estimated the effect of screen detection compared with symptomatic diagnosis on 5-year survival unadjusted, then adjusted for age and Nottingham Prognostic Index (NPI). A total of 72% of the survival benefit associated with screen-detected breast cancer can be accounted for by age and shift in NPI. Survival analysis by continuous NPI showed a small but systematic survival benefit for screen-detected cancers at each NPI value. These data show that although most of the screen-detected survival advantage is due to a shift in NPI, the mode of detection does impact on survival in patients with equivalent NPI scores. This residual survival benefit is small but significant, and is likely to be due to differences in tumour biology. Current prognostication tools may, therefore, overestimate the benefit of systemic treatments in screen-detected cancers and lead to overtreatment of these patients

    Remodeling the Proteostasis Network to Rescue Glucocerebrosidase Variants by Inhibiting ER-Associated Degradation and Enhancing ER Folding

    Get PDF
    Gaucher’s disease (GD) is characterized by loss of lysosomal glucocerebrosidase (GC) activity. Mutations in the gene encoding GC destabilize the protein’s native folding leading to ER-associated degradation (ERAD) of the misfolded enzyme. Enhancing the cellular folding capacity by remodeling the proteostasis network promotes native folding and lysosomal activity of mutated GC variants. However, proteostasis modulators reported so far, including ERAD inhibitors, trigger cellular stress and lead to induction of apoptosis. We show herein that lacidipine, an L-type Ca2+ channel blocker that also inhibits ryanodine receptors on the ER membrane, enhances folding, trafficking and lysosomal activity of the most severely destabilized GC variant achieved via ERAD inhibition in fibroblasts derived from patients with GD. Interestingly, reprogramming the proteostasis network by combining modulation of Ca2+ homeostasis and ERAD inhibition remodels the unfolded protein response and dramatically lowers apoptosis induction typically associated with ERAD inhibition

    Translating Clinical Findings into Knowledge in Drug Safety Evaluation - Drug Induced Liver Injury Prediction System (DILIps)

    Get PDF
    Drug-induced liver injury (DILI) is a significant concern in drug development due to the poor concordance between preclinical and clinical findings of liver toxicity. We hypothesized that the DILI types (hepatotoxic side effects) seen in the clinic can be translated into the development of predictive in silico models for use in the drug discovery phase. We identified 13 hepatotoxic side effects with high accuracy for classifying marketed drugs for their DILI potential. We then developed in silico predictive models for each of these 13 side effects, which were further combined to construct a DILI prediction system (DILIps). The DILIps yielded 60–70% prediction accuracy for three independent validation sets. To enhance the confidence for identification of drugs that cause severe DILI in humans, the “Rule of Three” was developed in DILIps by using a consensus strategy based on 13 models. This gave high positive predictive value (91%) when applied to an external dataset containing 206 drugs from three independent literature datasets. Using the DILIps, we screened all the drugs in DrugBank and investigated their DILI potential in terms of protein targets and therapeutic categories through network modeling. We demonstrated that two therapeutic categories, anti-infectives for systemic use and musculoskeletal system drugs, were enriched for DILI, which is consistent with current knowledge. We also identified protein targets and pathways that are related to drugs that cause DILI by using pathway analysis and co-occurrence text mining. While marketed drugs were the focus of this study, the DILIps has a potential as an evaluation tool to screen and prioritize new drug candidates or chemicals, such as environmental chemicals, to avoid those that might cause liver toxicity. We expect that the methodology can be also applied to other drug safety endpoints, such as renal or cardiovascular toxicity

    Solution Structure of LC4 Transmembrane Segment of CCR5

    Get PDF
    CC-chemokine receptor 5 (CCR5) is a specific co-receptor allowing the entry of human immunodeficiency virus type 1 (HIV-1). The LC4 region in CCR5 is required for HIV-1 entry into the cells. In this study, the solution structure of LC4 in SDS micelles was elucidated by using standard 1H two-dimensional NMR spectroscopy, circular dichroism, and fluorescdence quenching. The LC4 structure adopts two helical structures, whereas the C-terminal part remains unstructured. The positions in which LC4 binds to the HIV-1 inhibitory peptide LC5 were determined by docking calculations in addition to NMR data. The poses showed the importance of the hydrophobic interface of the assembled structures. The solution structure of LC4 elucidated in the present work provides a structural basis for further studies on the HIV-1 inhibitory function of the LC4 region

    Gene Characterization Index: Assessing the Depth of Gene Annotation

    Get PDF
    We introduce the Gene Characterization Index, a bioinformatics method for scoring the extent to which a protein-encoding gene is functionally described. Inherently a reflection of human perception, the Gene Characterization Index is applied for assessing the characterization status of individual genes, thus serving the advancement of both genome annotation and applied genomics research by rapid and unbiased identification of groups of uncharacterized genes for diverse applications such as directed functional studies and delineation of novel drug targets.The scoring procedure is based on a global survey of researchers, who assigned characterization scores from 1 (poor) to 10 (extensive) for a sample of genes based on major online resources. By evaluating the survey as training data, we developed a bioinformatics procedure to assign gene characterization scores to all genes in the human genome. We analyzed snapshots of functional genome annotation over a period of 6 years to assess temporal changes reflected by the increase of the average Gene Characterization Index. Applying the Gene Characterization Index to genes within pharmaceutically relevant classes, we confirmed known drug targets as high-scoring genes and revealed potentially interesting novel targets with low characterization indexes. Removing known drug targets and genes linked to sequence-related patent filings from the entirety of indexed genes, we identified sets of low-scoring genes particularly suited for further experimental investigation.The Gene Characterization Index is intended to serve as a tool to the scientific community and granting agencies for focusing resources and efforts on unexplored areas of the genome. The Gene Characterization Index is available from http://cisreg.ca/gci/
    corecore