215 research outputs found

    Algebraic Comparison of Partial Lists in Bioinformatics

    The outcome of a functional genomics pipeline is usually a partial list of genomic features, ranked by their relevance in modelling biological phenotype in terms of a classification or regression model. Due to resampling protocols or just within a meta-analysis comparison, instead of one list it is often the case that sets of alternative feature lists (possibly of different lengths) are obtained. Here we introduce a method, based on the algebraic theory of symmetric groups, for studying the variability between lists ("list stability") in the case of lists of unequal length. We provide algorithms evaluating stability for lists embedded in the full feature set or just limited to the features occurring in the partial lists. The method is demonstrated first on synthetic data in a gene filtering task and then for finding gene profiles on a recent prostate cancer dataset

    Belle II Technical Design Report

    The Belle detector at the KEKB electron-positron collider has collected almost 1 billion Y(4S) events in its decade of operation. Super-KEKB, an upgrade of KEKB, is under construction to increase the luminosity by two orders of magnitude during a three-year shutdown, with an ultimate goal of 8E35 /cm^2 /s luminosity. To exploit the increased luminosity, an upgrade of the Belle detector has been proposed. A new international collaboration, Belle-II, is being formed. The Technical Design Report presents physics motivation, basic methods of the accelerator upgrade, as well as key improvements of the detector. Comment: Edited by: Z. Doležal and S. Un

    On plexus representation of dissimilarities

    Correspondence analysis has found widespread application in analysing vegetation gradients. However, it is not clear how robust it is to situations where structures other than a simple gradient exist. The introduction of instrumental variables in canonical correspondence analysis does not avoid these difficulties. In this paper I propose to examine some simple methods based on the notion of the plexus (sensu McIntosh) where graphs or networks are used to display some of the structure of the data so that an informed choice of models is possible. I show that two different classes of plexus model are available. These classes are distinguished by the use in one case of a global Euclidean model to obtain a well-separated pair decomposition (WSPD) of a set of points which implicitly involves all dissimilarities, while in the other a Riemannian view is taken and emphasis is placed locally, i.e., on small dissimilarities. I show an example of each of these classes applied to vegetation data

    Reconsidering the use of rankings in the valuation of health states: a model for estimating cardinal values from ordinal data

    BACKGROUND: In survey studies on health-state valuations, ordinal ranking exercises often are used as precursors to other elicitation methods such as the time trade-off (TTO) or standard gamble, but the ranking data have not been used in deriving cardinal valuations. This study reconsiders the role of ordinal ranks in valuing health and introduces a new approach to estimate interval-scaled valuations based on aggregate ranking data. METHODS: Analyses were undertaken on data from a previously published general population survey study in the United Kingdom that included rankings and TTO values for hypothetical states described using the EQ-5D classification system. The EQ-5D includes five domains (mobility, self-care, usual activities, pain/discomfort and anxiety/depression) with three possible levels on each. Rank data were analysed using a random utility model, operationalized through conditional logit regression. In the statistical model, probabilities of observed rankings were related to the latent utilities of different health states, modeled as a linear function of EQ-5D domain scores, as in previously reported EQ-5D valuation functions. Predicted valuations based on the conditional logit model were compared to observed TTO values for the 42 states in the study and to predictions based on a model estimated directly from the TTO values. Models were evaluated using the intraclass correlation coefficient (ICC) between predictions and mean observations, and the root mean squared error of predictions at the individual level. RESULTS: Agreement between predicted valuations from the rank model and observed TTO values was very high, with an ICC of 0.97, only marginally lower than for predictions based on the model estimated directly from TTO values (ICC = 0.99). Individual-level errors were also comparable in the two models, with root mean squared errors of 0.503 and 0.496 for the rank-based and TTO-based predictions, respectively. 
CONCLUSIONS: Modeling health-state valuations based on ordinal ranks can provide results that are similar to those obtained from more widely analyzed valuation techniques such as the TTO. The information content in aggregate ranking data is not currently exploited to full advantage. The possibility of estimating cardinal valuations from ordinal ranks could also simplify future data collection dramatically and facilitate wider empirical study of health-state valuations in diverse settings and population groups
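
The rank model in the abstract above relates the probability of an observed ranking to latent utilities through conditional logit regression. A minimal sketch of that idea, assuming the standard rank-ordered ("exploded") logit form in which each rank position is treated as a choice among the states not yet ranked. The state names and utility values are hypothetical and are not taken from the study:

```python
import math
from itertools import permutations


def ranking_probability(utilities, ranking):
    """Probability of one complete ranking (best first) under a
    rank-ordered logit model with the given latent utilities."""
    remaining = list(ranking)
    prob = 1.0
    # At each stage, the top-ranked remaining state is "chosen"
    # with a conditional logit probability over the remaining set.
    while len(remaining) > 1:
        denom = sum(math.exp(utilities[s]) for s in remaining)
        prob *= math.exp(utilities[remaining[0]]) / denom
        remaining.pop(0)
    return prob


# Hypothetical latent utilities for three health states (illustration only).
utilities = {"state_a": 1.0, "state_b": 0.0, "state_c": -1.0}

best_first = ranking_probability(utilities, ["state_a", "state_b", "state_c"])
worst_first = ranking_probability(utilities, ["state_c", "state_b", "state_a"])

# Sanity check: probabilities over all possible rankings sum to 1.
total = sum(ranking_probability(utilities, p) for p in permutations(utilities))
```

In a fitted model the utilities would themselves be a linear function of the EQ-5D domain levels, with coefficients chosen to maximize the product of these probabilities over all observed rankings.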

    A Novel Mechanism of Programmed Cell Death in Bacteria by Toxin–Antitoxin Systems Corrupts Peptidoglycan Synthesis

    Most genomes of bacteria contain toxin–antitoxin (TA) systems. These gene systems encode a toxic protein and its cognate antitoxin. Upon antitoxin degradation, the toxin induces cell stasis or death. TA systems have been linked with numerous functions, including growth modulation, genome maintenance, and stress response. Members of the epsilon/zeta TA family are found throughout the genomes of pathogenic bacteria and were shown not only to stabilize resistance plasmids but also to promote virulence. The broad distribution of epsilon/zeta systems implies that zeta toxins utilize a ubiquitous bacteriotoxic mechanism. However, whereas all other TA families known to date poison macromolecules involved in translation or replication, the target of zeta toxins remained inscrutable. We used in vivo techniques such as microscopy and permeability assays to show that pneumococcal zeta toxin PezT impairs cell wall synthesis and triggers autolysis in Escherichia coli. Subsequently, we demonstrated in vitro that zeta toxins in general phosphorylate the ubiquitous peptidoglycan precursor uridine diphosphate-N-acetylglucosamine (UNAG) and that this activity is counteracted by binding of antitoxin. After identification of the product we verified the kinase activity in vivo by analyzing metabolite extracts of cells poisoned by PezT using high pressure liquid chromatography (HPLC). We further show that phosphorylated UNAG inhibits MurA, the enzyme catalyzing the initial step in bacterial peptidoglycan biosynthesis. Additionally, we provide what is to our knowledge the first crystal structure of a zeta toxin bound to its substrate. We show that zeta toxins are novel kinases that poison bacteria through global inhibition of peptidoglycan synthesis. This provides a fundamental understanding of how epsilon/zeta TA systems stabilize mobile genetic elements. Additionally, our results imply a mechanism that connects activity of zeta toxin PezT to virulence of pneumococcal infections.
Finally, we discuss how phosphorylated UNAG likely poisons additional pathways of bacterial cell wall synthesis, making it an attractive lead compound for development of new antibiotics

    Evidence for Limited Genetic Compartmentalization of HIV-1 between Lung and Blood

    BACKGROUND: HIV-1 is frequently detected in the lungs of infected individuals and is likely important in the development of pulmonary opportunistic infections. The unique environment of the lung, rich in alveolar macrophages and with specialized local immune responses, may contribute to differential evolution or selection of HIV-1. METHODOLOGY AND FINDINGS: We characterized HIV-1 in the lung in relation to contemporaneous viral populations in the blood. The C2-V5 region of HIV-1 env was sequenced from paired lung (induced sputum or bronchoalveolar lavage) and blood (plasma RNA and proviral DNA from sorted or unsorted PBMC) from 18 subjects. Compartmentalization between tissue pairs was assessed using 5 established tree or distance-based methods, including permutation tests to determine statistical significance. We found statistical evidence of compartmentalization between lung and blood in 10/18 subjects, although lung and blood sequences were intermingled on phylogenetic trees in all subjects. The subject showing the greatest compartmentalization contained many nearly identical sequences in the BAL sample, suggesting clonal expansion may contribute to reduced viral diversity in the lung in some cases. However, HIV-1 sequences in lung were not more homogeneous overall, nor were we able to find a lung-specific genotype associated with macrophage tropism in V3. In all four subjects in whom predicted X4 genotypes were found in blood, predicted X4 genotypes were also found in lung. CONCLUSIONS: Our results support a picture of continuous migration of HIV-1 between circulating blood and lung tissue, with perhaps a very limited degree of localized evolution or clonal replication

    HIV-1 Populations in Semen Arise through Multiple Mechanisms

    HIV-1 is present in anatomical compartments and bodily fluids. Most transmissions occur through sexual acts, making virus in semen the proximal source in male donors. We find three distinct relationships in comparing viral RNA populations between blood and semen in men with chronic HIV-1 infection, and we propose that the viral populations in semen arise by multiple mechanisms including: direct import of virus, oligoclonal amplification within the seminal tract, or compartmentalization. In addition, we find significant enrichment of six out of nineteen cytokines and chemokines in semen of both HIV-infected and uninfected men, and another seven further enriched in infected individuals. The enrichment of cytokines involved in innate immunity in the seminal tract, complemented with chemokines in infected men, creates an environment conducive to T cell activation and viral replication. These studies define different relationships between virus in blood and semen that can significantly alter the composition of the viral population at the source that is most proximal to the transmitted virus

    Finishing the euchromatic sequence of the human genome

    The sequence of the human genome encodes the genetic instructions for human physiology, as well as rich information about human evolution. In 2001, the International Human Genome Sequencing Consortium reported a draft sequence of the euchromatic portion of the human genome. Since then, the international collaboration has worked to convert this draft into a genome sequence with high accuracy and nearly complete coverage. Here, we report the result of this finishing process. The current genome sequence (Build 35) contains 2.85 billion nucleotides interrupted by only 341 gaps. It covers ∼99% of the euchromatic genome and is accurate to an error rate of ∼1 event per 100,000 bases. Many of the remaining euchromatic gaps are associated with segmental duplications and will require focused work with new methods. The near-complete sequence, the first for a vertebrate, greatly improves the precision of biological analyses of the human genome including studies of gene number, birth and death. Notably, the human genome seems to encode only 20,000-25,000 protein-coding genes. The genome sequence reported here should serve as a firm foundation for biomedical research in the decades ahead
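
The figures quoted in the abstract above can be turned into rough orders of magnitude. A back-of-envelope sketch using only the stated values (2.85 billion nucleotides, 341 gaps, about 1 error per 100,000 bases). It assumes, purely for illustration, that errors are spread uniformly and that the 341 gaps split the sequence into 342 stretches. The variable names are hypothetical:

```python
# Values quoted in the abstract (Build 35).
genome_bases = 2_850_000_000   # nucleotides in the finished sequence
gap_count = 341                # remaining gaps
bases_per_error = 100_000      # about 1 error event per 100,000 bases

# Expected number of error events across the whole sequence,
# assuming a uniform error rate (an illustrative assumption).
expected_errors = genome_bases // bases_per_error

# Average length of an uninterrupted stretch, assuming the gaps
# divide the sequence into gap_count + 1 contiguous pieces.
mean_stretch_bases = genome_bases / (gap_count + 1)

print(f"expected error events: about {expected_errors:,}")
print(f"mean contiguous stretch: about {mean_stretch_bases / 1e6:.2f} Mb")
```

These are rough estimates only; the abstract does not give the actual distribution of errors or the sizes of individual contiguous stretches.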