2,932 research outputs found

    Proteome-wide structural analysis identifies warhead- and coverage-specific biases in cysteine-focused chemoproteomics

    Get PDF
    Covalent drug discovery has undergone a resurgence over the past two decades and reactive cysteine profiling has emerged in parallel as a platform for ligand discovery through on- and off-target profiling; however, the scope of this approach has not been fully explored at the whole-proteome level. We combined AlphaFold2-predicted side-chain accessibilities for >95% of the human proteome with a meta-analysis of eighteen public cysteine profiling datasets, totaling 44,187 unique cysteine residues, revealing accessibility biases in sampled cysteines primarily dictated by warhead chemistry. Analysis of >3.5 million cysteine-fragment interactions further showed that hit elaboration and optimization drives increased bias against buried cysteine residues. Based on these data, we suggest that current profiling approaches cover a small proportion of potential ligandable cysteine residues and propose future directions for increasing coverage, focusing on high-priority residues and depth. All analysis and produced resources are freely available and extendable to other reactive amino acids

    Estimation of required sample size for external validation of risk models for binary outcomes

    Get PDF
    Risk-prediction models for health outcomes are used in practice as part of clinical decision-making, and it is essential that their performance be externally validated. An important aspect in the design of a validation study is choosing an adequate sample size. In this paper, we investigate the sample size requirements for validation studies with binary outcomes to estimate measures of predictive performance (C-statistic for discrimination and calibration slope and calibration in the large). We aim for sufficient precision in the estimated measures. In addition, we investigate the sample size to achieve sufficient power to detect a difference from a target value. Under normality assumptions on the distribution of the linear predictor, we obtain simple estimators for sample size calculations based on the measures above. Simulation studies show that the estimators perform well for common values of the C-statistic and outcome prevalence when the linear predictor is marginally Normal. Their performance deteriorates only slightly when the normality assumptions are violated. We also propose estimators which do not require normality assumptions but require specification of the marginal distribution of the linear predictor and require the use of numerical integration. These estimators were also seen to perform very well under marginal normality. Our sample size equations require a specified standard error (SE) and the anticipated C-statistic and outcome prevalence. The sample size requirement varies according to the prognostic strength of the model, outcome prevalence, choice of the performance measure and study objective. For example, to achieve an SE < 0.025 for the C-statistic, 60-170 events are required if the true C-statistic and outcome prevalence are between 0.64-0.85 and 0.05-0.3, respectively. For the calibration slope and calibration in the large, achieving SE < 0.15   would require 40-280 and 50-100 events, respectively. Our estimators may also be used for survival outcomes when the proportion of censored observations is high

    Improving Phrap-Based Assembly of the Rat Using “Reliable” Overlaps

    Get PDF
    The assembly methods used for whole-genome shotgun (WGS) data have a major impact on the quality of resulting draft genomes. We present a novel algorithm to generate a set of “reliable” overlaps based on identifying repeat k-mers. To demonstrate the benefits of using reliable overlaps, we have created a version of the Phrap assembly program that uses only overlaps from a specific list. We call this version PhrapUMD. Integrating PhrapUMD and our “reliable-overlap” algorithm with the Baylor College of Medicine assembler, Atlas, we assemble the BACs from the Rattus norvegicus genome project. Starting with the same data as the Nov. 2002 Atlas assembly, we compare our results and the Atlas assembly to the 4.3 Mb of rat sequence in the 21 BACs that have been finished. Our version of the draft assembly of the 21 BACs increases the coverage of finished sequence from 93.4% to 96.3%, while simultaneously reducing the base error rate from 4.5 to 1.1 errors per 10,000 bases. There are a number of ways of assessing the relative merits of assemblies when the finished sequence is available. If one views the overall quality of an assembly as proportional to the inverse of the product of the error rate and sequence missed, then the assembly presented here is seven times better. The UMD Overlapper with options for reliable overlaps is available from the authors at http://www.genome.umd.edu. We also provide the changes to the Phrap source code enabling it to use only the reliable overlaps

    A fitness assay for comparing RNAi effects across multiple C. elegans genotypes

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>RNAi technology by feeding of <it>E. coli </it>containing dsRNA in <it>C. elegans </it>has significantly contributed to further our understanding of many different fields, including genetics, molecular biology, developmental biology and functional genomics. Most of this research has been carried out in a single genotype or genetic background. However, RNAi effects in one genotype do not reveal the allelic effects that segregate in natural populations and contribute to phenotypic variation.</p> <p>Results</p> <p>Here we present a method that allows for rapidly comparing RNAi effects among diverse genotypes at an improved high throughput rate. It is based on assessing the fitness of a population of worms by measuring the rate at which <it>E. coli </it>is consumed. Critically, we demonstrate the analytical power of this method by QTL mapping the loss of RNAi sensitivity (in the germline) in a recombinant inbred population derived from a cross between Bristol and a natural isolate from Hawaii. Hawaii has lost RNAi sensitivity in the germline. We found that polymorphisms in <it>ppw-1 </it>contribute to this loss of RNAi sensitivity, but that other loci are also likely to be important.</p> <p>Conclusions</p> <p>In summary, we have established a fast method that improves the throughput of RNAi in liquid, that generates quantitative data, that is easy to implement in most laboratories, and importantly that enables QTL mapping using RNAi.</p

    The missions of medical schools: the pursuit of health in the service of society

    Get PDF
    Mission statements and role documents of medical schools in the United Kingdom, United States, Canada and Australia have been examined on their Internet Web sites and categorised in purpose, content and presentation. The format and content are highly variable, but there is a common vision of three integral roles, namely, education, advancement of knowledge and service to society. Other frequent themes include tradition and historical perspective, service for designated communities, and benchmarking to accreditation standards. Differences in content reflect variable interpretation of the notion of "mission", and local or national characteristics such as institutional affiliations, the types, levels and organisation of medical education, relationships with health systems, and extent of multi-professional education. Outcomes data and measures of medical school performance referenced to the institution's stated missions are rarely encountered. Mission documents placed on the Internet are in the public domain. These Web sites and documents and linked information constitute a valuable new resource for international exchange of approaches and ideas in medical education and generally in academic medicine. Routine inclusion of outcome or performance data could help to demonstrate the community roles and social accountability of medical schools This paper proposes that partial standardisation of these Web documents could enhance their value both internally and for external readers. A generic descriptive statement template is offered

    Using Abbreviated Injury Scale (AIS) codes to classify Computed Tomography (CT) features in the Marshall System

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>The purpose of Abbreviated Injury Scale (AIS) is to code various types of Traumatic Brain Injuries (TBI) based on their anatomical location and severity. The Marshall CT Classification is used to identify those subgroups of brain injured patients at higher risk of deterioration or mortality. The purpose of this study is to determine whether and how AIS coding can be translated to the Marshall Classification</p> <p>Methods</p> <p>Initially, a Marshall Class was allocated to each AIS code through cross-tabulation. This was agreed upon through several discussion meetings with experts from both fields (clinicians and AIS coders). Furthermore, in order to make this translation possible, some necessary assumptions with regards to coding and classification of mass lesions and brain swelling were essential which were all approved and made explicit.</p> <p>Results</p> <p>The proposed method involves two stages: firstly to determine all possible Marshall Classes which a given patient can attract based on allocated AIS codes; via cross-tabulation and secondly to assign one Marshall Class to each patient through an algorithm.</p> <p>Conclusion</p> <p>This method can be easily programmed in computer softwares and it would enable future important TBI research programs using trauma registry data.</p

    Immigrant community integration in world cities

    Full text link
    As a consequence of the accelerated globalization process, today major cities all over the world are characterized by an increasing multiculturalism. The integration of immigrant communities may be affected by social polarization and spatial segregation. How are these dynamics evolving over time? To what extent the different policies launched to tackle these problems are working? These are critical questions traditionally addressed by studies based on surveys and census data. Such sources are safe to avoid spurious biases, but the data collection becomes an intensive and rather expensive work. Here, we conduct a comprehensive study on immigrant integration in 53 world cities by introducing an innovative approach: an analysis of the spatio-temporal communication patterns of immigrant and local communities based on language detection in Twitter and on novel metrics of spatial integration. We quantify the "Power of Integration" of cities --their capacity to spatially integrate diverse cultures-- and characterize the relations between different cultures when acting as hosts or immigrants.Comment: 13 pages, 5 figures + Appendi

    Re-Assembly of the Genome of Francisella tularensis Subsp. holarctica OSU18

    Get PDF
    Francisella tularensis is a highly infectious human intracellular pathogen that is the causative agent of tularemia. It occurs in several major subtypes, including the live vaccine strain holarctica (type B). F. tularensis is classified as category A biodefense agent in part because a relatively small number of organisms can cause severe illness. Three complete genomes of subspecies holarctica have been sequenced and deposited in public archives, of which OSU18 was the first and the only strain for which a scientific publication has appeared [1]. We re-assembled the OSU18 strain using both de novo and comparative assembly techniques, and found that the published sequence has two large inversion mis-assemblies. We generated a corrected assembly of the entire genome along with detailed information on the placement of individual reads within the assembly. This assembly will provide a more accurate basis for future comparative studies of this pathogen

    Long-term (trophic) purinergic signalling: purinoceptors control cell proliferation, differentiation and death

    Get PDF
    The purinergic signalling system, which uses purines and pyrimidines as chemical transmitters, and purinoceptors as effectors, is deeply rooted in evolution and development and is a pivotal factor in cell communication. The ATP and its derivatives function as a 'danger signal' in the most primitive forms of life. Purinoceptors are extraordinarily widely distributed in all cell types and tissues and they are involved in the regulation of an even more extraordinary number of biological processes. In addition to fast purinergic signalling in neurotransmission, neuromodulation and secretion, there is long-term (trophic) purinergic signalling involving cell proliferation, differentiation, motility and death in the development and regeneration of most systems of the body. In this article, we focus on the latter in the immune/defence system, in stratified epithelia in visceral organs and skin, embryological development, bone formation and resorption, as well as in cancer. Cell Death and Disease (2010) 1, e9; doi:10.1038/cddis.2009.11; published online 14 January 201
    corecore