22 research outputs found

    表紙

    Get PDF
    千葉大学教育学部研究紀要. II, 人文・社会科学編 4

    Prediction of protein assemblies, the next frontier: The CASP14-CAPRI experiment

    Get PDF
    We present the results for CAPRI Round 50, the fourth joint CASP-CAPRI protein assembly prediction challenge. The Round comprised a total of twelve targets, including six dimers, three trimers, and three higher-order oligomers. Four of these were easy targets, for which good structural templates were available either for the full assembly, or for the main interfaces (of the higher-order oligomers). Eight were difficult targets for which only distantly related templates were found for the individual subunits. Twenty-five CAPRI groups including eight automatic servers submitted ~1250 models per target. Twenty groups including six servers participated in the CAPRI scoring challenge submitted ~190 models per target. The accuracy of the predicted models was evaluated using the classical CAPRI criteria. The prediction performance was measured by a weighted scoring scheme that takes into account the number of models of acceptable quality or higher submitted by each group as part of their five top-ranking models. Compared to the previous CASP-CAPRI challenge, top performing groups submitted such models for a larger fraction (70–75%) of the targets in this Round, but fewer of these models were of high accuracy. Scorer groups achieved stronger performance with more groups submitting correct models for 70–80% of the targets or achieving high accuracy predictions. Servers performed less well in general, except for the MDOCKPP and LZERD servers, who performed on par with human groups. In addition to these results, major advances in methodology are discussed, providing an informative overview of where the prediction of protein assemblies currently stands.Cancer Research UK, Grant/Award Number: FC001003; Changzhou Science and Technology Bureau, Grant/Award Number: CE20200503; Department of Energy and Climate Change, Grant/Award Numbers: DE-AR001213, DE-SC0020400, DE-SC0021303; H2020 European Institute of Innovation and Technology, Grant/Award Numbers: 675728, 777536, 823830; Institut national de recherche en informatique et en automatique (INRIA), Grant/Award Number: Cordi-S; Lietuvos Mokslo Taryba, Grant/Award Numbers: S-MIP-17-60, S-MIP-21-35; Medical Research Council, Grant/Award Number: FC001003; Japan Society for the Promotion of Science KAKENHI, Grant/Award Number: JP19J00950; Ministerio de Ciencia e Innovación, Grant/Award Number: PID2019-110167RB-I00; Narodowe Centrum Nauki, Grant/Award Numbers: UMO-2017/25/B/ST4/01026, UMO-2017/26/M/ST4/00044, UMO-2017/27/B/ST4/00926; National Institute of General Medical Sciences, Grant/Award Numbers: R21GM127952, R35GM118078, RM1135136, T32GM132024; National Institutes of Health, Grant/Award Numbers: R01GM074255, R01GM078221, R01GM093123, R01GM109980, R01GM133840, R01GN123055, R01HL142301, R35GM124952, R35GM136409; National Natural Science Foundation of China, Grant/Award Number: 81603152; National Science Foundation, Grant/Award Numbers: AF1645512, CCF1943008, CMMI1825941, DBI1759277, DBI1759934, DBI1917263, DBI20036350, IIS1763246, MCB1925643; NWO, Grant/Award Number: TOP-PUNT 718.015.001; Wellcome Trust, Grant/Award Number: FC00100

    Routine health management information system data in Ethiopia: consistency, trends, and challenges.

    Get PDF
    Background: Ethiopia is investing in the routine Health Management Information System. Improved routine data are needed for decision-making in the health sector. Objective: To analyse the quality of the routine Health Management Information System data and triangulate with other sources, such as the Demographic and Health Surveys. Methods: We analysed national Health Management Information System data on 19 indicators of maternal health, neonatal survival, immunization, child nutrition, malaria, and tuberculosis over the 2012-2018 time period. The analyses were conducted by 38 analysts from the Ministry of Health, Ethiopia, and two government agencies who participated in the Operational Research and Coaching for Analysts (ORCA) project between June 2018 and June 2020. Using a World Health Organization Data Quality Review toolkit, we assessed indicator definitions, completeness, internal consistency over time and between related indicators, and external consistency compared with other data sources. Results: Several services reported coverage of above 100%. For many indicators, denominators were based on poor-quality population data estimates. Data on individual vaccinations had relatively good internal consistency. In contrast, there was low external consistency for data on fully vaccinated children, with the routine Health Management Information System showing 89% coverage but the Demographic and Health Survey estimate at 39%. Maternal health indicators displayed increasing coverage over time. Indicators on child nutrition, malaria, and tuberculosis were less consistent. Data on neonatal mortality were incomplete and operationalised as mortality on day 0-6. Our comparisons with survey and population projections indicated that one in eight early neonatal deaths were reported in the routine Health Management Information System. Data quality varied between regions. Conclusions: The quality of routine data gathered in the health system needs further attention. We suggest regular triangulation with data from other sources. We recommend addressing the denominator issues, reducing the complexity of indicators, and aligning indicators to international definitions

    The evolving SARS-CoV-2 epidemic in Africa: Insights from rapidly expanding genomic surveillance

    Get PDF
    INTRODUCTION Investment in Africa over the past year with regard to severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) sequencing has led to a massive increase in the number of sequences, which, to date, exceeds 100,000 sequences generated to track the pandemic on the continent. These sequences have profoundly affected how public health officials in Africa have navigated the COVID-19 pandemic. RATIONALE We demonstrate how the first 100,000 SARS-CoV-2 sequences from Africa have helped monitor the epidemic on the continent, how genomic surveillance expanded over the course of the pandemic, and how we adapted our sequencing methods to deal with an evolving virus. Finally, we also examine how viral lineages have spread across the continent in a phylogeographic framework to gain insights into the underlying temporal and spatial transmission dynamics for several variants of concern (VOCs). RESULTS Our results indicate that the number of countries in Africa that can sequence the virus within their own borders is growing and that this is coupled with a shorter turnaround time from the time of sampling to sequence submission. Ongoing evolution necessitated the continual updating of primer sets, and, as a result, eight primer sets were designed in tandem with viral evolution and used to ensure effective sequencing of the virus. The pandemic unfolded through multiple waves of infection that were each driven by distinct genetic lineages, with B.1-like ancestral strains associated with the first pandemic wave of infections in 2020. Successive waves on the continent were fueled by different VOCs, with Alpha and Beta cocirculating in distinct spatial patterns during the second wave and Delta and Omicron affecting the whole continent during the third and fourth waves, respectively. Phylogeographic reconstruction points toward distinct differences in viral importation and exportation patterns associated with the Alpha, Beta, Delta, and Omicron variants and subvariants, when considering both Africa versus the rest of the world and viral dissemination within the continent. Our epidemiological and phylogenetic inferences therefore underscore the heterogeneous nature of the pandemic on the continent and highlight key insights and challenges, for instance, recognizing the limitations of low testing proportions. We also highlight the early warning capacity that genomic surveillance in Africa has had for the rest of the world with the detection of new lineages and variants, the most recent being the characterization of various Omicron subvariants. CONCLUSION Sustained investment for diagnostics and genomic surveillance in Africa is needed as the virus continues to evolve. This is important not only to help combat SARS-CoV-2 on the continent but also because it can be used as a platform to help address the many emerging and reemerging infectious disease threats in Africa. In particular, capacity building for local sequencing within countries or within the continent should be prioritized because this is generally associated with shorter turnaround times, providing the most benefit to local public health authorities tasked with pandemic response and mitigation and allowing for the fastest reaction to localized outbreaks. These investments are crucial for pandemic preparedness and response and will serve the health of the continent well into the 21st century

    Machine learning and template based modeling for improving and expanding the functionality of rigid body docking

    Full text link
    Proteins govern practically every process in living organisms through inhibiting, activating, or acting on other proteins in different ways. With the large and growing number of known interactions through high throughput screening technologies, experimental determination of atomic-level details of these interactions is nigh impossible. Computational methods such as docking can speed up efforts of understanding these interactions. However, several issues ought to be addressed before docking can replace experimental methods. This thesis describes work on assessment of the state of the art in docking methods, implementation of a machine learning algorithm to improve model ranking and integration of docking with template-based modeling to expand its usage with a special focus on antibody-antigen interactions. Firstly, the performance of docking methods was rigorously assessed by using a diverse set of protein complexes with a special focus on ClusPro, one of the leading rigid-body docking servers. Different strengths and potential areas of improvement for ClusPro and rigid-body docking methods in general were highlighted. Secondly, one of the major short-comings of docking noted in the first project, poor ranking of good models, was addressed. A regression-based machine learning algorithm was introduced to improve the ranking. Finally, a server was developed to tackle the challenge of epitope mapping by integrating template-based modeling with docking. An intuitive ensemble approach to scoring residue likelihood using docking poses and different homologues is shown to yield great success. In addition to shifting docking’s purpose of conformational search to interface identification, this server also allows users to start with protein sequence inputs.2022-09-28T00:00:00

    Neural network‐derived Potts models for structure‐based protein design using backbone atomic coordinates and tertiary motifs

    No full text
    Designing novel proteins to perform desired functions, such as binding or catalysis, is a major goal in synthetic biology. A variety of computational approaches can aid in this task. An energy-based framework rooted in the sequence-structure statistics of tertiary motifs (TERMs) can be used for sequence design on predefined backbones. Neural network models that use backbone coordinate-derived features provide another way to design new proteins. In this work, we combine the two methods to make neural structure-based models more suitable for protein design. Specifically, we supplement backbone-coordinate features with TERM-derived data, as inputs, and we generate energy functions as outputs. We present two architectures that generate Potts models over the sequence space: TERMinator, which uses both TERM-based and coordinate-based information, and COORDinator, which uses only coordinate-based information. Using these two models, we demonstrate that TERMs can be utilized to improve native sequence recovery performance of neural models. Furthermore, we demonstrate that sequences designed by TERMinator are predicted to fold to their target structures by AlphaFold. Finally, we show that both TERMinator and COORDinator learn notions of energetics, and these methods can be fine-tuned on experimental data to improve predictions. Our results suggest that using TERM-based and coordinate-based features together may be beneficial for protein design and that structure-based neural models that produce Potts energy tables have utility for flexible applications in protein science

    ClusPro in rounds 38 to 45 of CAPRI: Toward combining template‐based methods with free docking

    No full text
    Targets in the protein docking experiment CAPRI (Critical Assessment of Predicted Interactions) generally present new challenges and contribute to new developments in methodology. In rounds 38 to 45 of CAPRI, most targets could be effectively predicted using template-based methods. However, the server ClusPro required structures rather than sequences as input, and hence we had to generate and dock homology models. The available templates also provided distance restraints that were directly used as input to the server. We show here that such an approach has some advantages. Free docking with template-based restraints using ClusPro reproduced some interfaces suggested by weak or ambiguous templates while not reproducing others, resulting in correct server predicted models. More recently we developed the fully automated ClusPro TBM server that performs template-based modeling and thus can use sequences rather than structures of component proteins as input. The performance of the server, freely available for noncommercial use at https://tbm.cluspro.org, is demonstrated by predicting the protein-protein targets of rounds 38 to 45 of CAPRI
    corecore