34 research outputs found
Improving the performance of DomainDiscovery of protein domain boundary assignment using inter-domain linker index
BACKGROUND: Knowledge of protein domain boundaries is critical for the characterisation and understanding of protein function. The ability to identify domains without the knowledge of the structure – by using sequence information only – is an essential step in many types of protein analyses. In this present study, we demonstrate that the performance of DomainDiscovery is improved significantly by including the inter-domain linker index value for domain identification from sequence-based information. Improved DomainDiscovery uses a Support Vector Machine (SVM) approach and a unique training dataset built on the principle of consensus among experts in defining domains in protein structure. The SVM was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. RESULTS: Improved DomainDiscovery is compared with other methods by benchmarking against a structurally non-redundant dataset and also CASP5 targets. Improved DomainDiscovery achieves 70% accuracy for domain boundary identification in multi-domains proteins. CONCLUSION: Improved DomainDiscovery compares favourably to the performance of other methods and excels in the identification of domain boundaries for multi-domain proteins as a result of introducing support vector machine with benchmark_2 dataset
Improved general regression network for protein domain boundary prediction
Background: Protein domains present some of the most useful information that can be used to understand protein structure and functions. Recent research on protein domain boundary prediction has been mainly based on widely known machine learning techniques, such as Artificial Neural Networks and Support Vector Machines. In this study, we propose a new machine learning model (IGRN) that can achieve accurate and reliable classification, with significantly reduced computations. The IGRN was trained using a PSSM (Position Specific Scoring Matrix), secondary structure, solvent accessibility information and inter-domain linker index to detect possible domain boundaries for a target sequence. Results: The proposed model achieved average prediction accuracy of 67% on the Benchmark_2 dataset for domain boundary identification in multi-domains proteins and showed superior predictive performance and generalisation ability among the most widely used neural network models. With the CASP7 benchmark dataset, it also demonstrated comparable performance to existing domain boundary predictors such as DOMpro, DomPred, DomSSEA, DomCut and DomainDiscovery with 70.10% prediction accuracy. Conclusion: The performance of proposed model has been compared favourably to the performance of other existing machine learning based methods as well as widely known domain boundary predictors on two benchmark datasets and excels in the identification of domain boundaries in terms of model bias, generalisation and computational requirements. © 2008 Yoo et al; licensee BioMed Central Ltd
BLM and RMI1 alleviate RPA inhibition of topoIIIα decatenase activity
RPA is a single-stranded DNA binding protein that physically associates with the BLM complex. RPA stimulates BLM helicase activity as well as the double Holliday junction dissolution activity of the BLM-topoisomerase IIIα complex. We investigated the effect of RPA on the ssDNA decatenase activity of topoisomerase IIIα. We found that RPA and other ssDNA binding proteins inhibit decatenation by topoisomerase IIIα. Complex formation between BLM, TopoIIIα, and RMI1 ablates inhibition of decatenation by ssDNA binding proteins. Together, these data indicate that inhibition by RPA does not involve species-specific interactions between RPA and BLM-TopoIIIα-RMI1, which contrasts with RPA modulation of double Holliday junction dissolution. We propose that topoisomerase IIIα and RPA compete to bind to single-stranded regions of catenanes. Interactions with BLM and RMI1 enhance toposiomerase IIIα activity, promoting decatenation in the presence of RPA
Establishing bioinformatics research in the Asia Pacific
In 1998, the Asia Pacific Bioinformatics Network (APBioNet), Asia's oldest bioinformatics organisation was set up to champion the advancement of bioinformatics in the Asia Pacific. By 2002, APBioNet was able to gain sufficient critical mass to initiate the first International Conference on Bioinformatics (InCoB) bringing together scientists working in the field of bioinformatics in the region. This year, the InCoB2006 Conference was organized as the 5(th )annual conference of the Asia-Pacific Bioinformatics Network, on Dec. 18–20, 2006 in New Delhi, India, following a series of successful events in Bangkok (Thailand), Penang (Malaysia), Auckland (New Zealand) and Busan (South Korea). This Introduction provides a brief overview of the peer-reviewed manuscripts accepted for publication in this Supplement. It exemplifies a typical snapshot of the growing research excellence in bioinformatics of the region as we embark on a trajectory of establishing a solid bioinformatics research culture in the Asia Pacific that is able to contribute fully to the global bioinformatics community
Characterization of the interaction of staphylococcal enterotoxin B with CD1d expressed in human renal proximal tubule epithelial cells
Recommended from our members
Prevalence, years lived with disability, and trends in anaemia burden by severity and cause, 1990-2021: findings from the Global Burden of Disease Study 2021
Background
Anaemia is a major health problem worldwide. Global estimates of anaemia burden are crucial for developing appropriate interventions to meet current international targets for disease mitigation. We describe the prevalence, years lived with disability, and trends of anaemia and its underlying causes in 204 countries and territories.
Methods
We estimated population-level distributions of haemoglobin concentration by age and sex for each location from 1990 to 2021. We then calculated anaemia burden by severity and associated years lived with disability (YLDs). With data on prevalence of the causes of anaemia and associated cause-specific shifts in haemoglobin concentrations, we modelled the proportion of anaemia attributed to 37 underlying causes for all locations, years, and demographics in the Global Burden of Disease Study 2021.
Findings
In 2021, the global prevalence of anaemia across all ages was 24·3% (95% uncertainty interval [UI] 23·9–24·7), corresponding to 1·92 billion (1·89–1·95) prevalent cases, compared with a prevalence of 28·2% (27·8–28·5) and 1·50 billion (1·48–1·52) prevalent cases in 1990. Large variations were observed in anaemia burden by age, sex, and geography, with children younger than 5 years, women, and countries in sub-Saharan Africa and south Asia being particularly affected. Anaemia caused 52·0 million (35·1–75·1) YLDs in 2021, and the YLD rate due to anaemia declined with increasing Socio-demographic Index. The most common causes of anaemia YLDs in 2021 were dietary iron deficiency (cause-specific anaemia YLD rate per 100 000 population: 422·4 [95% UI 286·1–612·9]), haemoglobinopathies and haemolytic anaemias (89·0 [58·2–123·7]), and other neglected tropical diseases (36·3 [24·4–52·8]), collectively accounting for 84·7% (84·1–85·2) of anaemia YLDs.
Interpretation
Anaemia remains a substantial global health challenge, with persistent disparities according to age, sex, and geography. Estimates of cause-specific anaemia burden can be used to design locally relevant health interventions aimed at improving anaemia management and prevention.
Funding
Bill & Melinda Gates Foundation
Assessing the Spatial Transferability of Freight (Trip) Generation Models across and within States of India: Empirical Evidence and Implications for Benefit Transfer
Machine learning approach to predict protein phosphorylation sites by incorporating evolutionary information
Modeling of magnetic cooling power of manganite-based materials using computational intelligence approach
Magnetic refrigeration (MR) technology has been identified as a potential replacement for the gas compression system of refrigeration due to its environmental friendliness and high level of efficiency. This technology utilizes manganite-based materials as magnetic refrigerants due to the dependence of magnetic properties as well as relative cooling power (RCP) of this class of materials on the concentration and nature of the dopants. Quantifying the effect of dopants on RCP of manganite-based materials requires a long experimental procedures and techniques that are costly and time-consuming. In order to circumvent these challenges, we propose a model, based on support vector regression (SVR), which quickly estimates the RCP of doped manganite-based materials with high level of accuracy using crystal lattice constants as descriptors. The accuracy and ease with which the proposed SVR-based model estimates RCP of doped manganite-based materials is very promising and effective in designing MR system of desired RCP