
    Integrating protein-protein interactions and text mining for protein function prediction

    Background: Functional annotation of proteins remains a challenging task. Currently, the scientific literature serves as the main source of as-yet-uncurated functional annotations, but curation work is slow and expensive. Automatic techniques that support this work still lack reliability. We developed a method to identify conserved protein interaction graphs and to predict missing protein functions from orthologs in these graphs. To enhance the precision of the results, we furthermore implemented a procedure that validates all predictions based on findings reported in the literature. Results: Using this procedure, more than 80% of the GO annotations for proteins with highly conserved orthologs that are available in UniProtKB/Swiss-Prot could be verified automatically. For a subset of proteins we predicted new GO annotations that were not available in UniProtKB/Swiss-Prot. All predictions were correct (100% precision) according to the verifications from a trained curator. Conclusion: Our method of integrating CCSs and literature mining is thus a highly reliable approach to predict GO annotations for weakly characterized proteins with orthologs.

    Identification of disease-causing genes using microarray data mining and gene ontology

    Background: One of the best and most accurate methods for identifying disease-causing genes is monitoring gene expression values in different samples using microarray technology. One of the shortcomings of microarray data is that they provide a small quantity of samples with respect to the number of genes. This problem reduces the classification accuracy of the methods, so gene selection is essential to improve the predictive accuracy and to identify potential marker genes for a disease. Among numerous existing methods for gene selection, support vector machine-based recursive feature elimination (SVMRFE) has become one of the leading methods, but its performance can be reduced because of the small sample size, noisy data and the fact that the method does not remove redundant genes. Methods: We propose a novel framework for gene selection which uses the advantageous features of conventional methods and addresses their weaknesses. In fact, we have combined the Fisher method and SVMRFE to utilize the advantages of a filtering method as well as an embedded method. Furthermore, we have added a redundancy reduction stage to address the weakness of the Fisher method and SVMRFE. In addition to gene expression values, the proposed method uses Gene Ontology which is a reliable source of information on genes. The use of Gene Ontology can compensate, in part, for the limitations of microarrays, such as having a small number of samples and erroneous measurement results. Results: The proposed method has been applied to colon, Diffuse Large B-Cell Lymphoma (DLBCL) and prostate cancer datasets. The empirical results show that our method has improved classification performance in terms of accuracy, sensitivity and specificity. In addition, the study of the molecular function of selected genes strengthened the hypothesis that these genes are involved in the process of cancer growth. 
    Conclusions: The proposed method addresses the weakness of conventional methods by adding a redundancy reduction stage and utilizing Gene Ontology information. It predicts marker genes for colon, DLBCL and prostate cancer with a high accuracy. The predictions made in this study can serve as a list of candidates for subsequent wet-lab verification and might help in the search for a cure for cancers.
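The filter-plus-redundancy-reduction idea in this abstract can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes a two-class Fisher score, (mean1 - mean0)^2 / (var1 + var0), and a greedy Pearson-correlation cutoff as the redundancy criterion. The abstract specifies neither, and the SVM-RFE and Gene Ontology stages are omitted entirely. The names `fisher_score`, `pearson`, `select_genes`, `top_k`, `max_corr` and the toy data are hypothetical.

```python
from math import sqrt
from statistics import mean, pvariance


def fisher_score(values, labels):
    """Two-class Fisher score for one gene (assumed form, see lead-in)."""
    pos = [v for v, y in zip(values, labels) if y == 1]
    neg = [v for v, y in zip(values, labels) if y == 0]
    denom = pvariance(pos) + pvariance(neg)
    diff = mean(pos) - mean(neg)
    if denom == 0:
        # Zero within-class variance: perfect separation or no signal at all.
        return float("inf") if diff != 0 else 0.0
    return diff ** 2 / denom


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0 or sy == 0:
        return 0.0
    return cov / (sx * sy)


def select_genes(expr, labels, top_k=2, max_corr=0.9):
    """Rank genes by Fisher score, then greedily skip redundant ones.

    expr maps a gene name to its expression values, one per sample.
    A gene counts as redundant (hypothetical criterion) if its absolute
    correlation with an already kept gene exceeds max_corr.
    """
    ranked = sorted(expr, key=lambda g: fisher_score(expr[g], labels), reverse=True)
    kept = []
    for gene in ranked:
        if all(abs(pearson(expr[gene], expr[k])) <= max_corr for k in kept):
            kept.append(gene)
        if len(kept) == top_k:
            break
    return kept


# Toy data: three samples per class; geneB closely tracks geneA.
labels = [0, 0, 0, 1, 1, 1]
expr = {
    "geneA": [1.0, 1.2, 0.9, 3.0, 3.1, 2.9],
    "geneB": [1.1, 1.3, 0.8, 2.9, 3.2, 2.8],
    "geneC": [5.0, 4.0, 6.0, 5.5, 4.5, 5.0],
}
selected = select_genes(expr, labels, top_k=2)
```

In this toy example geneB has the second-highest score but is skipped because it is almost perfectly correlated with geneA. That is the kind of redundancy a score-only ranking would not remove.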

    Jean-Baptiste Bélanger, hydraulic engineer, researcher and academic

    Jean-Baptiste BÉLANGER (1790-1874) worked as a hydraulic engineer at the beginning of his career. He developed the backwater equation to calculate gradually-varied open channel flow properties for steady flow conditions. Later, as an academic at the leading French engineering schools (Ecole Centrale des Arts et Manufactures, Ecole des Ponts et Chaussées, and Ecole Polytechnique), he developed a new university curriculum in mechanics and several textbooks including a seminal text in hydraulic engineering. His influence on his contemporaries was considerable, and his name is written on the border of one of the four facades of the Eiffel Tower. BÉLANGER's leading role demonstrated the dynamism of practicing engineers at the time, and his contributions paved the way for many significant works in hydraulics.

    Synthesis and Biocidal Activity of Some Naphthalene-Based Cationic Surfactants

    In this study, different cationic surfactants were prepared by reacting dodecyl bromide with tertiary amines to produce a series of quaternary ammonium salts, which were subsequently converted to stannous and cobalt cationic complexes by complexing them with stannous (II) or cobalt (II) ions. Surface properties such as surface and interfacial tension, and the emulsifying power of these surfactants, were investigated. The surface parameters studied included critical micelle concentration, maximum surface excess, minimum surface area, and tension-lowering efficiency and effectiveness. The free energies of micellization and adsorption were calculated. Antimicrobial activity was determined via the inhibition zone diameter of the prepared compounds, which was measured against six strains of a representative group of microorganisms. The antimicrobial activity of some of the prepared surfactants against sulfate-reducing bacteria was determined by the dilution method. FTIR spectra, elemental analysis and a 1H NMR spectrum were examined to confirm compound structure and purity. The results obtained indicate that these compounds have good surface properties and a good biocidal effect on a broad spectrum of microorganisms.

    A latent trait look at pretest-posttest validation of criterion-referenced test items

    Since Cox and Vargas (1966) introduced their pretest-posttest validity index for criterion-referenced test items, a great number of additions and modifications have followed. All are based on the idea of gain scoring; that is, they are computed from the differences between proportions of pretest and posttest item responses. Although the method is simple and generally considered the prototype of criterion-referenced item analysis, it has many serious disadvantages. Some of these go back to the fact that it leads to indices that require a dual test administration and rely on population-dependent item p values. Others have to do with the global information about the discriminating power that these indices provide, the implicit weighting they suppose, and the meaningless maximization of posttest scores they lead to. Analyzing the pretest-posttest method from a latent trait point of view, it is proposed to replace indices like Cox and Vargas’ Dpp by an evaluation of the item information function for the mastery score. An empirical study was conducted to compare the differences in item selection between both methods.
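To make the closing proposal of this abstract concrete, the sketch below evaluates an item information function at a mastery score and ranks items by it. The abstract does not name a latent trait model, so the two-parameter logistic (2PL) model used here is an assumption, for which I(θ) = a² · P(θ) · (1 − P(θ)). The function names, the item parameters and the mastery score `theta_c` are hypothetical.

```python
from math import exp


def p_correct(theta, a, b):
    """2PL probability of a correct response (assumed model).

    a is the item discrimination, b the item difficulty.
    """
    return 1.0 / (1.0 + exp(-a * (theta - b)))


def item_information(theta, a, b):
    """Fisher information of a 2PL item at ability theta: a^2 * P * (1 - P)."""
    p = p_correct(theta, a, b)
    return a * a * p * (1.0 - p)


def rank_items_at_mastery(items, theta_c):
    """Order item names by their information at the mastery score theta_c."""
    return sorted(
        items,
        key=lambda name: item_information(theta_c, *items[name]),
        reverse=True,
    )


# Hypothetical item bank: name -> (discrimination a, difficulty b).
items = {
    "item1": (1.5, 0.0),  # discriminating, difficulty at the mastery score
    "item2": (1.5, 2.0),  # discriminating, but far too hard
    "item3": (0.5, 0.0),  # well located, weakly discriminating
}
theta_c = 0.0  # hypothetical mastery score on the latent scale
ranking = rank_items_at_mastery(items, theta_c)
```

Unlike a gain-score index, this criterion needs only one set of calibrated item parameters. It favors items that discriminate sharply near the mastery score itself, which is where the mastery decision is made.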

    Search for new phenomena in final states with an energetic jet and large missing transverse momentum in pp collisions at √s = 8 TeV with the ATLAS detector

    Results of a search for new phenomena in final states with an energetic jet and large missing transverse momentum are reported. The search uses 20.3 fb⁻¹ of √s = 8 TeV data collected in 2012 with the ATLAS detector at the LHC. Events are required to have at least one jet with p_T > 120 GeV and no leptons. Nine signal regions are considered with increasing missing transverse momentum requirements between E_T^miss > 150 GeV and E_T^miss > 700 GeV. Good agreement is observed between the number of events in data and Standard Model expectations. The results are translated into exclusion limits on models with either large extra spatial dimensions, pair production of weakly interacting dark matter candidates, or production of very light gravitinos in a gauge-mediated supersymmetric model. In addition, limits on the production of an invisibly decaying Higgs-like boson leading to similar topologies in the final state are presented.

    Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction

    Background: While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species. Results: We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4) and could confirm more than 73% of them based on evidence in the literature. Conclusions: The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.

    Assessing road effects on bats: the role of landscape, road features, and bat activity on road-kills

    Recent studies suggest that roads can significantly impact bat populations. Though bats are one of the most threatened groups of European vertebrates, studies aiming to quantify bat mortality and determine the main factors driving it remain scarce. Between March 16 and October 31 of 2009, we surveyed road-killed bats daily along a 51-km-long transect that incorporates different types of roads in southern Portugal. We found 154 road-killed bats of 11 species. The two most common species in the study area, Pipistrellus kuhlii and P. pygmaeus, were also the most commonly identified road-kill, representing 72% of the total specimens collected. About two-thirds of the total mortality occurred between mid-July and late September, peaking in the second half of August. We also recorded casualties of threatened and rare species, including Miniopterus schreibersii, Rhinolophus ferrumequinum, R. hipposideros, Barbastella barbastellus, and Nyctalus leisleri. These species were found mostly in early autumn, corresponding to the mating and swarming periods. Landscape features were the most important variable subset for explaining bat casualties. Road stretches crossing or in the vicinity of high-quality habitats for bats (including dense Mediterranean woodland (“montado”) areas, water courses with riparian gallery, and water reservoirs) yielded a significantly higher number of casualties. Additionally, more road-killed bats were recorded on high-traffic road stretches with viaducts, in areas of higher bat activity and near known roosts.