95 research outputs found

    Bayesian optimization of the PC algorithm for learning Gaussian Bayesian networks

    Full text link
    The PC algorithm is a popular method for learning the structure of Gaussian Bayesian networks. It carries out statistical tests to determine absent edges in the network. It is hence governed by two parameters: (i) The type of test, and (ii) its significance level. These parameters are usually set to values recommended by an expert. Nevertheless, such an approach can suffer from human bias, leading to suboptimal reconstruction results. In this paper we consider a more principled approach for choosing these parameters in an automatic way. For this we optimize a reconstruction score evaluated on a set of different Gaussian Bayesian networks. This objective is expensive to evaluate and lacks a closed-form expression, which means that Bayesian optimization (BO) is a natural choice. BO methods use a model to guide the search and are hence able to exploit smoothness properties of the objective surface. We show that the parameters found by a BO method outperform those found by a random search strategy and the expert recommendation. Importantly, we have found that an often overlooked statistical test provides the best over-all reconstruction results

    Relationship among research collaboration, number of documents and number of citations. A case study in Spanish computer science production in 2000-2009.

    Get PDF
    This paper analyzes the relationship among research collaboration, number of documents and number of citations of computer science research activity. It analyzes the number of documents and citations and how they vary by number of authors. They are also analyzed (according to author set cardinality) under different circumstances, that is, when documents are written in different types of collaboration, when documents are published in different document types, when documents are published in different computer science subdisciplines, and, finally, when documents are published by journals with different impact factor quartiles. To investigate the above relationships, this paper analyzes the publications listed in the Web of Science and produced by active Spanish university professors between 2000 and 2009, working in the computer science field. Analyzing all documents, we show that the highest percentage of documents are published by three authors, whereas single-authored documents account for the lowest percentage. By number of citations, there is no positive association between the author cardinality and citation impact. Statistical tests show that documents written by two authors receive more citations per document and year than documents published by more authors. In contrast, results do not show statistically significant differences between documents published by two authors and one author. The research findings suggest that international collaboration results on average in publications with higher citation rates than national and institutional collaborations. We also find differences regarding citation rates between journals and conferences, across different computer science subdisciplines and journal quartiles as expected. Finally, our impression is that the collaborative level (number of authors per document) will increase in the coming years, and documents published by three or four authors will be the trend in computer science literature

    Multi-Target Prediction: A Unifying View on Problems and Methods

    Full text link
    Multi-target prediction (MTP) is concerned with the simultaneous prediction of multiple target variables of diverse type. Due to its enormous application potential, it has developed into an active and rapidly expanding research field that combines several subfields of machine learning, including multivariate regression, multi-label classification, multi-task learning, dyadic prediction, zero-shot learning, network inference, and matrix completion. In this paper, we present a unifying view on MTP problems and methods. First, we formally discuss commonalities and differences between existing MTP problems. To this end, we introduce a general framework that covers the above subfields as special cases. As a second contribution, we provide a structured overview of MTP methods. This is accomplished by identifying a number of key properties, which distinguish such methods and determine their suitability for different types of problems. Finally, we also discuss a few challenges for future research

    Ryanodine receptor point mutations confer diamide insecticide resistance in tomato leafminer, Tuta absoluta (Lepidoptera: Gelechiidae).

    Get PDF
    This is the author accepted manuscript. The final version is available from Elsevier via the DOI in this record.Insect ryanodine receptors (RyR) are the molecular target-site for the recently introduced diamide insecticides. Diamides are particularly active on Lepidoptera pests, including tomato leafminer, Tuta absoluta (Lepidoptera: Gelechiidae). High levels of diamide resistance were recently described in some European populations of T. absoluta, however, the mechanisms of resistance remained unknown. In this study the molecular basis of diamide resistance was investigated in a diamide resistant strain from Italy (IT-GELA-SD4), and additional resistant field populations collected in Greece, Spain and Brazil. The genetics of resistance was investigated by reciprocally crossing strain IT-GELA-SD4 with a susceptible strain and revealed an autosomal incompletely recessive mode of inheritance. To investigate the possible role of target-site mutations as known from diamondback moth (Plutella xylostella), we sequenced respective domains of the RyR gene of T. absoluta. Genotyping of individuals of IT-GELA-SD4 and field-collected strains showing different levels of diamide resistance revealed the presence of G4903E and I4746M RyR target-site mutations. These amino acid substitutions correspond to those recently described for diamide resistant diamondback moth, i.e. G4946E and I4790M. We also detected two novel mutations, G4903V and I4746T, in some of the resistant T. absoluta strains. Radioligand binding studies with thoracic membrane preparations of the IT-GELA-SD4 strain provided functional evidence that these mutations alter the affinity of the RyR to diamides. In combination with previous work on P. xylostella our study highlights the importance of position G4903 (G4946 in P. xylostella) of the insect RyR in defining sensitivity to diamides. The discovery of diamide resistance mutations in T. absoluta populations of diverse geographic origin has serious implications for the efficacy of diamides under applied conditions. The implementation of appropriate resistance management strategies is strongly advised to delay the further spread of resistance.The work of Hellenic Agricultural Organisation - 'Demeter' was partially supported by an ARIMnet2 StomP grand to A.T and E.R. This work was also partially funded from a fellowship granted to H.A.A.S. (CNPq - PQ - Proc 308461/2013-4). The Universidad Politécnica de Cartagena group would like to thank for partial financial support the Ministerio de Economía y Competitividad of Spain and FEDER (AGL2011-25164). Lidia García-Vidal holds a grant from the MECD (FPU13/01528). The Tuta absoluta strain from Gela, Sicily was collected under the frame a resistance monitoring program established among the Hellenic Agricultural Organisation - 'Demeter' and DuPont De Nemurs (data published in 2015). Finally, the Hellenic Agricultural Organisation - 'Demeter' would like to thank Fytochem S.A., Neo Mirtos, Ierapetra for supplies of plant material

    Regularized logistic regression and multi-objective variable selection for classifying MEG data

    Get PDF
    This paper addresses the question of maximizing classifier accuracy for classifying task-related mental activity from Magnetoencelophalography (MEG) data. We propose the use of different sources of information and introduce an automatic channel selection procedure. To determine an informative set of channels, our approach combines a variety of machine learning algorithms: feature subset selection methods, classifiers based on regularized logistic regression, information fusion, and multiobjective optimization based on probabilistic modeling of the search space. The experimental results show that our proposal is able to improve classification accuracy compared to approaches whose classifiers use only one type of MEG information or for which the set of channels is fixed a priori
    corecore