
    Multiple instance learning for sequence data with across bag dependencies

    In the Multiple Instance Learning (MIL) problem for sequence data, the instances inside the bags are sequences. In some real-world applications such as bioinformatics, comparing a random pair of sequences makes no sense. In fact, each instance may have structural and/or functional relations with instances of other bags. Thus, the classification task should take this across-bag relation into account. In this work, we present two novel MIL approaches for sequence data classification named ABClass and ABSim. ABClass extracts motifs from related instances and uses them to encode sequences. A discriminative classifier is then applied to compute a partial classification result for each set of related sequences. ABSim uses a similarity measure to discriminate the related instances and to compute a score matrix. For both approaches, an aggregation method is applied in order to generate the final classification result. We applied both approaches to solve the problem of bacterial Ionizing Radiation Resistance prediction. The experimental results of the presented approaches are satisfactory.

    Bag-Level Aggregation for Multiple Instance Active Learning in Instance Classification Problems

    A growing number of applications, e.g. video surveillance and medical image analysis, require training recognition systems from large amounts of weakly annotated data, while some targeted interactions with a domain expert are allowed to improve the training process. In such cases, active learning (AL) can reduce labeling costs for training a classifier by querying the expert to provide the labels of the most informative instances. This paper focuses on AL methods for instance classification problems in multiple instance learning (MIL), where data is arranged into sets, called bags, that are weakly labeled. Most AL methods focus on single instance learning problems. These methods are not suitable for MIL problems because they cannot account for the bag structure of data. In this paper, new methods for bag-level aggregation of instance informativeness are proposed for multiple instance active learning (MIAL). The aggregated informativeness method identifies the most informative instances based on classifier uncertainty, and queries bags incorporating the most information. The other proposed method, called cluster-based aggregative sampling, clusters data hierarchically in the instance space. The informativeness of instances is assessed by considering bag labels, inferred instance labels, and the proportion of labels that remain to be discovered in clusters. Both proposed methods significantly outperform reference methods in extensive experiments using benchmark data from several application domains. Results indicate that using an appropriate strategy to address MIAL problems yields a significant reduction in the number of queries needed to achieve the same level of performance as single instance AL methods.

    Learning and Interpreting Multi-Multi-Instance Learning Networks

    We introduce an extension of the multi-instance learning problem where examples are organized as nested bags of instances (e.g., a document could be represented as a bag of sentences, which in turn are bags of words). This framework can be useful in various scenarios, such as text and image classification, but also supervised learning over graphs. As a further advantage, multi-multi-instance learning enables a particular way of interpreting predictions and the decision function. Our approach is based on a special neural network layer, called bag-layer, whose units aggregate bags of inputs of arbitrary size. We prove theoretically that the associated class of functions contains all Boolean functions over sets of sets of instances, and we provide empirical evidence that functions of this kind can actually be learned on semi-synthetic datasets. We finally present experiments on text classification, on citation graphs, and on social graph data, which show that our model obtains competitive accuracy compared with other approaches such as convolutional networks on graphs, while at the same time supporting a general approach to interpreting the learnt model and explaining individual predictions. Comment: JML
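The abstract above describes a bag-layer only as a layer whose units aggregate bags of inputs of arbitrary size. The following is a minimal sketch of that idea, not the authors' implementation: the ReLU activation, the max aggregation, and the names `bag_layer` and `nested_bag_layer` are assumptions chosen for illustration.

```python
from typing import List, Sequence

Vector = Sequence[float]


def bag_layer(bag: Sequence[Vector],
              weights: Sequence[Vector],
              biases: Sequence[float]) -> List[float]:
    """Map a bag with any number of instance vectors to one fixed-size vector.

    Assumption: each unit applies ReLU(w . x + b) to every instance and then
    takes the maximum over the bag, so the result ignores instance order.
    """
    if not bag:
        raise ValueError("bag must contain at least one instance")
    output = []
    for w, b in zip(weights, biases):
        activations = [
            max(0.0, sum(wi * xi for wi, xi in zip(w, x)) + b)
            for x in bag
        ]
        output.append(max(activations))
    return output


def nested_bag_layer(top_bag: Sequence[Sequence[Vector]],
                     inner_w: Sequence[Vector], inner_b: Sequence[float],
                     outer_w: Sequence[Vector], outer_b: Sequence[float]) -> List[float]:
    """Stack two bag-layers for a bag of bags (e.g. sentences made of words)."""
    # Each inner bag collapses to one vector; those vectors form the outer bag.
    inner_vectors = [bag_layer(sub_bag, inner_w, inner_b) for sub_bag in top_bag]
    return bag_layer(inner_vectors, outer_w, outer_b)
```

Because the aggregation is a maximum, reordering the instances in a bag, or the sub-bags in a nested bag, leaves the output unchanged, which is the property a set-valued input requires.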

    Spatial Patterns and Sequential Sampling Plans for Estimating Densities of Stink Bugs (Hemiptera: Pentatomidae) in Soybean in the North Central Region of the United States

    Stink bugs are an emerging threat to soybean (Fabales: Fabaceae) in the North Central Region of the United States. Consequently, region-specific scouting recommendations for stink bugs are needed. The aim of this study was to characterize the spatial pattern and to develop sampling plans to estimate stink bug population density in soybean fields. In 2016 and 2017, 125 fields distributed across nine states were sampled using sweep nets. Regression analyses were used to determine the effects of stink bug species [Chinavia hilaris (Say) (Hemiptera: Pentatomidae) and Euschistus spp. (Hemiptera: Pentatomidae)], life stages (nymphs and adults), and field locations (edge and interior) on spatial pattern as represented by variance–mean relationships. Results showed that stink bugs were aggregated. Sequential sampling plans were developed for each combination of species, life stage, and location and for all the data combined. Results for required sample size showed that an average of 40–42 sample units (sets of 25 sweeps) would be necessary to achieve a precision of 0.25 for stink bug densities commonly encountered across the region. However, based on the observed geographic gradient of stink bug densities, more practical sample sizes (5–10 sample units) may be sufficient in states in the southeastern part of the region, whereas impractical sample sizes (>100 sample units) may be required in the northwestern part of the region. Our findings provide research-based sampling recommendations for estimating densities of these emerging pests in soybean.
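The link between a variance–mean relationship and a required sample size can be sketched as follows. This assumes the relationship takes the form of Taylor's power law, s² = a·m^b, and that precision D is the standard error divided by the mean, which gives n = a·m^(b−2)/D². The abstract names neither the model nor its coefficients, so the function name and the values of `a` and `b` used with it are hypothetical and are not results from the study.

```python
import math


def required_sample_units(mean_density: float, a: float, b: float,
                          precision: float = 0.25) -> int:
    """Sample units needed to estimate a mean density at a fixed precision.

    Assumes Taylor's power law (variance = a * mean**b) and precision defined
    as standard error / mean. The coefficients a and b must come from a fitted
    variance-mean regression; none are given in the abstract.
    """
    if mean_density <= 0 or precision <= 0:
        raise ValueError("mean_density and precision must be positive")
    n = a * mean_density ** (b - 2) / precision ** 2
    return math.ceil(n)
```

With an aggregated population (b below 2), the required number of sample units rises as the mean density falls, which is consistent with the abstract's note that low-density parts of the region would need impractically large samples.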