Search CORE

10,607 research outputs found

Multiple instance learning for sequence data with across bag dependencies

Author: Aridhi Sabeur
Maddouri Mondher
Nguifo Engelbert Mephu
Zoghlami Manel
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2019
Field of study

In Multiple Instance Learning (MIL) problem for sequence data, the instances inside the bags are sequences. In some real world applications such as bioinformatics, comparing a random couple of sequences makes no sense. In fact, each instance may have structural and/or functional relations with instances of other bags. Thus, the classification task should take into account this across bag relation. In this work, we present two novel MIL approaches for sequence data classification named ABClass and ABSim. ABClass extracts motifs from related instances and use them to encode sequences. A discriminative classifier is then applied to compute a partial classification result for each set of related sequences. ABSim uses a similarity measure to discriminate the related instances and to compute a scores matrix. For both approaches, an aggregation method is applied in order to generate the final classification result. We applied both approaches to solve the problem of bacterial Ionizing Radiation Resistance prediction. The experimental results of the presented approaches are satisfactory

arXiv.org e-Print Archive

HAL Clermont Université

INRIA a CCSD electronic archive server

Recommended from our members

Simulating intertwined design processes that have similar structures: A case study of a small company that creates made-to-order fashion products

Author: Clarkson PJ
Eckert CM
Wynn DC
Publication venue: International Journal of Product Development
Publication date: 01/01/2011
Field of study

The authors use simulation to analyse the resource-driven dependencies between concurrent processes used to create customised products in a company. Such processes are uncertain and unique according to the design changes required. However, they have similar structures. For simulation, a level of abstraction is chosen such that all possible processes are represented by the same activity network. Differences between processes are determined by the customisations that they implement. The approach is illustrated through application to a small business that creates customised fashion products. We suggest that similar techniques could be applied to study intertwined design processes in more complex domains.The case study was carried out as part of Considerate Design for Personalised Fashion funded by the EPSRC/AHRC Design in the 21st century programme. The context of a multi-project environment was analysed as part of the EU Framework 7 CONVERGE project CP-FP 228746-2.Post-prin

Open Research Online (The Open University)

Apollo (Cambridge)

On Classification with Bags, Groups and Sets

Author: Cheplygina Veronika
Loog Marco
Tax David M. J.
Publication venue: 'Elsevier BV'
Publication date: 07/10/2014
Field of study

Many classification problems can be difficult to formulate directly in terms of the traditional supervised setting, where both training and test samples are individual feature vectors. There are cases in which samples are better described by sets of feature vectors, that labels are only available for sets rather than individual samples, or, if individual labels are available, that these are not independent. To better deal with such problems, several extensions of supervised learning have been proposed, where either training and/or test objects are sets of feature vectors. However, having been proposed rather independently of each other, their mutual similarities and differences have hitherto not been mapped out. In this work, we provide an overview of such learning scenarios, propose a taxonomy to illustrate the relationships between them, and discuss directions for further research in these areas

arXiv.org e-Print Archive

CiteSeerX

Copenhagen University Research Information System

COTA: Improving the Speed and Accuracy of Customer Support through Ranking and Deep Networks

Author: Bahdanau Dzmitry
Diederik
Hakkani-Tür Dilek
Ioffe Sergey
Liang Chen
McCulloh Ian
Rocktäschel Tim
Sarikaya R.
Sutskever Ilya
van der Maaten Laurens
Zhang Xiang
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 03/07/2018
Field of study

For a company looking to provide delightful user experiences, it is of paramount importance to take care of any customer issues. This paper proposes COTA, a system to improve speed and reliability of customer support for end users through automated ticket classification and answers selection for support representatives. Two machine learning and natural language processing techniques are demonstrated: one relying on feature engineering (COTA v1) and the other exploiting raw signals through deep learning architectures (COTA v2). COTA v1 employs a new approach that converts the multi-classification task into a ranking problem, demonstrating significantly better performance in the case of thousands of classes. For COTA v2, we propose an Encoder-Combiner-Decoder, a novel deep learning architecture that allows for heterogeneous input and output feature types and injection of prior knowledge through network architecture choices. This paper compares these models and their variants on the task of ticket classification and answer selection, showing model COTA v2 outperforms COTA v1, and analyzes their inner workings and shortcomings. Finally, an A/B test is conducted in a production setting validating the real-world impact of COTA in reducing issue resolution time by 10 percent without reducing customer satisfaction

arXiv.org e-Print Archive

Crossref

Unsupervised Learning of Long-Term Motion Dynamics for Videos

Author: Alahi Alexandre
Fei-Fei Li
Huang De-An
Luo Zelun
Peng Boya
Publication venue
Publication date: 11/04/2017
Field of study

We present an unsupervised representation learning approach that compactly encodes the motion dependencies in videos. Given a pair of images from a video clip, our framework learns to predict the long-term 3D motions. To reduce the complexity of the learning framework, we propose to describe the motion as a sequence of atomic 3D flows computed with RGB-D modality. We use a Recurrent Neural Network based Encoder-Decoder framework to predict these sequences of flows. We argue that in order for the decoder to reconstruct these sequences, the encoder must learn a robust video representation that captures long-term motion dependencies and spatial-temporal relations. We demonstrate the effectiveness of our learned temporal representations on activity classification across multiple modalities and datasets such as NTU RGB+D and MSR Daily Activity 3D. Our framework is generic to any input modality, i.e., RGB, Depth, and RGB-D videos.Comment: CVPR 201

arXiv.org e-Print Archive

Infoscience - École polytechnique fédérale de Lausanne