328 research outputs found

Learning and Interpreting Multi-Multi-Instance Learning Networks

Author: Frasconi Paolo, Jaeger Manfred, Tibo Alessandro
Publication date: 01/10/2020

We introduce an extension of the multi-instance learning problem where examples are organized as nested bags of instances (e.g., a document could be represented as a bag of sentences, which in turn are bags of words). This framework can be useful in various scenarios, such as text and image classification, but also supervised learning over graphs. As a further advantage, multi-multi-instance learning enables a particular way of interpreting predictions and the decision function. Our approach is based on a special neural network layer, called bag-layer, whose units aggregate bags of inputs of arbitrary size. We prove theoretically that the associated class of functions contains all Boolean functions over sets of sets of instances and we provide empirical evidence that functions of this kind can actually be learned on semi-synthetic datasets. We finally present experiments on text classification, on citation graphs, and on social graph data, which show that our model obtains competitive accuracy when compared to other approaches such as convolutional networks on graphs, while at the same time supporting a general approach to interpreting the learnt model and explaining individual predictions. Comment: JML
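The nested-bag idea in the abstract above can be sketched in a few lines. This is only an illustrative sketch, not the authors' implementation: it assumes each bag-layer applies an affine map followed by ReLU to every instance and then takes an element-wise maximum over the bag, and all function names, weights, and the toy "document" are hypothetical.

```python
# Hypothetical sketch of a bag-layer; the aggregation (max), activation
# (ReLU), names, and weights are assumptions, not taken from the abstract.

def relu(x):
    return x if x > 0.0 else 0.0

def bag_layer(bag, weights, bias):
    """Apply an affine map + ReLU to each instance, then take the
    element-wise maximum over the bag (works for any bag size >= 1)."""
    transformed = [
        [relu(sum(w * x for w, x in zip(row, instance)) + b)
         for row, b in zip(weights, bias)]
        for instance in bag
    ]
    # Element-wise max across instances: independent of instance order.
    return [max(column) for column in zip(*transformed)]

def nested_bag_forward(top_bag, inner_params, outer_params):
    """Apply one bag-layer to every sub-bag, then a second bag-layer to
    the resulting bag of sub-bag representations."""
    inner = [bag_layer(sub_bag, *inner_params) for sub_bag in top_bag]
    return bag_layer(inner, *outer_params)

# Toy "document": two sentences with 3 and 1 words; each word is a 2-d vector.
document = [
    [[1.0, 0.0], [0.0, 2.0], [-1.0, 1.0]],
    [[0.5, 0.5]],
]
inner_params = ([[1.0, 0.0], [0.0, 1.0]], [0.0, 0.0])  # identity map
outer_params = ([[1.0, 1.0]], [-1.0])                   # sum features, shift by -1

doc_repr = nested_bag_forward(document, inner_params, outer_params)
print(doc_repr)
```

Because the aggregation is a maximum, reordering the sentences (or the words inside a sentence) leaves the output unchanged, which is what lets the layer accept bags of arbitrary size.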

arXiv.org e-Print Archive

VBN

Shift Aggregate Extract Networks

Author: Baracchi Daniele, Frasconi Paolo, Orsini Francesco
Publication date: 16/03/2017

We introduce an architecture based on deep hierarchical decompositions to learn effective representations of large graphs. Our framework extends classic R-decompositions used in kernel methods, enabling nested "part-of-part" relations. Unlike recursive neural networks, which unroll a template on input graphs directly, we unroll a neural network template over the decomposition hierarchy, allowing us to deal with the high degree variability that typically characterizes social network graphs. Deep hierarchical decompositions are also amenable to domain compression, a technique that reduces both space and time complexity by exploiting symmetries. We show empirically that our approach is competitive with current state-of-the-art graph classification methods, particularly when dealing with social network datasets.

arXiv.org e-Print Archive

Florence Research

Directory of Open Access Journals

Frontiers - Publisher Connector

Learning and Interpreting Multi-Multi-Instance Learning Networks

Author: Alessandro Tibo, Frasconi Paolo, Manfred Jaeger
Publication date: 01/01/2020

Florence Research

Classification of cancer pathology reports: a large-scale comparative study

Author: Frasconi Paolo, Martina Stefano, Ventura Leonardo
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2020

We report on the application of state-of-the-art deep learning techniques to the automatic and interpretable assignment of ICD-O3 topography and morphology codes to free-text cancer reports. We present results on a large dataset (more than 80 000 labeled and 1 500 000 unlabeled anonymized reports written in Italian and collected from hospitals in Tuscany over more than a decade) and with a large number of classes (134 morphological classes and 61 topographical classes). We compare alternative architectures in terms of prediction accuracy and interpretability and show that our best model achieves a multiclass accuracy of 90.3% on topography site assignment and 84.8% on morphology type assignment. We found that in this context hierarchical models are not better than flat models and that an element-wise maximum aggregator is slightly better than attentive models on site classification. Moreover, the maximum aggregator offers a way to interpret the classification process. Comment: 10 pages, 6 figures, 3 tables, accepted for publication in IEEE Journal of Biomedical and Health Informatics (J-BHI
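As a rough illustration of why an element-wise maximum aggregator lends itself to interpretation, the sketch below pools token vectors with a max and records which token supplied each maximum. The tokens, vectors, and the idea of counting "winning" tokens are hypothetical assumptions for illustration; the abstract does not describe the authors' exact procedure.

```python
from collections import Counter

def max_aggregate(token_vectors):
    """Element-wise max over token vectors, plus the index of the token
    that supplied each maximum (first token wins on ties)."""
    pooled, winners = [], []
    for column in zip(*token_vectors):
        best = max(range(len(column)), key=column.__getitem__)
        pooled.append(column[best])
        winners.append(best)
    return pooled, winners

# Hypothetical tokens and made-up 3-d feature vectors (not from the paper).
tokens = ["biopsy", "adenocarcinoma", "colon"]
vectors = [
    [0.1, 0.3, 0.0],
    [0.9, 0.2, 0.1],
    [0.2, 0.8, 0.4],
]

pooled, winners = max_aggregate(vectors)

# One possible reading of "interpretation": count how many pooled
# features each token contributed.
contribution = Counter(tokens[i] for i in winners)
print(pooled, contribution)
```

Every pooled feature traces back to exactly one token, so tokens that win many features can be highlighted as the ones driving the representation passed to the classifier.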

arXiv.org e-Print Archive

Florence Research

kProbLog: an algebraic Prolog for machine learning

Author: de Raedt Luc, Frasconi Paolo, Orsini Francesco
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

Florence Research