
79 research outputs found

Learning from Categorical Attribute Relationships for Positive-Unlabeled Classification

Author: Ienco Dino
Pensa Ruggero Gaetano
Publication venue: Beijing University of Posts and Telecommunications
Publication date: 01/01/2014

Institutional Research Information System University of Turin

Rumor Spreading in Social Networks with Individual Privacy Policies

Author: Bioglio Livio
Pensa Ruggero Gaetano
Publication date: 01/01/2016

Institutional Research Information System University of Turin

Co-clustering Numerical Data under User-defined Constraints

Author: Cordero Francesca
Pensa Ruggero Gaetano
Publication date: 01/01/2010

Institutional Research Information System University of Turin

Impact of Neighbors on the Privacy of Individuals in Online Social Networks

Author: Bioglio Livio
Pensa Ruggero Gaetano
Publication venue: Elsevier BV
Publication date: 01/01/2017

Institutional Research Information System University of Turin

Geographic Summaries from Crowdsourced Data

Author: Meo Rosa
Pensa Ruggero Gaetano
Publication venue: Springer International Publishing
Publication date: 01/01/2014

Institutional Research Information System University of Turin

Concept-Enhanced Multi-view Co-clustering of Document Data

Author: Pensa Ruggero Gaetano
Rho Valentina
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2017

Crossref

Institutional Research Information System University of Turin

Constrained Co-clustering of Gene Expression Data

Author: Boulicaut J. F.
Pensa Ruggero Gaetano
Publication venue: Society for Industrial & Applied Mathematics (SIAM)
Publication date: 01/01/2008

Institutional Research Information System University of Turin

Leveraging additional knowledge to support coherent bicluster discovery in gene expression data

Author: Cordero Francesca
Pensa Ruggero Gaetano
Visconti Alessia
Publication venue: IOS Press
Publication date: 01/01/2014

Institutional Research Information System University of Turin

Social Network Analysis as Knowledge Discovery process: a case study on Digital Bibliography

Author: Coscia M.
Giannotti F.
Pensa Ruggero Gaetano
Publication venue: Institute of Electrical and Electronics Engineers (IEEE)
Publication date: 01/01/2009

Institutional Research Information System University of Turin

Hierarchical Co-Clustering: Off-line and Incremental Approaches

Author: Ienco Dino
Meo Rosa
Pensa Ruggero Gaetano
Publication venue: Springer Science and Business Media LLC
Publication date: 01/01/2014

Clustering data is challenging for two main reasons. First, the dimensionality of the data is often very high, which makes cluster interpretation hard; moreover, with high-dimensional data the classic metrics fail to identify the real similarities between objects. The second challenge is the evolving nature of the observed phenomena, which makes datasets accumulate over time. In this paper we propose solutions to both problems. To tackle the high-dimensionality problem, we apply a co-clustering approach to the dataset that stores the occurrence of features in the observed objects. Co-clustering computes a partition of objects and a partition of features simultaneously. The novelty of our co-clustering solution is that it arranges the clusters in a hierarchical fashion, and it consists of two hierarchies: one on the objects and one on the features. The two hierarchies are coupled because the clusters at a given level of one hierarchy are paired with the clusters at the same level of the other hierarchy to form the co-clusters. Each cluster of one hierarchy thus provides insights into the clusters of the other. Another novelty of the proposed solution is that the number of clusters is possibly unlimited. Nevertheless, the produced hierarchies are still compact, and therefore more readable, because our method allows multiple splits of a cluster at the lower level. As regards the second challenge, the accumulating nature of the data makes the datasets intractably huge over time. In this case, an incremental solution relieves the issue because it partitions the problem. We introduce an incremental version of our hierarchical co-clustering algorithm. It starts from an intermediate solution computed on the previous version of the data and updates the co-clustering results considering only the added block of data. This solution speeds up the computation with respect to the original approach, which would recompute the result on the overall dataset. In addition, the incremental algorithm guarantees approximately the same answer as the original version while saving much of the computational load. We validate the incremental approach on several high-dimensional datasets and perform an accurate comparison with both the original version of our algorithm and the state-of-the-art competitors. The obtained results open the way to a novel usage of co-clustering algorithms in which it is advantageous to partition the data into several blocks and process them incrementally, thus "incorporating" data gradually into an on-going co-clustering solution.

Institutional Research Information System University of Turin
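
The abstract above describes a co-clustering as two simultaneous partitions (one of objects, one of features) and an incremental update that considers only the added block of data. The following is a minimal sketch of that bookkeeping only, not the authors' algorithm: it assumes the row and column cluster labels are already given, and the function names `block_sums` and `add_block` are hypothetical. It does not show how the partitions or hierarchies are chosen. Because the labels are fixed here, the incremental result matches a full recomputation exactly, whereas the abstract claims only approximately the same answer for the real method.

```python
from typing import List

Matrix = List[List[int]]


def block_sums(data: Matrix, row_labels: List[int], col_labels: List[int],
               n_row_clusters: int, n_col_clusters: int) -> Matrix:
    """Sum the occurrence counts falling into each (row cluster, column cluster) block."""
    table = [[0] * n_col_clusters for _ in range(n_row_clusters)]
    for row, r in zip(data, row_labels):
        for value, c in zip(row, col_labels):
            table[r][c] += value
    return table


def add_block(table: Matrix, new_rows: Matrix, new_row_labels: List[int],
              col_labels: List[int]) -> Matrix:
    """Update the block table in place using only the newly added rows."""
    for row, r in zip(new_rows, new_row_labels):
        for value, c in zip(row, col_labels):
            table[r][c] += value
    return table


# Toy object-by-feature occurrence matrix (3 objects, 4 features); values are illustrative.
data = [[2, 1, 0, 0],
        [1, 2, 0, 0],
        [0, 0, 3, 1]]
row_labels = [0, 0, 1]      # assumed partition of objects
col_labels = [0, 0, 1, 1]   # assumed partition of features

table = block_sums(data, row_labels, col_labels, 2, 2)

# A new block of data arrives: one object, assumed to belong to row cluster 1.
new_rows = [[0, 0, 1, 2]]
new_row_labels = [1]
incremental = add_block([r[:] for r in table], new_rows, new_row_labels, col_labels)

# Recomputing from scratch on the whole dataset gives the same table.
full = block_sums(data + new_rows, row_labels + new_row_labels, col_labels, 2, 2)
print(incremental == full)
```

The incremental path touches only the rows in the new block, which is the source of the saving the abstract refers to; in a real setting the new rows' labels would also have to be inferred, and existing assignments might need revising.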