Search CORE

859 research outputs found

Outlier Detection from Network Data with Subnetwork Interpretation

Author: Basu Prithwish
Dang Xuan-Hong
Silva Arlei
Singh Ambuj
Swami Ananthram
Publication venue
Publication date: 30/09/2016
Field of study

Detecting a small number of outliers from a set of data observations is always challenging. This problem is more difficult in the setting of multiple network samples, where computing the anomalous degree of a network sample is generally not sufficient. In fact, explaining why the network is exceptional, expressed in the form of subnetwork, is also equally important. In this paper, we develop a novel algorithm to address these two key problems. We treat each network sample as a potential outlier and identify subnetworks that mostly discriminate it from nearby regular samples. The algorithm is developed in the framework of network regression combined with the constraints on both network topology and L1-norm shrinkage to perform subnetwork discovery. Our method thus goes beyond subspace/subgraph discovery and we show that it converges to a global optimum. Evaluation on various real-world network datasets demonstrates that our algorithm not only outperforms baselines in both network and high dimensional setting, but also discovers highly relevant and interpretable local subnetworks, further enhancing our understanding of anomalous networks

arXiv.org e-Print Archive

Crossref

Distributed static linear Gaussian models using consensus

Author: Bajovic
Baldi
Bertsekas
De la Torre
Diamantaras
Forero
Forero
Garin
Gastpar
Hotelling
Jolliffe
Kar
Kargupta
Le Borgne
Olfati-Saber
Olshevsky
Pavle Belanovic
Pearson
Qi
Reyment
Roweis
Roweis
Santiago Zazo
Schizas
Schizas
Sergio Valcarcel Macua
Tipping
Tsitsiklis
van de Geijn
Xiao
Publication venue: 'Elsevier BV'
Publication date: 01/01/2012
Field of study

Algorithms for distributed agreement are a powerful means for formulating distributed versions of existing centralized algorithms. We present a toolkit for this task and show how it can be used systematically to design fully distributed algorithms for static linear Gaussian models, including principal component analysis, factor analysis, and probabilistic principal component analysis. These algorithms do not rely on a fusion center, require only low-volume local (1-hop neighborhood) communications, and are thus efficient, scalable, and robust. We show how they are also guaranteed to asymptotically converge to the same solution as the corresponding existing centralized algorithms. Finally, we illustrate the functioning of our algorithms on two examples, and examine the inherent cost-performance tradeoff

Infoscience - École polytechnique fédérale de Lausanne

Crossref

LAReferencia - Red Federada de Repositorios Institucionales de Publicaciones Científicas Latinoamericanas

Archivo Digital UPM

Face Recognition using R-KDA with Non-Linear SVM for Multi-View Database

Author: Devi Hidangmayum Saxena
Laishram Romesh
Thounaojam Dalton Meitei
Publication venue: The Authors. Published by Elsevier B.V.
Publication date: 31/12/2015
Field of study

AbstractThis paper develops a new Face Recognition System which combines R-KDA for selecting optimal discriminant features with non-linear SVM for Recognition. Experiment results have been conducted showing the comparison of enhanced efficiency of our proposed system over R-KDA with k-nn as the similarity distance measure

Elsevier - Publisher Connector

Online Deep Learning from Doubly-Streaming Data

Author: Atwood John S.
He Yi
Hou Bo-Jian
Lian Heng
Wu Jian
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2022
Field of study

This paper investigates a new online learning problem with doubly-streaming data, where the data streams are described by feature spaces that constantly evolve, with new features emerging and old features fading away. A plausible idea to deal with such data streams is to establish a relationship between the old and new feature spaces, so that an online learner can leverage the knowledge learned from the old features to better the learning performance on the new features. Unfortunately, this idea does not scale up to high-dimensional multimedia data with complex feature interplay, which suffers a tradeoff between onlineness, which biases shallow learners, and expressiveness, which requires deep models. Motivated by this, we propose a novel OLD3S paradigm, where a shared latent subspace is discovered to summarize information from the old and new feature spaces, building an intermediate feature mapping relationship. A key trait of OLD3S is to treat the model capacity as a learnable semantics, aiming to yield optimal model depth and parameters jointly in accordance with the complexity and non-linearity of the input data streams in an online fashion. Both theoretical analysis and empirical studies substantiate the viability and effectiveness of our proposed approach. The code is available online at https://github.com/X1aoLian/OLD3S

arXiv.org e-Print Archive

Old Dominion University