361 research outputs found
Gravitational Clustering: A Simple, Robust and Adaptive Approach for Distributed Networks
Distributed signal processing for wireless sensor networks enables that
different devices cooperate to solve different signal processing tasks. A
crucial first step is to answer the question: who observes what? Recently,
several distributed algorithms have been proposed, which frame the
signal/object labelling problem in terms of cluster analysis after extracting
source-specific features, however, the number of clusters is assumed to be
known. We propose a new method called Gravitational Clustering (GC) to
adaptively estimate the time-varying number of clusters based on a set of
feature vectors. The key idea is to exploit the physical principle of
gravitational force between mass units: streaming-in feature vectors are
considered as mass units of fixed position in the feature space, around which
mobile mass units are injected at each time instant. The cluster enumeration
exploits the fact that the highest attraction on the mobile mass units is
exerted by regions with a high density of feature vectors, i.e., gravitational
clusters. By sharing estimates among neighboring nodes via a
diffusion-adaptation scheme, cooperative and distributed cluster enumeration is
achieved. Numerical experiments concerning robustness against outliers,
convergence and computational complexity are conducted. The application in a
distributed cooperative multi-view camera network illustrates the applicability
to real-world problems.Comment: 12 pages, 9 figure
Bayesian Cluster Enumeration Criterion for Unsupervised Learning
We derive a new Bayesian Information Criterion (BIC) by formulating the
problem of estimating the number of clusters in an observed data set as
maximization of the posterior probability of the candidate models. Given that
some mild assumptions are satisfied, we provide a general BIC expression for a
broad class of data distributions. This serves as a starting point when
deriving the BIC for specific distributions. Along this line, we provide a
closed-form BIC expression for multivariate Gaussian distributed variables. We
show that incorporating the data structure of the clustering problem into the
derivation of the BIC results in an expression whose penalty term is different
from that of the original BIC. We propose a two-step cluster enumeration
algorithm. First, a model-based unsupervised learning algorithm partitions the
data according to a given set of candidate models. Subsequently, the number of
clusters is determined as the one associated with the model for which the
proposed BIC is maximal. The performance of the proposed two-step algorithm is
tested using synthetic and real data sets.Comment: 14 pages, 7 figure
EC1541 D D T for Lice
Extension circular 1541 discusses D D T for lice
EC1541 D D T for Lice
Extension circular 1541 discusses D D T for lice
Foot and mouth disease in Zambia: Spatial and temporal distributions of outbreaks, assessment of clusters and implications for control
Zambia has been experiencing low livestock productivity as well as trade restrictions owing to the occurrence of foot and mouth disease (FMD), but little is known about the epidemiology of the disease in these endemic settings. The fundamental questions relate to the spatio-temporal distribution of FMD cases and what determines their occurrence. A retrospective review of FMD cases in Zambia from 1981 to 2012 was conducted using geographical information systems and the SaTScan software package. Information was collected from peer-reviewed journal articles, conference proceedings, laboratory reports, unpublished scientific reports and grey literature. A space–time permutation probability model using a varying time window of one year was used to scan for areas with high infection rates. The spatial scan statistic detected a significant purely spatial cluster around the Mbala–Isoka area between 2009 and 2012, with secondary clusters in Sesheke–Kazungula in 2007 and 2008, the Kafue flats in 2004 and 2005 and Livingstone in 2012. This study provides evidence of the existence of statistically significant FMD clusters and an increase in occurrence in Zambia between 2004 and 2012. The identified clusters agree with areas known to be at high risk of FMD. The FMD virus transmission dynamics and the heterogeneous variability in risk within these locations may need further investigation
Quantitative Risk Assessment of Developing Salmonellosis through Consumption of Beef in Lusaka Province, Zambia
Based on the Codex Alimentarious framework, this study quantitatively assessed the risk of developing salmonellosis through consumption of beef in Lusaka Province of Zambia. Data used to achieve this objective were obtained from reviews of scientific literature, Government reports, and survey results from a questionnaire that was administered to consumers to address information gaps from secondary data. The Swift Quantitative Microbiological Risk Assessment (sQMRA) model was used to analyse the data. The study was driven by a lack of empircally-based risk estimation despite a number of reported cases of salmonellosis in humans.
A typology of consumers including all age groups was developed based on their beef consumption habits, distinguishing between those with low home consumption, those with medium levels of home consumption, and those with high levels through restaurant consumption. This study shows that the risk of developing salmonellosis in this population, from consuming beef, was generally low. At ID50 of 9.61 Ă— 103 cfu/g and a retail contamination concentration of 12 cfu/g, the risk of developing salmonellosis through the consumption of beef prepared by consumers with low and medium levels of beef consumption was estimated at 0.06% and 0.08%, respectively, while the risk associated with restaurant consumption was estimated at 0.16% per year.
The study concludes that the risk of developing salmonellosis among residents in Lusaka province, as a result of beef consumption, was generally low, mainly due to the methods used for food preparation. Further work is required to broaden the scope of the study and also undertake microbiological evaluation of ready-to-eat beef from both the household and restaurant risk exposure pathways
Robust and Efficient Aggregation for Distributed Learning
Distributed learning paradigms, such as federated and decentralized learning,
allow for the coordination of models across a collection of agents, and without
the need to exchange raw data. Instead, agents compute model updates locally
based on their available data, and subsequently share the update model with a
parameter server or their peers. This is followed by an aggregation step, which
traditionally takes the form of a (weighted) average. Distributed learning
schemes based on averaging are known to be susceptible to outliers. A single
malicious agent is able to drive an averaging-based distributed learning
algorithm to an arbitrarily poor model. This has motivated the development of
robust aggregation schemes, which are based on variations of the median and
trimmed mean. While such procedures ensure robustness to outliers and malicious
behavior, they come at the cost of significantly reduced sample efficiency.
This means that current robust aggregation schemes require significantly higher
agent participation rates to achieve a given level of performance than their
mean-based counterparts in non-contaminated settings. In this work we remedy
this drawback by developing statistically efficient and robust aggregation
schemes for distributed learning
- …