Search CORE

361 research outputs found

Gravitational Clustering: A Simple, Robust and Adaptive Approach for Distributed Networks

Author: Binder Patricia
Muma Michael
Zoubir Abdelhak M.
Publication venue
Publication date: 31/08/2017
Field of study

Distributed signal processing for wireless sensor networks enables that different devices cooperate to solve different signal processing tasks. A crucial first step is to answer the question: who observes what? Recently, several distributed algorithms have been proposed, which frame the signal/object labelling problem in terms of cluster analysis after extracting source-specific features, however, the number of clusters is assumed to be known. We propose a new method called Gravitational Clustering (GC) to adaptively estimate the time-varying number of clusters based on a set of feature vectors. The key idea is to exploit the physical principle of gravitational force between mass units: streaming-in feature vectors are considered as mass units of fixed position in the feature space, around which mobile mass units are injected at each time instant. The cluster enumeration exploits the fact that the highest attraction on the mobile mass units is exerted by regions with a high density of feature vectors, i.e., gravitational clusters. By sharing estimates among neighboring nodes via a diffusion-adaptation scheme, cooperative and distributed cluster enumeration is achieved. Numerical experiments concerning robustness against outliers, convergence and computational complexity are conducted. The application in a distributed cooperative multi-view camera network illustrates the applicability to real-world problems.Comment: 12 pages, 9 figure

arXiv.org e-Print Archive

TUbiblio

Bayesian Cluster Enumeration Criterion for Unsupervised Learning

Author: Muma Michael
Teklehaymanot Freweyni K.
Zoubir Abdelhak M.
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 27/08/2018
Field of study

We derive a new Bayesian Information Criterion (BIC) by formulating the problem of estimating the number of clusters in an observed data set as maximization of the posterior probability of the candidate models. Given that some mild assumptions are satisfied, we provide a general BIC expression for a broad class of data distributions. This serves as a starting point when deriving the BIC for specific distributions. Along this line, we provide a closed-form BIC expression for multivariate Gaussian distributed variables. We show that incorporating the data structure of the clustering problem into the derivation of the BIC results in an expression whose penalty term is different from that of the original BIC. We propose a two-step cluster enumeration algorithm. First, a model-based unsupervised learning algorithm partitions the data according to a given set of candidate models. Subsequently, the number of clusters is determined as the one associated with the model for which the proposed BIC is maximal. The performance of the proposed two-step algorithm is tested using synthetic and real data sets.Comment: 14 pages, 7 figure

arXiv.org e-Print Archive

TUbiblio

EC1541 D D T for Lice

Author: Muma M. H.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/04/1946
Field of study

Extension circular 1541 discusses D D T for lice

DigitalCommons@University of Nebraska

EC1541 D D T for Lice

Author: Muma M. H.
Publication venue: DigitalCommons@University of Nebraska - Lincoln
Publication date: 01/04/1946
Field of study

Extension circular 1541 discusses D D T for lice

DigitalCommons@University of Nebraska

Foot and mouth disease in Zambia: Spatial and temporal distributions of outbreaks, assessment of clusters and implications for control

Author: Kasanga C J
Muma J B
Mweene A
Pfeiffer D U
Simuunza M
Sinkala Y
Publication venue: 'AOSIS'
Publication date: 01/04/2014
Field of study

Zambia has been experiencing low livestock productivity as well as trade restrictions owing to the occurrence of foot and mouth disease (FMD), but little is known about the epidemiology of the disease in these endemic settings. The fundamental questions relate to the spatio-temporal distribution of FMD cases and what determines their occurrence. A retrospective review of FMD cases in Zambia from 1981 to 2012 was conducted using geographical information systems and the SaTScan software package. Information was collected from peer-reviewed journal articles, conference proceedings, laboratory reports, unpublished scientific reports and grey literature. A space–time permutation probability model using a varying time window of one year was used to scan for areas with high infection rates. The spatial scan statistic detected a significant purely spatial cluster around the Mbala–Isoka area between 2009 and 2012, with secondary clusters in Sesheke–Kazungula in 2007 and 2008, the Kafue flats in 2004 and 2005 and Livingstone in 2012. This study provides evidence of the existence of statistically significant FMD clusters and an increase in occurrence in Zambia between 2004 and 2012. The identified clusters agree with areas known to be at high risk of FMD. The FMD virus transmission dynamics and the heterogeneous variability in risk within these locations may need further investigation

Crossref

Directory of Open Access Journals

Quantitative Risk Assessment of Developing Salmonellosis through Consumption of Beef in Lusaka Province, Zambia

Author: Bwanga E K
Häsler B
Manyori C I
Muma J B
Mumba C
Munyeme M
Mwale M M
Rich K M
Skjerve E
Publication venue: 'Elsevier BV'
Publication date: 01/03/2017
Field of study

Based on the Codex Alimentarious framework, this study quantitatively assessed the risk of developing salmonellosis through consumption of beef in Lusaka Province of Zambia. Data used to achieve this objective were obtained from reviews of scientific literature, Government reports, and survey results from a questionnaire that was administered to consumers to address information gaps from secondary data. The Swift Quantitative Microbiological Risk Assessment (sQMRA) model was used to analyse the data. The study was driven by a lack of empircally-based risk estimation despite a number of reported cases of salmonellosis in humans. A typology of consumers including all age groups was developed based on their beef consumption habits, distinguishing between those with low home consumption, those with medium levels of home consumption, and those with high levels through restaurant consumption. This study shows that the risk of developing salmonellosis in this population, from consuming beef, was generally low. At ID50 of 9.61 × 103 cfu/g and a retail contamination concentration of 12 cfu/g, the risk of developing salmonellosis through the consumption of beef prepared by consumers with low and medium levels of beef consumption was estimated at 0.06% and 0.08%, respectively, while the risk associated with restaurant consumption was estimated at 0.16% per year. The study concludes that the risk of developing salmonellosis among residents in Lusaka province, as a result of beef consumption, was generally low, mainly due to the methods used for food preparation. Further work is required to broaden the scope of the study and also undertake microbiological evaluation of ready-to-eat beef from both the household and restaurant risk exposure pathways

CGSpace

Robust and Efficient Aggregation for Distributed Learning

Author: Muma Michael
Schroth Christian
Vlaski Stefan
Zoubir Abdelhak M.
Publication venue
Publication date: 01/04/2022
Field of study

Distributed learning paradigms, such as federated and decentralized learning, allow for the coordination of models across a collection of agents, and without the need to exchange raw data. Instead, agents compute model updates locally based on their available data, and subsequently share the update model with a parameter server or their peers. This is followed by an aggregation step, which traditionally takes the form of a (weighted) average. Distributed learning schemes based on averaging are known to be susceptible to outliers. A single malicious agent is able to drive an averaging-based distributed learning algorithm to an arbitrarily poor model. This has motivated the development of robust aggregation schemes, which are based on variations of the median and trimmed mean. While such procedures ensure robustness to outliers and malicious behavior, they come at the cost of significantly reduced sample efficiency. This means that current robust aggregation schemes require significantly higher agent participation rates to achieve a given level of performance than their mean-based counterparts in non-contaminated settings. In this work we remedy this drawback by developing statistically efficient and robust aggregation schemes for distributed learning

arXiv.org e-Print Archive

TUbiblio

Spiral - Imperial College Digital Repository