Search CORE

127,012 research outputs found

A review of clustering techniques and developments

Author: Bharill N
Ding W
Er MJ
Gupta A
Lin CT
Patel OP
Prasad M
Saxena A
Tiwari A
Publication venue: 'Elsevier BV'
Publication date: 06/12/2017
Field of study

© 2017 Elsevier B.V. This paper presents a comprehensive study on clustering: exiting methods and developments made at various times. Clustering is defined as an unsupervised learning where the objects are grouped on the basis of some similarity inherent among them. There are different methods for clustering the objects such as hierarchical, partitional, grid, density based and model based. The approaches used in these methods are discussed with their respective states of art and applicability. The measures of similarity as well as the evaluation criteria, which are the central components of clustering, are also presented in the paper. The applications of clustering in some fields like image segmentation, object and character recognition and data mining are highlighted

OPUS - University of Technology Sydney

The structure and function of complex networks

Author: Ahuja Ravindra
Andersson Håkan
Bailey Norman
Békéssy A.
Corman S. R.
Dodds P. S.
Du Dingzhu
Erdös P.
Erdös P.
Everitt Brian
Freeman L. C.
Garfield E.
Harary Frank
Kuperman M.
M. E. J. Newman
Mariolis P.
Milgram S.
Moreno Y.
Morris M.
Newman M.
Ripeanu M.
Schwartz N.
Snijders T. A. B.
Watts Duncan
Publication venue: 'Society for Industrial & Applied Mathematics (SIAM)'
Publication date: 01/01/2003
Field of study

Inspired by empirical studies of networked systems such as the Internet, social networks, and biological networks, researchers have in recent years developed a variety of techniques and models to help us understand or predict the behavior of these systems. Here we review developments in this field, including such concepts as the small-world effect, degree distributions, clustering, network correlations, random graph models, models of network growth and preferential attachment, and dynamical processes taking place on networks.Comment: Review article, 58 pages, 16 figures, 3 tables, 429 references, published in SIAM Review (2003

arXiv.org e-Print Archive

CiteSeerX

Crossref

Methods of Hierarchical Clustering

Author: Contreras Pedro
Murtagh Fionn
Publication venue
Publication date: 01/01/2011
Field of study

We survey agglomerative hierarchical clustering algorithms and discuss efficient implementations that are available in R and other software environments. We look at hierarchical self-organizing maps, and mixture models. We review grid-based clustering, focusing on hierarchical density-based approaches. Finally we describe a recently developed very efficient (linear time) hierarchical clustering algorithm, which can also be viewed as a hierarchical grid-based algorithm.Comment: 21 pages, 2 figures, 1 table, 69 reference

arXiv.org e-Print Archive

Royal Holloway Research Online

Royal Holloway - Pure

Big Data and Reliability Applications: The Complexity Dimension

Author: Hong Yili
Meeker William
Meeker William
Zhang Man
Publication venue: 'Informa UK Limited'
Publication date: 01/01/2018
Field of study

Big data features not only large volumes of data but also data with complicated structures. Complexity imposes unique challenges in big data analytics. Meeker and Hong (2014, Quality Engineering, pp. 102-116) provided an extensive discussion of the opportunities and challenges in big data and reliability, and described engineering systems that can generate big data that can be used in reliability analysis. Meeker and Hong (2014) focused on large scale system operating and environment data (i.e., high-frequency multivariate time series data), and provided examples on how to link such data as covariates to traditional reliability responses such as time to failure, time to recurrence of events, and degradation measurements. This paper intends to extend that discussion by focusing on how to use data with complicated structures to do reliability analysis. Such data types include high-dimensional sensor data, functional curve data, and image streams. We first provide a review of recent development in those directions, and then we provide a discussion on how analytical methods can be developed to tackle the challenging aspects that arise from the complexity feature of big data in reliability applications. The use of modern statistical methods such as variable selection, functional data analysis, scalar-on-image regression, spatio-temporal data models, and machine learning techniques will also be discussed.Comment: 28 pages, 7 figure

arXiv.org e-Print Archive

Digital Repository @ Iowa State University (ISU)

Crossref

The Application of Cluster Analysis Techniques in Management

Author: Calvard Tom
Publication venue
Publication date: 01/01/2012
Field of study

Edinburgh Research Explorer