20,885 research outputs found
Soft clustering analysis of galaxy morphologies: A worked example with SDSS
Context: The huge and still rapidly growing amount of galaxies in modern sky
surveys raises the need of an automated and objective classification method.
Unsupervised learning algorithms are of particular interest, since they
discover classes automatically. Aims: We briefly discuss the pitfalls of
oversimplified classification methods and outline an alternative approach
called "clustering analysis". Methods: We categorise different classification
methods according to their capabilities. Based on this categorisation, we
present a probabilistic classification algorithm that automatically detects the
optimal classes preferred by the data. We explore the reliability of this
algorithm in systematic tests. Using a small sample of bright galaxies from the
SDSS, we demonstrate the performance of this algorithm in practice. We are able
to disentangle the problems of classification and parametrisation of galaxy
morphologies in this case. Results: We give physical arguments that a
probabilistic classification scheme is necessary. The algorithm we present
produces reasonable morphological classes and object-to-class assignments
without any prior assumptions. Conclusions: There are sophisticated automated
classification algorithms that meet all necessary requirements, but a lot of
work is still needed on the interpretation of the results.Comment: 18 pages, 19 figures, 2 tables, submitted to A
Assessing the Number of Components in Mixture Models: a Review.
Despite the widespread application of finite mixture models, the decision of how many classes are required to adequately represent the data is, according to many authors, an important, but unsolved issue. This work aims to review, describe and organize the available approaches designed to help the selection of the adequate number of mixture components (including Monte Carlo test procedures, information criteria and classification-based criteria); we also provide some published simulation results about their relative performance, with the purpose of identifying the scenarios where each criterion is more effective (adequate).Finite mixture; number of mixture components; information criteria; simulation studies.
ASPECT: A spectra clustering tool for exploration of large spectral surveys
We present the novel, semi-automated clustering tool ASPECT for analysing
voluminous archives of spectra. The heart of the program is a neural network in
form of Kohonen's self-organizing map. The resulting map is designed as an icon
map suitable for the inspection by eye. The visual analysis is supported by the
option to blend in individual object properties such as redshift, apparent
magnitude, or signal-to-noise ratio. In addition, the package provides several
tools for the selection of special spectral types, e.g. local difference maps
which reflect the deviations of all spectra from one given input spectrum (real
or artificial). ASPECT is able to produce a two-dimensional topological map of
a huge number of spectra. The software package enables the user to browse and
navigate through a huge data pool and helps him to gain an insight into
underlying relationships between the spectra and other physical properties and
to get the big picture of the entire data set. We demonstrate the capability of
ASPECT by clustering the entire data pool of 0.6 million spectra from the Data
Release 4 of the Sloan Digital Sky Survey (SDSS). To illustrate the results
regarding quality and completeness we track objects from existing catalogues of
quasars and carbon stars, respectively, and connect the SDSS spectra with
morphological information from the GalaxyZoo project.Comment: 15 pages, 14 figures; accepted for publication in Astronomy and
Astrophysic
- âŠ