An Active Instance-based Machine Learning method for Stellar Population Studies
We have developed a method for the fast and accurate determination of stellar
population parameters, intended for application to high resolution galaxy
spectra. The method is based on an optimization technique that combines active
learning with an instance-based machine learning algorithm. We tested the
method with the retrieval of the star-formation history and dust content in
"synthetic" galaxies with a wide range of S/N ratios. The "synthetic" galaxies
were constructed using two different grids of high resolution theoretical
population synthesis models. The results of our controlled experiment show
that our method can estimate with good speed and accuracy the parameters of the
stellar populations that make up the galaxy even for very low S/N input. For a
spectrum with S/N=5 the typical average deviation between the input and fitted
spectrum is less than 10^{-5}. Additional improvements are achieved using
prior knowledge. Comment: 14 pages, 25 figures, accepted by Monthly Notice
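The instance-based part of such a method can be illustrated with a nearest-neighbour lookup in a library of model spectra. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function name `knn_estimate`, the toy three-pixel spectra, and the parameter pairs (age, dust) are all hypothetical, and the active-learning step that refines the library is omitted.

```python
import math


def knn_estimate(library, observed, k=3):
    """Average the parameters of the k library spectra closest to `observed`.

    `library` is a list of (spectrum, params) pairs (hypothetical layout).
    """
    def distance(a, b):
        # Euclidean distance between two spectra sampled on the same pixels.
        return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

    nearest = sorted(library, key=lambda item: distance(item[0], observed))[:k]
    n_params = len(nearest[0][1])
    return [sum(p[i] for _, p in nearest) / len(nearest) for i in range(n_params)]


# Toy library: three-pixel "spectra" labelled with (age in Gyr, dust extinction).
library = [
    ([1.0, 0.9, 0.8], (1.0, 0.1)),
    ([1.0, 0.8, 0.6], (3.0, 0.3)),
    ([1.0, 0.7, 0.4], (5.0, 0.5)),
    ([1.0, 0.6, 0.2], (9.0, 0.7)),
]

# A noisy observation close to the second library entry.
estimate = knn_estimate(library, observed=[1.0, 0.79, 0.61], k=1)
```

With `k=1` the estimate is simply the parameter pair of the closest library spectrum; larger `k` trades resolution in parameter space for robustness to noise.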
Discovery of Bright Galactic R Coronae Borealis and DY Persei Variables: Rare Gems Mined from ACVS
We present the results of a machine-learning (ML) based search for new R
Coronae Borealis (RCB) stars and DY Persei-like stars (DYPers) in the Galaxy
using cataloged light curves from the All-Sky Automated Survey (ASAS) Catalog
of Variable Stars (ACVS). RCB stars - a rare class of hydrogen-deficient
carbon-rich supergiants - are of great interest owing to the insights they can
provide on the late stages of stellar evolution. DYPers are possibly the
low-temperature, low-luminosity analogs to the RCB phenomenon, though
additional examples are needed to fully establish this connection. While RCB
stars and DYPers are traditionally identified by epochs of extreme dimming that
occur without regularity, the ML search framework more fully captures the
richness and diversity of their photometric behavior. We demonstrate that our
ML method can use newly discovered RCB stars to identify additional candidates
within the same data set. Our search yields 15 candidates that we consider
likely RCB stars/DYPers: new spectroscopic observations confirm that four of
these candidates are RCB stars and four are DYPers. Our discovery of four new
DYPers increases the number of known Galactic DYPers from two to six;
noteworthy is that one of the new DYPers has a measured parallax and is m ~ 7
mag, making it the brightest known DYPer to date. Future observations of these
new DYPers should prove instrumental in establishing the RCB connection. We
consider these results, derived from a machine-learned probabilistic
classification catalog, as an important proof-of-concept for the efficient
discovery of rare sources with time-domain surveys. Comment: 18 pages, 2 new figures, accepted for publication in Ap
Multivariate Approaches to Classification in Extragalactic Astronomy
Clustering objects into synthetic groups is a natural activity of any
science. Astrophysics is not an exception and is now facing a deluge of data.
For galaxies, the century-old Hubble classification and the Hubble tuning
fork are still largely in use, together with numerous mono- or bivariate
classifications most often made by eye. However, a classification must be
driven by the data, and sophisticated multivariate statistical tools are used
more and more often. In this paper we review these different approaches in
order to situate them in the general context of unsupervised and supervised
learning. We emphasize the astrophysical outcomes of these studies to show that
multivariate analyses provide an obvious path toward a renewal of our
classification of galaxies and are invaluable tools to investigate the physics
and evolution of galaxies. Comment: Open Access paper.
http://www.frontiersin.org/milky_way_and_galaxies/10.3389/fspas.2015.00003/abstract
<10.3389/fspas.2015.00003>
Sparsity and adaptivity for the blind separation of partially correlated sources
Blind source separation (BSS) is a very popular technique to analyze
multichannel data. In this context, the data are modeled as the linear
combination of sources to be retrieved. For that purpose, standard BSS methods
all rely on some discrimination principle, whether it is statistical
independence or morphological diversity, to distinguish between the sources.
However, dealing with real-world data reveals that such assumptions are rarely
valid in practice: the signals of interest are more likely partially
correlated, which generally hampers the performance of standard BSS methods.
In this article, we introduce a novel sparsity-enforcing BSS method coined
Adaptive Morphological Component Analysis (AMCA), which is designed to retrieve
sparse and partially correlated sources. More precisely, it takes advantage of an
adaptive re-weighting scheme to favor/penalize samples based on their level of
correlation. Extensive numerical experiments have been carried out which show
that the proposed method is robust to the partial correlation of sources while
standard BSS techniques fail. The AMCA algorithm is evaluated in the field of
astrophysics for the separation of physical components from microwave data. Comment: submitted to IEEE Transactions on signal processin
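The linear mixture model that the abstract refers to is commonly written as below. The notation is an assumption made for illustration (the abstract itself gives no symbols), and the additive noise term is likewise an assumed, standard addition rather than something stated in the source.

```latex
% m channels, n sources, t samples (assumed notation)
\mathbf{X} = \mathbf{A}\,\mathbf{S} + \mathbf{N},
\qquad
\mathbf{X} \in \mathbb{R}^{m \times t},\quad
\mathbf{A} \in \mathbb{R}^{m \times n},\quad
\mathbf{S} \in \mathbb{R}^{n \times t}
```

Here each row of $\mathbf{X}$ is one observed channel, each row of $\mathbf{S}$ is one source, and $\mathbf{A}$ is the unknown mixing matrix; blind separation means estimating both $\mathbf{A}$ and $\mathbf{S}$ from $\mathbf{X}$ alone.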
ASPECT: A spectra clustering tool for exploration of large spectral surveys
We present the novel, semi-automated clustering tool ASPECT for analysing
voluminous archives of spectra. The heart of the program is a neural network in
the form of Kohonen's self-organizing map. The resulting map is designed as an icon
map suitable for inspection by eye. The visual analysis is supported by the
option to blend in individual object properties such as redshift, apparent
magnitude, or signal-to-noise ratio. In addition, the package provides several
tools for the selection of special spectral types, e.g. local difference maps
which reflect the deviations of all spectra from one given input spectrum (real
or artificial). ASPECT is able to produce a two-dimensional topological map of
a huge number of spectra. The software package enables the user to browse and
navigate through a huge data pool and helps them to gain an insight into
underlying relationships between the spectra and other physical properties and
to get the big picture of the entire data set. We demonstrate the capability of
ASPECT by clustering the entire data pool of 0.6 million spectra from the Data
Release 4 of the Sloan Digital Sky Survey (SDSS). To illustrate the results
regarding quality and completeness we track objects from existing catalogues of
quasars and carbon stars, respectively, and connect the SDSS spectra with
morphological information from the GalaxyZoo project. Comment: 15 pages, 14 figures; accepted for publication in Astronomy and
Astrophysic
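For readers unfamiliar with Kohonen's self-organizing map, the training loop can be sketched in a few lines. This is a toy illustration under stated assumptions, not the ASPECT code: it uses a one-dimensional chain of nodes rather than a two-dimensional map, and the function names, decay schedules, and default parameters are all hypothetical choices.

```python
import math
import random


def best_matching_unit(weights, vec):
    """Index of the node whose weight vector is closest to `vec`."""
    return min(
        range(len(weights)),
        key=lambda j: sum((w - v) ** 2 for w, v in zip(weights[j], vec)),
    )


def train_som(data, n_nodes=4, epochs=50, lr0=0.5, sigma0=1.5, seed=0):
    """Train a 1-D self-organizing map and return the node weight vectors."""
    rng = random.Random(seed)
    dim = len(data[0])
    weights = [[rng.random() for _ in range(dim)] for _ in range(n_nodes)]
    for epoch in range(epochs):
        frac = epoch / epochs
        lr = lr0 * (1.0 - frac)                  # learning rate decays linearly
        sigma = max(sigma0 * (1.0 - frac), 0.3)  # neighbourhood width shrinks
        for vec in data:
            bmu = best_matching_unit(weights, vec)
            for j, w in enumerate(weights):
                # Gaussian neighbourhood: nodes near the BMU on the chain move more.
                h = math.exp(-((j - bmu) ** 2) / (2.0 * sigma ** 2))
                for d in range(dim):
                    w[d] += lr * h * (vec[d] - w[d])
    return weights


# Two well-separated toy clusters standing in for two spectral types.
data = [[0.0, 0.0], [0.1, 0.0], [0.0, 0.1],
        [1.0, 1.0], [0.9, 1.0], [1.0, 0.9]]
weights = train_som(data)
node_a = best_matching_unit(weights, [0.0, 0.0])
node_b = best_matching_unit(weights, [1.0, 1.0])
```

After training, inputs from the two clusters map to different nodes (`node_a` and `node_b`), which is the property that lets a map of this kind group similar spectra into neighbouring cells.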
Data Mining and Machine Learning in Astronomy
We review the current state of data mining and machine learning in astronomy.
'Data Mining' can have a somewhat mixed connotation from the point of view of a
researcher in this field. If used correctly, it can be a powerful approach,
holding the potential to fully exploit the exponentially increasing amount of
available data, promising great scientific advance. However, if misused, it can
be little more than the black-box application of complex computing algorithms
that may give little physical insight, and provide questionable results. Here,
we give an overview of the entire data mining process, from data collection
through to the interpretation of results. We cover common machine learning
algorithms, such as artificial neural networks and support vector machines,
applications from a broad range of astronomy, emphasizing those where data
mining techniques directly resulted in improved science, and important current
and future directions, including probability density functions, parallel
algorithms, petascale computing, and the time domain. We conclude that, so long
as one carefully selects an appropriate algorithm, and is guided by the
astronomical problem at hand, data mining can be very much the powerful tool,
and not the questionable black box. Comment: Published in IJMPD. 61 pages, uses ws-ijmpd.cls. Several extra
figures, some minor additions to the tex