209 research outputs found
Connecting the time domain community with the Virtual Astronomical Observatory
The time domain has been identified as one of the most important areas of
astronomical research for the next decade. The Virtual Observatory is in the
vanguard with dedicated tools and services that enable and facilitate the
discovery, dissemination and analysis of time domain data. These range in scope
from rapid notifications of time-critical astronomical transients to annotating
long-term variables with the latest modeling results. In this paper, we will
review the prior art in these areas and focus on the capabilities that the VAO
is bringing to bear in support of time domain science. In particular, we will
focus on the issues involved with the heterogeneous collections of (ancillary)
data associated with astronomical transients, and the time series
characterization and classification tools required by the next generation of
sky surveys, such as LSST and SKA.Comment: Submitted to Proceedings of SPIE Observatory Operations: Strategies,
Processes and Systems IV, Amsterdam, 2012 July 2-
A recurrent neural network for classification of unevenly sampled variable stars
Astronomical surveys of celestial sources produce streams of noisy time
series measuring flux versus time ("light curves"). Unlike in many other
physical domains, however, large (and source-specific) temporal gaps in data
arise naturally due to intranight cadence choices as well as diurnal and
seasonal constraints. With nightly observations of millions of variable stars
and transients from upcoming surveys, efficient and accurate discovery and
classification techniques on noisy, irregularly sampled data must be employed
with minimal human-in-the-loop involvement. Machine learning for inference
tasks on such data traditionally requires the laborious hand-coding of
domain-specific numerical summaries of raw data ("features"). Here we present a
novel unsupervised autoencoding recurrent neural network (RNN) that makes
explicit use of sampling times and known heteroskedastic noise properties. When
trained on optical variable star catalogs, this network produces supervised
classification models that rival other best-in-class approaches. We find that
autoencoded features learned on one time-domain survey perform nearly as well
when applied to another survey. These networks can continue to learn from new
unlabeled observations and may be used in other unsupervised tasks such as
forecasting and anomaly detection.Comment: 23 pages, 14 figures. The published version is at Nature Astronomy
(https://www.nature.com/articles/s41550-017-0321-z). Source code for models,
experiments, and figures at
https://github.com/bnaul/IrregularTimeSeriesAutoencoderPaper (Zenodo Code
DOI: 10.5281/zenodo.1045560
From Data to Software to Science with the Rubin Observatory LSST
The Vera C. Rubin Observatory Legacy Survey of Space and Time (LSST) dataset
will dramatically alter our understanding of the Universe, from the origins of
the Solar System to the nature of dark matter and dark energy. Much of this
research will depend on the existence of robust, tested, and scalable
algorithms, software, and services. Identifying and developing such tools ahead
of time has the potential to significantly accelerate the delivery of early
science from LSST. Developing these collaboratively, and making them broadly
available, can enable more inclusive and equitable collaboration on LSST
science.
To facilitate such opportunities, a community workshop entitled "From Data to
Software to Science with the Rubin Observatory LSST" was organized by the LSST
Interdisciplinary Network for Collaboration and Computing (LINCC) and partners,
and held at the Flatiron Institute in New York, March 28-30th 2022. The
workshop included over 50 in-person attendees invited from over 300
applications. It identified seven key software areas of need: (i) scalable
cross-matching and distributed joining of catalogs, (ii) robust photometric
redshift determination, (iii) software for determination of selection
functions, (iv) frameworks for scalable time-series analyses, (v) services for
image access and reprocessing at scale, (vi) object image access (cutouts) and
analysis at scale, and (vii) scalable job execution systems.
This white paper summarizes the discussions of this workshop. It considers
the motivating science use cases, identified cross-cutting algorithms,
software, and services, their high-level technical specifications, and the
principles of inclusive collaborations needed to develop them. We provide it as
a useful roadmap of needs, as well as to spur action and collaboration between
groups and individuals looking to develop reusable software for early LSST
science.Comment: White paper from "From Data to Software to Science with the Rubin
Observatory LSST" worksho
Exploring the Variable Sky with LINEAR. III. Classification of Periodic Light Curves
We describe the construction of a highly reliable sample of ~7000 optically faint periodic variable stars with light curves obtained by the asteroid survey LINEAR across 10,000 deg^2 of the northern sky. The majority of these variables have not been cataloged yet. The sample flux limit is several magnitudes fainter than most other wide-angle surveys; the photometric errors range from ~0.03 mag at r = 15 to ~0.20 mag at r = 18. Light curves include on average 250 data points, collected over about a decade. Using Sloan Digital Sky Survey (SDSS) based photometric recalibration of the LINEAR data for about 25 million objects, we selected ~200,000 most probable candidate variables with r < 17 and visually confirmed and classified ~7000 periodic variables using phased light curves. The reliability and uniformity of visual classification across eight human classifiers was calibrated and tested using a catalog of variable stars from the SDSS Stripe 82 region and verified using an unsupervised machine learning approach. The resulting sample of periodic LINEAR variables is dominated by 3900 RR Lyrae stars and 2700 eclipsing binary stars of all subtypes and includes small fractions of relatively rare populations such as asymptotic giant branch stars and SX Phoenicis stars. We discuss the distribution of these mostly uncataloged variables in various diagrams constructed with optical-to-infrared SDSS, Two Micron All Sky Survey, and Wide-field Infrared Survey Explorer photometry, and with LINEAR light-curve features. We find that the combination of light-curve features and colors enables classification schemes much more powerful than when colors or light curves are each used separately. An interesting side result is a robust and precise quantitative description of a strong correlation between the light-curve period and color/spectral type for close and contact eclipsing binary stars (β Lyrae and W UMa): as the color-based spectral type varies from K4 to F5, the median period increases from 5.9 hr to 8.8 hr. These large samples of robustly classified variable stars will enable detailed statistical studies of the Galactic structure and physics of binary and other stars and we make these samples publicly available
An Automated tool to detect variable sources in the Vista Variables in the Vía Láctea Survey. The VVV Variables (V^4) catalog of tiles d001 and d002
27 pages, 19 figuresTime-varying phenomena are one of the most substantial sources of astrophysical information, and their study has led to many fundamental discoveries in modern astronomy. We have developed an automated tool to search for and analyze variable sources in the near-infrared K s band using the data from the VISTA Variables in the Vía Láctea (VVV) ESO Public Large Survey. This process relies on the characterization of variable sources using different variability indices calculated from time series generated with point-spread function (PSF) photometry of sources under analysis. In particular, we used two main indices, the total amplitude and the eta index η, to identify variable sources. Once the variable objects are identified, periods are determined with generalized Lomb-Scargle periodograms and the information potential metric. Variability classes are assigned according to a compromise between comparisons with VVV templates and the period of the variability. The automated tool is applied on VVV tiles d001 and d002 and led to the discovery of 200 variable sources. We detected 70 irregular variable sources and 130 periodic ones. In addition, nine open-cluster candidates projected in the region are analyzed, and the infrared variable candidates found around these clusters are further scrutinized by cross-matching their locations against emission star candidates from VPHAS+ survey H α color cuts.Peer reviewedFinal Accepted Versio
Massive Datasets in Astronomy
Astronomy has a long history of acquiring, systematizing, and interpreting
large quantities of data. Starting from the earliest sky atlases through the
first major photographic sky surveys of the 20th century, this tradition is
continuing today, and at an ever increasing rate.
Like many other fields, astronomy has become a very data-rich science, driven
by the advances in telescope, detector, and computer technology. Numerous large
digital sky surveys and archives already exist, with information content
measured in multiple Terabytes, and even larger, multi-Petabyte data sets are
on the horizon. Systematic observations of the sky, over a range of
wavelengths, are becoming the primary source of astronomical data. Numerical
simulations are also producing comparable volumes of information. Data mining
promises to both make the scientific utilization of these data sets more
effective and more complete, and to open completely new avenues of astronomical
research.
Technological problems range from the issues of database design and
federation, to data mining and advanced visualization, leading to a new toolkit
for astronomical research. This is similar to challenges encountered in other
data-intensive fields today.
These advances are now being organized through a concept of the Virtual
Observatories, federations of data archives and services representing a new
information infrastructure for astronomy of the 21st century. In this article,
we provide an overview of some of the major datasets in astronomy, discuss
different techniques used for archiving data, and conclude with a discussion of
the future of massive datasets in astronomy.Comment: 46 Pages, 21 Figures, Invited Review for the Handbook of Massive
Datasets, editors J. Abello, P. Pardalos, and M. Resende. Due to space
limitations this version has low resolution figures. For full resolution
review see http://www.astro.caltech.edu/~rb/publications/hmds.ps.g
- …