156 research outputs found
Effect of training characteristics on object classification: an application using Boosted Decision Trees
We present an application of a particular machine-learning method (Boosted
Decision Trees, BDTs using AdaBoost) to separate stars and galaxies in
photometric images using their catalog characteristics. BDTs are a well
established machine learning technique used for classification purposes. They
have been widely used specially in the field of particle and astroparticle
physics, and we use them here in an optical astronomy application. This
algorithm is able to improve from simple thresholding cuts on standard
separation variables that may be affected by local effects such as blending,
badly calculated background levels or which do not include information in other
bands. The improvements are shown using the Sloan Digital Sky Survey Data
Release 9, with respect to the type photometric classifier. We obtain an
improvement in the impurity of the galaxy sample of a factor 2-4 for this
particular dataset, adjusting for the same efficiency of the selection. Another
main goal of this study is to verify the effects that different input vectors
and training sets have on the classification performance, the results being of
wider use to other machine learning techniques.Comment: Accepted for publication in Astronomy & Computin
Star–galaxy classification in the Dark Energy Survey Y1 data set
We perform a comparison of different approaches to star–galaxy classification using the broadband photometric data from Year 1 of the Dark Energy Survey. This is done by performing a wide range of tests with and without external ‘truth’ information, which can be ported to other similar data sets. We make a broad evaluation of the performance of the classifiers in two science cases with DES data that are most affected by this systematic effect: large-scale structure and MilkyWay studies. In general, even though the default morphological classifiers used for DES Y1 cosmology studies are sufficient to maintain a low level of systematic contamination from stellarmisclassification, contamination can be reduced to theO(1 per cent) level by using multi-epoch and infrared information from external data sets. For Milky Way studies, the stellar sample can be augmented by ~20 per cent for a given flux limit
Clasificaci??n de objetos cosmol??gicos usando Redes Neuronales Convolucionales
En los ??ltimos 20 a??os, la tecnolog??a de detectores y procesamiento de datos ha permitido que hoy en d??a, los astr??nomos dispongan de una inmensa cantidad de datos, tanto de objetos particulares, como de amplias ??reas del cielo. Durante los primeros a??os de esta revoluci??n, el post-procesado de los datos, como en el caso de la clasificaci??n de objetos, era realizado
manualmente por los cient??ficos. Hoy en d??a, los instrumentos modernos nos permiten obtener fotometr??a de miles de objetos cada noche en todo el mundo. Para poder analizar toda esta informaci??n de manera eficiente, hay que crear sistemas automatizados de clasificaci??n. El objetivo de este
trabajo consiste en desarrollar un sistema para poder analizar la enorme cantidad de datos generada por los nuevos sistemas automatizados. Para ello, utilizaremos machine learning (aprendizaje autom??tico) para analizar los espectros que tomamos usando fotometr??a con filtros estrechos y poder separar entre galaxias, estrellas y cu??sares de una forma r??pida, eficaz y fiable
Optimized clustering estimators for BAO measurements accounting for significant redshift uncertainty
We determine an optimized clustering statistic to be used for galaxy samples with significant redshift uncertainty, such as those that rely on photometric redshifts. To do so, we study the baryon acoustic oscillation (BAO) information content as a function of the orientation of galaxy clustering modes with respect to their angle to the line of sight (LOS). The clustering along the LOS, as observed in a redshift-space with significant redshift uncertainty, has contributions from clustering modes with a range of orientations with respect to the true LOS. For redshift uncertainty σz ≥ 0.02(1 + z), we find that while the BAO information is confined to transverse clustering modes in the true space, it is spread nearly evenly in the observed space. Thus, measuring clustering in terms of the projected separation (regardless of the LOS) is an efficient and nearly lossless compression of the signal for σz ≥ 0.02(1 + z). For reduced redshift uncertainty, a more careful consideration is required. We then use more than 1700 realizations (combining two separate sets) of galaxy simulations mimicking the Dark Energy Survey Year 1 (DES Y1) sample to validate our analytic results and optimized analysis procedure. We find that using the correlation function binned in projected separation, we can achieve uncertainties that are within 10 per cent of those predicted by Fisher matrix forecasts. We predict that DES Y1 should achieve a 5 per cent distance measurement using our optimized methods. We expect the results presented here to be important for any future BAO measurements made using photometric redshift data.Please visit publisher's website for further information
Host galaxy identification for supernova surveys
Host galaxy identification is a crucial step for modern supernova (SN) surveys such as the Dark Energy Survey (DES) and the Large Synoptic Survey Telescope (LSST), which will discover SNe by the thousands. Spectroscopic resources are limited, so in the absence of real-time SN spectra these surveys must rely on host galaxy spectra to obtain accurate redshifts for the Hubble diagram and to improve photometric classification of SNe. In addition, SN luminosities are known to correlate with host-galaxy properties. Therefore, reliable identification of host galaxies is essential for cosmology and SN science. We simulate SN events and their locations within their host galaxies to develop and test methods for matching SNe to their hosts. We use both real and simulated galaxy catalog data from the Advanced Camera for Surveys General Catalog and MICECATv2.0, respectively. We also incorporate "hostless" SNe residing in undetected faint hosts into our analysis, with an assumed hostless rate of 5%. Our fully automated algorithm is run on catalog data and matches SNe to their hosts with 91% accuracy. We find that including a machine learning component, run after the initial matching algorithm, improves the accuracy (purity) of the matching to 97% with a 2% cost in efficiency (true positive rate). Although the exact results are dependent on the details of the survey and the galaxy catalogs used, the method of identifying host galaxies we outline here can be applied to any transient survey
A catalogue of structural and morphological measurements for DES Y1
We present a structural and morphological catalogue for 45 million objects selected from the first year data of the Dark Energy Survey (DES). Single Sersic fits and non-parametric ´ measurements are produced for g, r, and i filters. The parameters from the best-fitting Sersic ´ model (total magnitude, half-light radius, Sersic index, axis ratio, and position angle) are mea- ´ sured with GALFIT; the non-parametric coefficients (concentration, asymmetry, clumpiness, Gini, M20) are provided using the Zurich Estimator of Structural Types (ZEST+). To study the statistical uncertainties, we consider a sample of state-of-the-art image simulations with a realistic distribution in the input parameter space and then process and analyse them as we do with real data: this enables us to quantify the observational biases due to PSF blurring and magnitude effects and correct the measurements as a function of magnitude, galaxy size, Sersic ´ index (concentration for the analysis of the non-parametric measurements) and ellipticity. We present the largest structural catalogue to date: we find that accurate and complete measurements for all the structural parameters are typically obtained for galaxies with SEXTRACTOR MAG AUTO I ≤ 21. Indeed, the parameters in the filters i and r can be overall well recovered up to MAG AUTO ≤ 21.5, corresponding to a fitting completeness of ∼90 per cent below this threshold, for a total of 25 million galaxies. The combination of parametric and non-parametric structural measurements makes this catalogue an important instrument to explore and understand how galaxies form and evolve. The catalogue described in this paper will be publicly released alongside the DES collaboration Y1 cosmology data products at the following URL: https://des.ncsa.illinois.edu/releases
The PAU Survey: Intrinsic alignments and clustering of narrow-band photometric galaxies
We present the first measurements of the projected clustering and intrinsic alignments (IA) of galaxies observed by the Physics of the Accelerating Universe Survey (PAUS). With photometry in 40 narrow optical passbands (4500 Å–8500 Å), the quality of photometric redshift estimation is σz ∼ 0.01(1 + z) for galaxies in the 19 deg2 Canada-France-Hawaii Telescope Legacy Survey W3 field, allowing us to measure the projected 3D clustering and IA for flux-limited, faint galaxies (i < 22.5) out to z ∼ 0.8. To measure two-point statistics, we developed, and tested with mock photometric redshift samples, ‘cloned’ random galaxy catalogues which can reproduce data selection functions in 3D and account for photometric redshift errors. In our fiducial colour-split analysis, we made robust null detections of IA for blue galaxies and tentative detections of radial alignments for red galaxies (∼1 − 3σ), over scales of 0.1 − 18 h−1 Mpc. The galaxy clustering correlation functions in the PAUS samples are comparable to their counterparts in a spectroscopic population from the Galaxy and Mass Assembly survey, modulo the impact of photometric redshift uncertainty which tends to flatten the blue galaxy correlation function, whilst steepening that of red galaxies. We investigate the sensitivity of our correlation function measurements to choices in the random catalogue creation and the galaxy pair-binning along the line of sight, in preparation for an optimised analysis over the full PAUS area
- …
