Automated reliability assessment for spectroscopic redshift measurements

A. Schmitt; Abdalla; B. Garilli; Baldry; Benitez; Beutler; Beutler; Bolzonella; Brammer; C. Surace; Chandola; Collister; Cool; D. Vibert; Dietterich; Fawcett; Feldmann; Garilli; Garilli; Guzzo; Hastie; Huterer; Ilbert; L. Pozzetti; Le Fèvre; Le Fèvre; Le Fèvre; Linder; M. Moresco; Machado; O. Le Fèvre; Patcha; S. Jamal; Schuecker; Scodeggio; Shahid; Simkin; Tonry; V. Le Brun; Wahba; Wang; Y. Copin

research

Automated reliability assessment for spectroscopic redshift measurements

Authors: A. Schmitt
Abdalla
B. Garilli
Baldry
Benitez
Beutler
Beutler
Bolzonella
Brammer
C. Surace
Chandola
Collister
Cool
D. Vibert
Dietterich
Fawcett
Feldmann
Garilli
Garilli
Guzzo
Hastie
Huterer
Ilbert
L. Pozzetti
Le Fèvre
Le Fèvre
Le Fèvre
Linder
M. Moresco
Machado
O. Le Fèvre
Patcha
S. Jamal
Schuecker
Scodeggio
Shahid
Simkin
Tonry
V. Le Brun
Wahba
Wang
Y. Copin
Publication date: 1 January 2018
Publisher: 'EDP Sciences'
Doi

Abstract

We present a new approach to automate the spectroscopic redshift reliability assessment based on machine learning (ML) and characteristics of the redshift probability density function (PDF). We propose to rephrase the spectroscopic redshift estimation into a Bayesian framework, in order to incorporate all sources of information and uncertainties related to the redshift estimation process, and produce a redshift posterior PDF that will be the starting-point for ML algorithms to provide an automated assessment of a redshift reliability. As a use case, public data from the VIMOS VLT Deep Survey is exploited to present and test this new methodology. We first tried to reproduce the existing reliability flags using supervised classification to describe different types of redshift PDFs, but due to the subjective definition of these flags, soon opted for a new homogeneous partitioning of the data into distinct clusters via unsupervised classification. After assessing the accuracy of the new clusters via resubstitution and test predictions, unlabelled data from preliminary mock simulations for the Euclid space mission are projected into this mapping to predict their redshift reliability labels.Comment: Submitted on 02 June 2017 (v1). Revised on 08 September 2017 (v2). Latest version 28 September 2017 (this version v3