Search CORE

144 research outputs found

Photometric redshift estimation based on data mining with PhotoRApToR

Author: Brescia Massimo
Cavuoti Stefano
De Stefano Virgilio
Longo Giuseppe
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2015
Field of study

Photometric redshifts (photo-z) are crucial to the scientific exploitation of modern panchromatic digital surveys. In this paper we present PhotoRApToR (Photometric Research Application To Redshift): a Java/C++ based desktop application capable to solve non-linear regression and multi-variate classification problems, in particular specialized for photo-z estimation. It embeds a machine learning algorithm, namely a multilayer neural network trained by the Quasi Newton learning rule, and special tools dedicated to pre- and postprocessing data. PhotoRApToR has been successfully tested on several scientific cases. The application is available for free download from the DAME Program web site.Comment: To appear on Experimental Astronomy, Springer, 20 pages, 15 figure

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

OA@INAF - Istituto Nazionale di Astrofisica

Automated physical classification in the SDSS DR10. A catalogue of candidate Quasars

Author: Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2015
Field of study

We discuss whether modern machine learning methods can be used to characterize the physical nature of the large number of objects sampled by the modern multi-band digital surveys. In particular, we applied the MLPQNA (Multi Layer Perceptron with Quasi Newton Algorithm) method to the optical data of the Sloan Digital Sky Survey - Data Release 10, investigating whether photometric data alone suffice to disentangle different classes of objects as they are defined in the SDSS spectroscopic classification. We discuss three groups of classification problems: (i) the simultaneous classification of galaxies, quasars and stars; (ii) the separation of stars from quasars; (iii) the separation of galaxies with normal spectral energy distribution from those with peculiar spectra, such as starburst or starforming galaxies and AGN. While confirming the difficulty of disentangling AGN from normal galaxies on a photometric basis only, MLPQNA proved to be quite effective in the three-class separation. In disentangling quasars from stars and galaxies, our method achieved an overall efficiency of 91.31% and a QSO class purity of ~95%. The resulting catalogue of candidate quasars/AGNs consists of ~3.6 million objects, of which about half a million are also flagged as robust candidates, and will be made available on CDS VizieR facility.Comment: Accepted for publication by MNRAS, 13 pages, 6 figure

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

OA@INAF - Istituto Nazionale di Astrofisica

Data-Rich Astronomy: Mining Sky Surveys with PhotoRApToR

Author: Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Publication venue
Publication date: 01/01/2014
Field of study

In the last decade a new generation of telescopes and sensors has allowed the production of a very large amount of data and astronomy has become a data-rich science. New automatic methods largely based on machine learning are needed to cope with such data tsunami. We present some results in the fields of photometric redshifts and galaxy classification, obtained using the MLPQNA algorithm available in the DAMEWARE (Data Mining and Web Application Resource) for the SDSS galaxies (DR9 and DR10). We present PhotoRApToR (Photometric Research Application To Redshift): a Java based desktop application capable to solve regression and classification problems and specialized for photo-z estimation.Comment: proceedings of the IAU Symposium, Vol. 306, Cambridge University Pres

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

OA@INAF - Istituto Nazionale di Astrofisica

PhotoRaptor - Photometric Research Application To Redshifts

Author: Brescia Massimo
Cavuoti Stefano
De Stefano Virgilio
Longo Giuseppe
Publication venue
Publication date: 01/01/2016
Field of study

Due to the necessity to evaluate photo-z for a variety of huge sky survey data sets, it seemed important to provide the astronomical community with an instrument able to fill this gap. Besides the problem of moving massive data sets over the network, another critical point is that a great part of astronomical data is stored in private archives that are not fully accessible on line. So, in order to evaluate photo-z it is needed a desktop application that can be downloaded and used by everyone locally, i.e. on his own personal computer or more in general within the local intranet hosted by a data center. The name chosen for the application is PhotoRApToR, i.e. Photometric Research Application To Redshift (Cavuoti et al. 2015, 2014; Brescia 2014b). It embeds a machine learning algorithm and special tools dedicated to preand post-processing data. The ML model is the MLPQNA (Multi Layer Perceptron trained by the Quasi Newton Algorithm), which has been revealed particularly powerful for the photo-z calculation on the base of a spectroscopic sample (Cavuoti et al. 2012; Brescia et al. 2013, 2014a; Biviano et al. 2013). The PhotoRApToR program package is available, for different platforms, at the official website (http://dame.dsf.unina.it/dame_photoz.html#photoraptor).Comment: User Manual of the PhotoRaptor tool, 54 pages. arXiv admin note: substantial text overlap with arXiv:1501.0650

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

Photometric redshifts with Quasi Newton Algorithm (MLPQNA). Results in the PHAT1 contest

Author: Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Mercurio Amata
Publication venue: 'EDP Sciences'
Publication date: 01/01/2012
Field of study

Context. Since the advent of modern multiband digital sky surveys, photometric redshifts (photo-z's) have become relevant if not crucial to many fields of observational cosmology, from the characterization of cosmic structures, to weak and strong lensing. Aims. We describe an application to an astrophysical context, namely the evaluation of photometric redshifts, of MLPQNA, a machine learning method based on Quasi Newton Algorithm. Methods. Theoretical methods for photo-z's evaluation are based on the interpolation of a priori knowledge (spectroscopic redshifts or SED templates) and represent an ideal comparison ground for neural networks based methods. The MultiLayer Perceptron with Quasi Newton learning rule (MLPQNA) described here is a computing effective implementation of Neural Networks for the first time exploited to solve regression problems in the astrophysical context and is offered to the community through the DAMEWARE (DAta Mining & ExplorationWeb Application REsource) infrastructure. Results. The PHAT contest (Hildebrandt et al. 2010) provides a standard dataset to test old and new methods for photometric redshift evaluation and with a set of statistical indicators which allow a straightforward comparison among different methods. The MLPQNA model has been applied on the whole PHAT1 dataset of 1984 objects after an optimization of the model performed by using as training set the 515 available spectroscopic redshifts. When applied to the PHAT1 dataset, MLPQNA obtains the best bias accuracy (0.0006) and very competitive accuracies in terms of scatter (0.056) and outlier percentage (16.3%), scoring as the second most effective empirical method among those which have so far participated to the contest. MLPQNA shows better generalization capabilities than most other empirical methods especially in presence of underpopulated regions of the Knowledge Base.Comment: Accepted for publication in Astronomy & Astrophysics; 9 pages, 2 figure

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

Crossref

EDP Sciences OAI-PMH repository (1.2.0)

OA@INAF - Istituto Nazionale di Astrofisica

Archivio della Ricerca - Università di Salerno

Return of the features. Efficient feature selection and interpretation for photometric redshifts

Author: Cavuoti Stefano
D'Isanto Antonio
Gieseke Fabian
Polsterer Kai Lars
Publication venue: 'EDP Sciences'
Publication date: 01/01/2018
Field of study

The explosion of data in recent years has generated an increasing need for new analysis techniques in order to extract knowledge from massive datasets. Machine learning has proved particularly useful to perform this task. Fully automatized methods have recently gathered great popularity, even though those methods often lack physical interpretability. In contrast, feature based approaches can provide both well-performing models and understandable causalities with respect to the correlations found between features and physical processes. Efficient feature selection is an essential tool to boost the performance of machine learning models. In this work, we propose a forward selection method in order to compute, evaluate, and characterize better performing features for regression and classification problems. Given the importance of photometric redshift estimation, we adopt it as our case study. We synthetically created 4,520 features by combining magnitudes, errors, radii, and ellipticities of quasars, taken from the SDSS. We apply a forward selection process, a recursive method in which a huge number of feature sets is tested through a kNN algorithm, leading to a tree of feature sets. The branches of the tree are then used to perform experiments with the random forest, in order to validate the best set with an alternative model. We demonstrate that the sets of features determined with our approach improve the performances of the regression models significantly when compared to the performance of the classic features from the literature. The found features are unexpected and surprising, being very different from the classic features. Therefore, a method to interpret some of the found features in a physical context is presented. The methodology described here is very general and can be used to improve the performance of machine learning models for any regression or classification task.Comment: 21 pages, 11 figures, accepted for publication on A&A, final version after language revisio

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

EDP Sciences OAI-PMH repository (1.2.0)

Copenhagen University Research Information System

OA@INAF - Istituto Nazionale di Astrofisica

Stellar formation rates in galaxies using Machine Learning models

Author: Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Riccio Giuseppe
Veneri Michele Delli
Publication venue
Publication date: 01/01/2018
Field of study

Global Stellar Formation Rates or SFRs are crucial to constrain theories of galaxy formation and evolution. SFR's are usually estimated via spectroscopic observations which require too much previous telescope time and therefore cannot match the needs of modern precision cosmology. We therefore propose a novel method to estimate SFRs for large samples of galaxies using a variety of supervised ML models.Comment: ESANN 2018 - Proceedings, ISBN-13 978287587048

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

OA@INAF - Istituto Nazionale di Astrofisica

Probability density estimation of photometric redshifts based on machine learning

Author: Amaro Valeria
Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Tortora Crescenzo
Vellucci Civita
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 01/01/2017
Field of study

Photometric redshifts (photo-z's) provide an alternative way to estimate the distances of large samples of galaxies and are therefore crucial to a large variety of cosmological problems. Among the various methods proposed over the years, supervised machine learning (ML) methods capable to interpolate the knowledge gained by means of spectroscopical data have proven to be very effective. METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts) is a novel method designed to provide a reliable PDF (Probability density Function) of the error distribution of photometric redshifts predicted by ML methods. The method is implemented as a modular workflow, whose internal engine for photo-z estimation makes use of the MLPQNA neural network (Multi Layer Perceptron with Quasi Newton learning rule), with the possibility to easily replace the specific machine learning model chosen to predict photo-z's. After a short description of the software, we present a summary of results on public galaxy data (Sloan Digital Sky Survey - Data Release 9) and a comparison with a completely different method based on Spectral Energy Distribution (SED) template fitting.Comment: 2016 IEEE Symposium Series on Computational Intelligence, SSCI 2016 784995

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

Crossref

Proceedings - University of Groningen

University of Groningen

ARTS repository - University of Groningen

Dissertations of the University of Groningen

Genetic Algorithm Modeling with GPU Parallel Computing Technology

Author: Brescia Massimo
Cavuoti Stefano
Garofalo Mauro
Longo Giuseppe
Pescapé Antonio
Ventre Giorgio
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2012
Field of study

We present a multi-purpose genetic algorithm, designed and implemented with GPGPU / CUDA parallel computing technology. The model was derived from a multi-core CPU serial implementation, named GAME, already scientifically successfully tested and validated on astrophysical massive data classification problems, through a web application resource (DAMEWARE), specialized in data mining based on Machine Learning paradigms. Since genetic algorithms are inherently parallel, the GPGPU computing paradigm has provided an exploit of the internal training features of the model, permitting a strong optimization in terms of processing performances and scalability.Comment: 11 pages, 2 figures, refereed proceedings; Neural Nets and Surroundings, Proceedings of 22nd Italian Workshop on Neural Nets, WIRN 2012; Smart Innovation, Systems and Technologies, Vol. 19, Springe

arXiv.org e-Print Archive

Crossref

Archivio della ricerca - Università degli studi di Napoli Federico II

METAPHOR: Probability density estimation for machine learning based photometric redshifts

Author: Amaro Valeria
Brescia Massimo
Cavuoti Stefano
Longo Giuseppe
Tortora Crescenzo
Vellucci Civita
Publication venue: 'Cambridge University Press (CUP)'
Publication date: 01/10/2016
Field of study

We present METAPHOR (Machine-learning Estimation Tool for Accurate PHOtometric Redshifts), a method able to provide a reliable PDF for photometric galaxy redshifts estimated through empirical techniques. METAPHOR is a modular workflow, mainly based on the MLPQNA neural network as internal engine to derive photometric galaxy redshifts, but giving the possibility to easily replace MLPQNA with any other method to predict photo-z's and their PDF. We present here the results about a validation test of the workflow on the galaxies from SDSS-DR9, showing also the universality of the method by replacing MLPQNA with KNN and Random Forest models. The validation test include also a comparison with the PDF's derived from a traditional SED template fitting method (Le Phare).Comment: proceedings of the International Astronomical Union, IAU-325 symposium, Cambridge University pres

arXiv.org e-Print Archive

Archivio della ricerca - Università degli studi di Napoli Federico II

Proceedings - University of Groningen

Crossref

University of Groningen

ARTS repository - University of Groningen

OA@INAF - Istituto Nazionale di Astrofisica

Dissertations of the University of Groningen