Gaia Data Release 2 All-sky classification of high-amplitude pulsating stars

L. Rimoldini, B. Holl, M. Audard, N. Mowlavi, K. Nienartowicz, D. W. Evans, L. P. Guy, I. Lecoeur-Taïbi, G. Jevardat de Fombelle, O. Marchal, M. Roelens, J. De Ridder, L. M. Sarro, S. Regibo, M. Lopez, G. Clementini, V. Ripepi, R. Molinaro, A. Garofalo, L. Molnár, E. Plachy, Á. Juhász, L. Szabados, T. Lebzelter, D. Teyssier and L. Eyer. 2019. Gaia Data Release 2 All-sky classification of high-amplitude pulsating stars. Astronomy and Astrophysics 625 DOI: 10.1051/0004-6361/201834616

More than half a million of the 1.69 billion sources in Gaia Data Release 2 (DR2) are published with photometric time series that exhibit light variations during the 22 months of observation.

Aims. An all-sky classification of common high-amplitude pulsators (Cepheids, long-period variables, delta Scuti/SX Phoenicis, and RR Lyrae stars) is provided for stars with brightness variations greater than 0.1 mag in G band.

Methods. A semi-supervised classification approach was employed, firstly training multi-stage random forest classifiers with sources of known types in the literature, followed by a preliminary classification of the Gaia data and a second training phase that included a selection of the first classification results to improve the representation of some classes, before the improved classifiers were applied to the Gaia data. Dedicated validation classifiers were used to reduce the level of contamination in the published results. A relevant fraction of objects were not yet sufficiently sampled for reliable Fourier series decomposition, consequently classifiers were based on features derived from statistics of photometric time series in the G, G(BP), and G(RP) bands, as well as from some astrometric parameters.

Results. The published classification results include 195 780 RR Lyrae stars, 150 757 long-period variables, 8550 Cepheids, and 8882 delta Scuti/SX Phoenicis stars. All of these results represent candidates whose completeness and contamination are described as a function of variability type and classification reliability. Results are expressed in terms of class labels and classification scores, which are available in the vari_classifier_result table of the Gaia archive.

Other publications