论文标题
从多个调查中对超新星的泛质量光度分类和未来调查的转移学习
Pan-chromatic photometric classification of supernovae from multiple surveys and transfer learning for future surveys
论文作者
论文摘要
时间域天文学正在进入一个新时代,因为具有更高节奏的宽场调查允许比以往任何时候都有更多的发现。该领域已经看到,机器学习和深度学习的使用越来越多,将瞬变自动分类为既定的分类法。培训此类分类器需要足够大的代表性培训集,这对于诸如Vera Rubin天文台等新的未来调查,尤其是在运营开始时,不能保证。我们介绍了使用高斯工艺来从多个调查中创建超新星光曲线的统一表示,该曲线是通过开放的超新星目录获得的,用于通过卷积神经网络进行监督分类。我们还研究了转移学习的使用来从光度LSST天文时间序列分类挑战(Plactrc)数据集中对光曲线进行分类。使用卷积神经网络将高斯过程分类从多个调查中生成的超新星光曲线表示,我们将AUC得分达到0.859,将IA,IBC型分类为0.859。我们发现,在对Parterc Light Curves进行分类时,转移学习将最多代表性的类别的分类准确性提高了多达18%,并且在包括分类为六个类中的光度红移时能够达到0.945(IA,IA-91BG,IA-91BG,IBC,IBC,II,II,SLSN-I)。我们还研究了在有限的标记培训集时,我们研究转移学习的有用性,以查看该方法在操作开始时如何在未来的调查中用于培训分类器。
Time-domain astronomy is entering a new era as wide-field surveys with higher cadences allow for more discoveries than ever before. The field has seen an increased use of machine learning and deep learning for automated classification of transients into established taxonomies. Training such classifiers requires a large enough and representative training set, which is not guaranteed for new future surveys such as the Vera Rubin Observatory, especially at the beginning of operations. We present the use of Gaussian processes to create a uniform representation of supernova light curves from multiple surveys, obtained through the Open Supernova Catalog for supervised classification with convolutional neural networks. We also investigate the use of transfer learning to classify light curves from the Photometric LSST Astronomical Time Series Classification Challenge (PLAsTiCC) dataset. Using convolutional neural networks to classify the Gaussian process generated representation of supernova light curves from multiple surveys, we achieve an AUC score of 0.859 for classification into Type Ia, Ibc, and II. We find that transfer learning improves the classification accuracy for the most under-represented classes by up to 18% when classifying PLAsTiCC light curves, and is able to achieve an AUC score of 0.945 when including photometric redshifts for classification into six classes (Ia, Iax, Ia-91bg, Ibc, II, SLSN-I). We also investigate the usefulness of transfer learning when there is a limited labelled training set to see how this approach can be used for training classifiers in future surveys at the beginning of operations.