Skip to content

Development and experimentation repository for the project "Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources". V. S. Pérez-Díaz, J. R. Martínez-Galarza, A. Caicedo, R. D'Abrusco.

Notifications You must be signed in to change notification settings

samuelperezdi/astrox

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

25 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

UNSUPERVISED MACHINE LEARNING FOR THE CLASSIFICATION OF ASTROPHYSICAL X-RAY SOURCES

Development and experimentation

Here, you'll find various code iterations, data tests, and experimental models. As it's a live environment for testing and iteration, contents may change frequently. Please refer to the 'umlcaxs' repository (https://github.com/samuelperezdi/umlcaxs) for the finalized and cleaned versions of our data and code.


Abstract

The Chandra Source Catalog (CSC) is the ultimate repository of sources detected by Chandra and represents a fertile ground for discovery in high energy astrophysics. A significant fraction of the sources in CSC have not been studied in detail and only a small fraction of them have been classified. Among the potentially interesting sources that we can look for in Chandra data are compact object mergers, extrasolar planet transits, tidal disruption events, etc. In order to conduct a thorough investigation of the CSC sources, we need to classify as many as we can. This work proposes an unsupervised machine learning approach to classify previously uncategorized Chandra Source Catalog sources by using only the X-ray data available. We propose a new methodology that performs Gaussian Mixture clustering to a set of CSC source properties, and then associates cluster members with objects previously classified in order to provide a probabilistic classification for the sources. Together with classification catalogs for 8756 sources, we present a discussion on the astrophysical implications, specific classification cases, comparison of the results with other catalogs, and limitations of the work. We demonstrate that under certain circumstances, we can distinguish between X-ray binaries from AGNs candidates. We classify YSO and AGN+Seyfert sources with certain confidence (avg. recalls >0.77 and >0.74). In particular cases, AGNs are confused depending on the level of obscuration. Our study offers novel insights into the diverse X-ray properties of astronomical sources, highlighting distinctive variability patterns across different classes and suggesting potential parallels between the AGN unified model and X-ray binaries. Overall, our results demonstrate that it is possible to assign probabilistic classes to a significant fraction of new X-ray sources based on the X-ray information only.


Authors:

  • Víctor Samuel Pérez-Díaz1,2*
    E-mail: samuelperez.di@gmail.com
  • Juan Rafael Martínez-Galarza1
  • Alexander Caicedo3, 4
  • Raffaele D'Abrusco1

Affiliations:

  1. Center for Astrophysics | Harvard & Smithsonian, 60 Garden Street, Cambridge, MA 02138, USA
  2. School of Engineering, Science and Technology, Universidad del Rosario, Cll. 12C No. 6-25, Bogotá, Colombia
  3. Department of Electronics Engineering, Pontificia Universidad Javeriana, Cra. 7 No. 40-62, Bogotá, Colombia
  4. Ressolve, Cra. 42 # 5 Sur - 145, Medellín, Colombia

About

Development and experimentation repository for the project "Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources". V. S. Pérez-Díaz, J. R. Martínez-Galarza, A. Caicedo, R. D'Abrusco.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published