Abstract
Context. The formation time of dark-matter halos quantifies their mass assembly history and, as such, directly impacts the structural and dynamical properties of the galaxies within them, and even influences galaxy evolution. Despite its importance, halo formation time is not directly observable, necessitating the use of indirect observational proxies-often based on star formation history or galaxy spatial distributions. Recent advancements in machine learning allow for a more comprehensive analysis of galaxy and halo properties, making it possible to develop models for more accurate prediction of halo formation times.
Aims. This study aims to investigate a machine learning-based approach to predict halo formation time-defined as the epoch when a halo accretes half of its current mass-using both halo and baryonic properties derived from cosmological simulations. By incorporating properties associated with the brightest cluster galaxy located at the cluster center, its associated intracluster light component, and satellite galaxies, we aim to surpass these analytical predictions, improve prediction accuracy, and identify key properties that can provide the best proxy for the halo assembly history.
Methods. Using The Three Hundred cosmological hydrodynamical simulations, we trained random forest and convolutional neural network (CNN) models. The random forest models were trained using a range of dark matter halo and baryonic properties, including halo mass, concentration, stellar and gas masses, and properties of the brightest cluster galaxy and intracluster light within different radial apertures, while CNNs were trained on two-dimensional radial property maps generated by binning particles as a function of radius. Based on these results, we also constructed simple linear models that incrementally incorporate observationally accessible features to optimize the prediction of halo formation time for minimal bias and scatter.
Results. Our RF models demonstrated median biases between 4% and 9% with relative error standard deviations of around 20% in the prediction of the halo formation time. The CNN models trained on two-dimensional property maps, further reduced the median bias to .4%, though with a higher scatter than the random forest models. With our simple linear models, one can easily predict the halo formation time with only a limited number of observables and with the bias and scatter compatible with random forest results. Lastly, we also show that the traditional relations between halo formation time and halo mass or concentration are well preserved with our predicted values.
Journal Title
Journal ISSN
Volume Title
Publisher
EDP SCIENCES
URL external
Date
Description
Keywords
Citation
Srivastava, A., Cui, W., de Andres, D., Golden-Marx, J. B., Rasia, E., & Zu, Y. (2025). Predicting halo formation time using machine learning. Astronomy & Astrophysics, 700, A87. DOI: https://doi.org/10.1051/0004-6361/202453165
Collections
Endorsement
Review
Supplemented By
Referenced By
Document viewer
Select a file to preview:
Reload



