Machine learning study to identify collective flow in small and large colliding systems

Shuang Guo Key Laboratory of Nuclear Physics and Ion-beam Application (MOE), Institute of Modern Physics, Fudan University, Shanghai 200433, China Shanghai Research Center for Theoretical Nuclear Physics, NSFC and Fudan University, Shanghai 200438200438200438200438, China    Han-Sheng Wang Key Laboratory of Nuclear Physics and Ion-beam Application (MOE), Institute of Modern Physics, Fudan University, Shanghai 200433, China Shanghai Research Center for Theoretical Nuclear Physics, NSFC and Fudan University, Shanghai 200438200438200438200438, China    Kai Zhou [email protected] School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen (CUHK-Shenzhen), Guangdong, 518172518172518172518172, China    Guo-Liang Ma [email protected] Key Laboratory of Nuclear Physics and Ion-beam Application (MOE), Institute of Modern Physics, Fudan University, Shanghai 200433, China Shanghai Research Center for Theoretical Nuclear Physics, NSFC and Fudan University, Shanghai 200438200438200438200438, China
Abstract

Collective flow has been found to be similar between small colliding systems (p𝑝pitalic_p +++ p𝑝pitalic_p and p𝑝pitalic_p +++ A collisions) and large colliding systems (peripheral A +++ A collisions) at the CERN Large Hadron Collider. In order to study the differences of collective flow between small and large colliding systems, we employ a point cloud network to identify p𝑝pitalic_p +++ Pb collisions and peripheral Pb +++ Pb collisions at sNN=subscript𝑠𝑁𝑁absent\sqrt{s_{NN}}=square-root start_ARG italic_s start_POSTSUBSCRIPT italic_N italic_N end_POSTSUBSCRIPT end_ARG = 5.02 TeV generated from a multiphase transport model (AMPT). After removing the discrepancies in the pseudorapidity distribution and the pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra, we capture the discrepancy in collective flow. Although the verification accuracy of our PCN is limited due to similar event-by-event distributions of elliptic and triangular flow, we demonstrate that collective flow between p𝑝pitalic_p +++ Pb collisions and peripheral Pb +++ Pb collisions becomes more distinct with increasing final hadron multiplicity and parton scattering cross section. This study not only highlights the potential of PCN techniques in advancing the understanding of collective flow in varying colliding systems, but more importantly lays the groundwork for the future PCN-related research.

I INTRODUCTION

With advancements in computational hardware and algorithms, machine learning (ML) techniques have become powerful tools for extracting information from big data. Among many ML architectures, the point cloud network (PCN) stands out for its efficiency and effectiveness in solving problems involving point cloud-structured data, such as three-dimensional (3D) object segmentation and scene semantic parsing Qi et al. (2016). The unique operations employed in the PCN enhance its capability to process high-dimensional data effectively compared to traditional convolutional neural networks.

The PCN has demonstrated great potential in handling complex and irregular data in various physics domains. It excels in situations where traditional methods struggle with geometric complexity and high-dimensional data. For example, in fluid dynamics, the PCN is used to predict fluid flow fields on irregular geometries. By using point-cloud-based neural networks, researchers are able to capture geometric features and solve the partial differential equations governing fluid flow, resulting in accurate predictions in complex domains Kashefi et al. (2021); Kashefi and Mukerji (2022). In quantum computing, the integration of quantum convolutional neural networks with point cloud data processing provides a scalable solution for classifying high-dimensional data, thereby advancing quantum machine learning Baek et al. (2022). Additionally, in materials science, especially in additive manufacturing, the PCN can detect surface defects in real-time during the manufacturing process, leading to improved product quality and manufacturing efficiency Chen et al. (2021).

Recently, machine learning (ML) techniques have been widely used in various areas of high-energy nuclear physics Pang et al. (2018); Zhou et al. (2019); Graczykowski et al. (2022); ZHOU et al. (2022); WANG et al. (2022); HE et al. (2022); DU et al. (2022); LI et al. (2023); He et al. (2023); Ma et al. (2023); Zhou et al. (2024); Boehnlein et al. (2022). In high-energy nuclear physics, both experimental detector outputs and most models for heavy-ion collisions primarily produce particle tracks. Since a track can be represented as an N-dimensional point, an event naturally becomes a point cloud of disordered points in space. Consequently, PCNs have been well-utilized in this field for tasks such as precisely reconstructing collision impact parameters Steinheimer et al. (2019); Omana Kuttan et al. (2020a); Omana Kuttan et al. (2021), classifying equations of state (EOS) Omana Kuttan et al. (2020b), and identifying weak intermittency signals associated with critical phenomena Huang et al. (2022) by learning the data of tracks in heavy-ion collisions. The potential applications of the PCN extend far beyond the examples provided above. In this paper, we demonstrate how the PCN can distinguish between p𝑝pitalic_p +++ Pb collisions and peripheral Pb +++ Pb collisions, highlighting its ability to diagnose differences in the physical characteristics of these two collision systems.

Quark-gluon plasma (QGP) at extreme conditions of high temperature and density is thought to be a form of the early universe, which has been produced in the laboratory by relativistic heavy-ion collisions at the BNL Relativistic Heavy Ion Collider (RHIC) and the CERN Large Hadron Collider (LHC) Adams et al. (2005); Adcox et al. (2005); Aamodt et al. (2008); Bzdak et al. (2020); Luo and Xu (2017). The experimental results have shown that this new type of nearly perfect fluid can translate initial spatial geometry or initial energy density fluctuations into momentum anisotropy of final particles through the pressure gradient in hydrodynamics Heinz and Snellings (2013); Song and Heinz (2008); Jeon and Heinz (2015); Shen and Yan (2020); Gale et al. (2013); Yan (2018); Alver and Roland (2010); Ma and Wang (2011); Yan et al. (2014); Qin et al. (2010), thus resulting in the emergence of strong collective flow in A +++ A collisions Ollitrault (1992); Stoecker (2005).

Over the last decade, measurements of collective flow in various colliding systems, such as p𝑝pitalic_p +++ p𝑝pitalic_p, p𝑝pitalic_p +++ Pb collisions at the LHC Khachatryan et al. (2010); Chatrchyan et al. (2013a); Abelev et al. (2013a); Aad et al. (2013a); Sirunyan et al. (2018); Acharya et al. (2019), p𝑝pitalic_p +++ Au, d𝑑ditalic_d +++ Au, and 3He +++ Au collisions at RHIC Adare et al. (2015); Adamczyk et al. (2015); Aidala et al. (2019); Acharya et al. (2022); Abdulameer et al. (2023), have been performed. Surprisingly, similar collective flow is found in peripheral A +++ A and high multiplicity p𝑝pitalic_p +++ A collisions at the same multiplicity, raising doubts about whether QGP droplets can also be generated in small colliding systems. Many theoretical efforts have been made to understand the origin of collective flow in small colliding systems Dusling et al. (2016); Loizides (2016); Nagle and Zajc (2018). Similar to large colliding systems, hydrodynamics in the final state can transform initial geometric asymmetry into final momentum anisotropic flow through the pressure gradient of the QGP in small colliding systems Bozek (2012); Bzdak et al. (2013); Shuryak and Zahed (2013); Qin and Müller (2014); Bozek and Broniowski (2013); Bozek et al. (2015); Song et al. (2017). Conversely, it is generally believed that the transport model will behave more like hydrodynamics as the multiplicity or scattering cross section increases, i.e., the change of dynamics from non-equilibrium to equilibrium. A multi-phase transport (AMPT) model Lin et al. (2005) is capable of describing the experimental data on both radial and anisotropic flow in both large Chen et al. (2006); Lin (2014); Ma and Lin (2016) and small colliding systems Bzdak and Ma (2014); Orjuela Koop et al. (2015); Ma and Bzdak (2016). Since most of partons are not scattered especially for the small colliding systems at RHIC and the LHC, a parton escape mechanism has been proposed to explain the formation of azimuthal anisotropies in the transport model He et al. (2016); Lin et al. (2016). However, it has been shown that parton scatterings are crucial for generating anisotropic flows Ma and Bzdak (2016); Ma et al. (2021). Using a new test-particle method, we recently proved that collectivity established by final state parton scatterings is much stronger in large colliding systems than that in small colliding systems Wang and Ma (2022). The event-averaged flow vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT primarily reflects the averaged hydrodynamic response to the initial collision geometry of the produced QGP. More helpful information, such as the event-by-event (EbyE) fluctuations of the overlap region Alver and Roland (2010), can be obtained by measuring event-by-event vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT distribution P(vn)𝑃subscript𝑣𝑛P(v_{n})italic_P ( italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) for charged hadrons, as measured by the ATLAS Collaboration using an unfolding method in Pb +++ Pb collisions at sNNsubscript𝑠𝑁𝑁\sqrt{s_{NN}}square-root start_ARG italic_s start_POSTSUBSCRIPT italic_N italic_N end_POSTSUBSCRIPT end_ARG = 2.76 TeV Aad et al. (2013b). This provides a good constraint on the initial condition in A +++ A collisions. However, the corresponding experimental measurements in small colliding systems have yet to be available, which would provide more information on the differences in the origin of collectivity between small and large colliding systems.

In addition to anisotropic flow vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, investigating other observables’ differences between large and small colliding systems, such as pseudorapidity distribution and pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra, is worthwhile. For example, previous experimental studies have shown power law-shaped pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra in small colliding systems Adams et al. (2006); Abelev et al. (2014); Adam et al. (2015); Acharya et al. (2020); Abelev et al. (2009, 2013b), unlike exponential-shaped pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra in large colliding systems Adcox et al. (2004); Abelev et al. (2009, 2013b).

These motivate us to adopt the PCN in supervised training, to find the different EbyE features between the final state of p𝑝pitalic_p +++ Pb collisions and peripheral Pb +++ Pb collisions from the multi-phase transport (AMPT) model. Our goal is to identify the discrepancies in collective flow between large and small systems, which will enhance our understanding of the mechanisms underlying the generation of collective flow in small systems.Ultimately, we aim to combine experimental data to determine whether small colliding systems also produce the QGP or exhibit very different characteristics of collective flow compared to large colliding systems.

Additionally, our PCN study holds potential as a valuable tool for various other research interests, such as how to potentially classify the origin of atmospheric particle showers in cosmic ray physics Erdmann et al. (2018, 2019). When a particle of cosmic radiation, such as a proton or a nucleus, interacts with a molecule’s nucleus in the atmosphere, it generates numerous secondary particles, resulting in an air shower. These interactions are likely to be either p𝑝pitalic_p +++ Pb collision type or Pb +++ Pb collision type Lingenfelter (2019). This is the current focus of the experiments in the Telescope Array Project Tokuno et al. (2012) and the Pierre Auger Observatory Abraham et al. (2004). Our PCN study could be used in cosmic ray physics to enable a better understanding of the origin of these showers.

The paper is organized as follows. First, we introduce the AMPT model, which generates the data for p𝑝pitalic_p +++ Pb collisions and peripheral Pb +++ Pb collisions, and the relevant observables in Sec. II. In Sec. III, we describe the details of the PCN. In Sec. IV, the test accuracy of the PCN is presented, and the relevant physics is discussed. Finally, we summarize and give the implications of our results in Sec. V.

II MODEL AND METHOD

II.1 A multiphase transport model

The string melting version of the AMPT model consists of four main stages of heavy-ion collisions, i.e., initial state, parton cascade, hadronization, and hadronic rescatterings. The initial state with fluctuating initial conditions is generated by the heavy ion jet interaction generator (HIJING) model Gyulassy and Wang (1994). In HIJING model, minijet partons and excited strings are produced by hard and soft processes, respectively. In the string melting mechanism, all excited hadronic strings in the overlap volume are converted to partons according to the flavor and spin structures of their valence quarks Lin and Ko (2002). The initial positions of partons originating from melted strings are determined by tracing their parent hadrons along straight-line trajectories. The interactions among partons are described by the Zhang’s parton cascade (ZPC) model Zhang (1998), which includes only two-body elastic scatterings with a g+g \rightarrow g+g cross section, i.e.,

dσdt^=9παs22(1+μ2s^)1(t^μ2)2,𝑑𝜎𝑑^𝑡9𝜋subscriptsuperscript𝛼2𝑠21superscript𝜇2^𝑠1superscript^𝑡superscript𝜇22\frac{d\sigma}{d\hat{t}}=\frac{9\pi\alpha^{2}_{s}}{2}(1+\frac{\mu^{2}}{\hat{s}% })\frac{1}{(\hat{t}-\mu^{2})^{2}},divide start_ARG italic_d italic_σ end_ARG start_ARG italic_d over^ start_ARG italic_t end_ARG end_ARG = divide start_ARG 9 italic_π italic_α start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT end_ARG start_ARG 2 end_ARG ( 1 + divide start_ARG italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG start_ARG over^ start_ARG italic_s end_ARG end_ARG ) divide start_ARG 1 end_ARG start_ARG ( over^ start_ARG italic_t end_ARG - italic_μ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT ) start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG , (1)

where αssubscript𝛼𝑠\alpha_{s}italic_α start_POSTSUBSCRIPT italic_s end_POSTSUBSCRIPT is the strong coupling constant (taken as 0.33), while s^^𝑠\hat{s}over^ start_ARG italic_s end_ARG and t^^𝑡\hat{t}over^ start_ARG italic_t end_ARG are the usual Mandelstam variables. The effective screening mass μ𝜇\muitalic_μ is taken as a parameter in ZPC for adjusting the parton scattering cross section. Note that previous AMPT model studies have shown that a parton scattering cross section of 3 mb can well describe both large and small colliding systems at RHIC and the LHC energies Lin (2014); Orjuela Koop et al. (2015); Ma and Bzdak (2016); Ma and Lin (2016); He and Lin (2017); Lin and Zheng (2021). However, we will employ different parton scattering cross sections of 0 mb, 3 mb and 10 mb in order to establish collective flow of different strengths in this study. A quark coalescence model is used for hadronization at the freezeout of the parton system. The hadronic scatterings in the hadronic phase are simulated by a relativistic transport (ART) model Li and Ko (1995).

In this study, we simulated 2 million events each for peripheral Pb +++ Pb collisions and p𝑝pitalic_p +++ Pb minimum bias collisions at sNNsubscript𝑠𝑁𝑁\sqrt{s_{NN}}square-root start_ARG italic_s start_POSTSUBSCRIPT italic_N italic_N end_POSTSUBSCRIPT end_ARG = 5.02 TeV, using the string-melting version of the AMPT model.

II.2 Anisotropic flow

The anisotropic collective flow can be defined according to a Fourier decomposition of the azimuthal angle ϕitalic-ϕ\phiitalic_ϕ distribution of measured particles,

dNdϕ=12π[1+n=12vncos(n(ϕψn))].d𝑁ditalic-ϕ12𝜋delimited-[]1superscriptsubscript𝑛12subscript𝑣𝑛𝑛italic-ϕsubscript𝜓𝑛\displaystyle\frac{{\rm d}N}{{\rm d}\phi}=\frac{1}{2\pi}\Bigg{[}1+\sum_{n=1}^{% \infty}2v_{n}\cos{\left(n(\phi-\psi_{n})\right)}\Bigg{]}.divide start_ARG roman_d italic_N end_ARG start_ARG roman_d italic_ϕ end_ARG = divide start_ARG 1 end_ARG start_ARG 2 italic_π end_ARG [ 1 + ∑ start_POSTSUBSCRIPT italic_n = 1 end_POSTSUBSCRIPT start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT 2 italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT roman_cos ( italic_n ( italic_ϕ - italic_ψ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ) ] . (2)

The nthsuperscript𝑛thn^{\rm th}italic_n start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT order of harmonic flow coefficient vn=cos[n(ϕψn)]subscript𝑣𝑛delimited-⟨⟩𝑛italic-ϕsubscript𝜓𝑛v_{n}=\langle\cos{[n(\phi-\psi_{n})]}\rangleitalic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = ⟨ roman_cos [ italic_n ( italic_ϕ - italic_ψ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ) ] ⟩ characterizes the magnitude of azimuthal anisotropies of the particle spectrum in the transverse directions Voloshin and Zhang (1996), while the phase ψnsubscript𝜓𝑛\psi_{n}italic_ψ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is the nthsuperscript𝑛thn^{\rm th}italic_n start_POSTSUPERSCRIPT roman_th end_POSTSUPERSCRIPT harmonic event plane angle. The second (v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT) and third (v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT) Fourier coefficients represent the amplitudes of elliptic and triangular flow, respectively. The linearized hydrodynamic response shows that anisotropy flow vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is likely correlated to the geometry asymmetry of energy density profile in spatial space of the initial state, namely the initial eccentricity εnsubscript𝜀𝑛\varepsilon_{n}italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT,

εn=rncos(nϕr)2+rnsin(nϕr)2rn,subscript𝜀𝑛superscriptdelimited-⟨⟩superscript𝑟𝑛𝑛subscriptitalic-ϕ𝑟2superscriptdelimited-⟨⟩superscript𝑟𝑛𝑛subscriptitalic-ϕ𝑟2delimited-⟨⟩superscript𝑟𝑛\varepsilon_{n}=\frac{\sqrt{\langle r^{n}\cos(n\phi_{r})\rangle^{2}+\langle r^% {n}\sin(n\phi_{r})\rangle^{2}}}{\langle r^{n}\rangle},italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = divide start_ARG square-root start_ARG ⟨ italic_r start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT roman_cos ( italic_n italic_ϕ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) ⟩ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT + ⟨ italic_r start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT roman_sin ( italic_n italic_ϕ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT ) ⟩ start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT end_ARG end_ARG start_ARG ⟨ italic_r start_POSTSUPERSCRIPT italic_n end_POSTSUPERSCRIPT ⟩ end_ARG , (3)

where r𝑟ritalic_r and ϕrsubscriptitalic-ϕ𝑟\phi_{r}italic_ϕ start_POSTSUBSCRIPT italic_r end_POSTSUBSCRIPT are the polar coordinates of participating nucleons Alver and Roland (2010). In hydrodynamics, harmonic flows are responding to eccentricities,

vn=vn(εn,k),subscript𝑣𝑛subscript𝑣𝑛subscript𝜀𝑛𝑘v_{n}=v_{n}(\varepsilon_{n},k),italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT = italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT ( italic_ε start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT , italic_k ) , (4)

where the constant k𝑘kitalic_k is sensitive to the properties of the QGP, such as the transport coefficient η/s𝜂𝑠\eta/sitalic_η / italic_s Yan (2018).

Note that the above definition of vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT according to ψnsubscript𝜓𝑛\psi_{n}italic_ψ start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT is based on the assumptions that the QGP is created and governed by hydrodynamics in relativistic heavy-ion collisions. This has been well demonstrated in large colliding systems, but needs to be clarified in small systems since whether small colliding systems can create the QGP is still debatable. To find differences in the sources and characteristics of collective flow between large and small colliding systems, we will consider the PCN as a new approach to study different EbyE features of collective behavior between small and large colliding systems.

III Training the PCN for classifying two systems

In this section, we introduce the detailed analysis procedures, including the PCN architecture, input, output, training, and evaluation of the PCN.

Input PointCloudN×\times×dinputtransformmatrixinputtransform\begin{matrix}\text{input}\\ \text{transform}\end{matrix}start_ARG start_ROW start_CELL input end_CELL end_ROW start_ROW start_CELL transform end_CELL end_ROW end_ARGN×\times×dMLP(32)sharedN×\times×32featuretransformmatrixfeaturetransform\begin{matrix}\text{feature}\\ \text{transform}\end{matrix}start_ARG start_ROW start_CELL feature end_CELL end_ROW start_ROW start_CELL transform end_CELL end_ROW end_ARGN×\times×32MLP(32,64,512)sharedN×\times×512maxpoolmatrixmaxpool\begin{matrix}\text{max}\\ \text{pool}\end{matrix}start_ARG start_ROW start_CELL max end_CELL end_ROW start_ROW start_CELL pool end_CELL end_ROW end_ARG512MLP(256,128,2)matrixMLP(256,128,2)\begin{matrix}\text{MLP}\\ \text{(256,128,2)}\end{matrix}start_ARG start_ROW start_CELL MLP end_CELL end_ROW start_ROW start_CELL (256,128,2) end_CELL end_ROW end_ARGglobal feature2output scores
Figure 1: The architecture of the devised point cloud network (PCN). In this network, it takes final state N𝑁Nitalic_N points as input, applies input and feature transformations, and then aggregates point features by max-pooling. The output is classification scores for two classes, i.e., Pb +++ Pb or p𝑝pitalic_p +++ Pb collision.

III.1 Network architecture

The architecture of the PCN is shown in Fig.1. It begins with an input alignment network , which initiates the process of aligning the particle clouds in input space, and also enables the model to capture correlations irrespective of orientation in input space. The following is a shared pointwise multilayer perceptron (MLP)111An MLP is a type of feedforward neural network composed of multiple layers of nodes in a directed graph. Here in our model the pointwise MLP refers to an MLP that processes each point from the input independently. implemented by a 1D-convolution neural network (CNN)2221D CNNs are a type of neural network that uses 1D convolution operations to process and transform their input data. to extract 32 feature maps333A feature map, in the context of convolutional neural network in particular, is the output of one convolutional kernel (i.e., filter) applied to the previous layer, which can capture specific patterns or features from the input data, highlighting areas that match the convolutional kernel’s pattern., a feature alignment network, a shared MLP to extract 32, 64 and 512 feature maps, respectively. A global max pooling444A global max pooling operation is retrieving the maximum values of each feature across all particles, summarizing them as a singular global feature representative of the entire particle cloud. then gets the maximum values of each feature among all particles as one global feature of the particle cloud. Finally, a shared MLP implemented by three layers fully connected deep neural network (DNN)555A fully connected DNN is a neural network architecture where each node(i.e.,neuron) in a layer is connected to every node in the subsequent layer, facilitating complex pattern recognition through multiple layers of computation. with 256, 128, and 2 neurons tags each event as p𝑝pitalic_p +++ Pb or Pb +++ Pb collision. Batch normalization666Batch normalization is a technique in deep learning that normalizes the inputs of each layer, to improve training stability and performance by reducing internal covariate shift, for more detail see Ref.Ioffe and Szegedy (2015). layers are present between every convolution layer. The LeakyReLU(α𝛼\alphaitalic_α=0.01) activation function777The LeakyReLU activation function is defined as f(x)=x𝑓𝑥𝑥f(x)=xitalic_f ( italic_x ) = italic_x if x>0𝑥0x>0italic_x > 0 and f(x)=αx𝑓𝑥𝛼𝑥f(x)=\alpha xitalic_f ( italic_x ) = italic_α italic_x otherwise, where α𝛼\alphaitalic_α is a small, positive parameter. is used for all layers except the final layer. The sigmoid activation [σ(x)=1/(1+ex)𝜎𝑥11superscript𝑒𝑥\sigma(x)=1/(1+e^{-x})italic_σ ( italic_x ) = 1 / ( 1 + italic_e start_POSTSUPERSCRIPT - italic_x end_POSTSUPERSCRIPT )] is used on the final layer for binary classification. The models use the Adam optimizerKingma and Ba (2014) with a learning rate of 104superscript10410^{-4}10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT with total decay 104superscript10410^{-4}10 start_POSTSUPERSCRIPT - 4 end_POSTSUPERSCRIPT and categorical cross entropy as the loss function. In addition, a dropoutSrivastava et al. (2014) layers (with drop out probability 0.3) and L2 regularizationNg (2004) are present to tackle the overfitting issue. We use a maximum of 50 epochs with 32 batch size to train the data set. The architecture of the PCN we use is similar to the original architecture as described in Ref.Qi et al. (2016) but less complex. The choice of specific hyperparameters, including the number of nodes and layers, was driven by a combination of empirical experimentation and previous successful implementations in analyzing heavy-ion collision data. Our primary goal is to investigate the identifiability with machine learning perspective of the collective flow characteristics in p𝑝pitalic_p +++ Pb or Pb +++ Pb collisions. Point clouds of particles are processed with transformations order invariant operations to extract global features. While a fully connected deep neural network (DNN) tags each event as p𝑝pitalic_p +++ Pb or Pb +++ Pb collision.

III.2 Input and output of the machine learning

Following the CMS method as described in Ref.Chatrchyan et al. (2013b), the AMPT events for both p𝑝pitalic_p +++ Pb and Pb +++ Pb collisions are grouped into the data sets according to the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT measured in the kinetic window of |η|<𝜂absent|\eta|<| italic_η | < 2.4 and pT>subscript𝑝Tabsentp_{\rm T}>italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT > 0.4 GeV/c. These data are divided into various centrality classes for the colliding systems, which are the Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT bins of 90-120, 120-150, 150-185, 185-230, to compare different systems at similar volumes. Each data set has about 0.3 million events, which are divided into training events, validation events, and test events by the ratio of 5:2:3. The value and error bar of test accuracy are the mean value and the standard deviation of the test data set divided into 100 pieces, respectively. All the information about the final state particles within |η|<𝜂absent|\eta|<| italic_η | < 2.4 in each event is the input to the PCN as a sample, which consists of a list of particles with their information on (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT, pzsubscript𝑝𝑧p_{z}italic_p start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT, E𝐸Eitalic_E) or (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT). Simultaneously, the two true labels, p𝑝pitalic_p +++ Pb and Pb+++Pb, are marked on each event to perform the supervised training.

IV Training results and discussion

In our study, two cases of input data have been investigated. In the case 1, the input for training is an EbyE list of four-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT, pzsubscript𝑝𝑧p_{z}italic_p start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT, E𝐸Eitalic_E) of the selected final state hadrons from the AMPT model. The input data have a dimension of N×4𝑁4N\times 4italic_N × 4, where N𝑁Nitalic_N is the maximum number of selected particles in an event. Events with fewer particles are filled with zeros to maintain the same input dimension. The case 2 is the same as the case 1, but the input is a list of two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT) of selected final state hadrons.

IV.1 Case 1: training with four-momentum of final hadrons

Refer to caption
Figure 2: (Color online) Test accuracy as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by training with four-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT, pzsubscript𝑝𝑧p_{z}italic_p start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT, E𝐸Eitalic_E) of final particles in pseudorapidity ranges -2.4<η<absent𝜂absent<\eta<< italic_η <2.4 (solid curve), 0<η<absent𝜂absent<\eta<< italic_η <2.4 (dashed curve), and -2.4<η<absent𝜂absent<\eta<< italic_η <0 (long-dashed curve).

Figure 2 shows test accuracy as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning four-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT, pzsubscript𝑝𝑧p_{z}italic_p start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT, E𝐸Eitalic_E) of final particles in three different pseudorapidity ranges. All the test accuracies are higher than 99%, which indicates that the two colliding systems are sufficiently distinguished by training the input of four-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT, pzsubscript𝑝𝑧p_{z}italic_p start_POSTSUBSCRIPT italic_z end_POSTSUBSCRIPT, E𝐸Eitalic_E) of final particles.

Refer to caption
Figure 3: (Color online) The pseudorapidity distributions of final hadrons for different Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT classes in Pb+++Pb collisions (solid curve) and p𝑝pitalic_p +++ Pb collisions (dashed curve).

Final particles in three pseudorapidity ranges (-2.4<η<absent𝜂absent<\eta<< italic_η <2.4, -2.4<η<absent𝜂absent<\eta<< italic_η <0 and 0<η<absent𝜂absent<\eta<< italic_η <2.4) were used to train, due to the different pseudorapidity distributions between Pb +++ Pb collisions and p𝑝pitalic_p +++ Pb collisions, which is shown in Fig. 3. The lower hadron yield in the p𝑝pitalic_p -going (forward) direction for p𝑝pitalic_p +++ Pb collisions was expected to induce the sensitivity to the test accuracy of the pseudorapidity range. However, in any given circumstance, the accuracy is high enough with the margin of error, which demonstrates the PCN is capable of identifying two colliding systems on an event-by-event basis, despite almost the same distribution between the two colliding systems in backward pseudorapidity range.

IV.2 Case 2: training with two-momentum of final hadrons

To investigate whether the PCN can learn the difference of collective flow between p+limit-from𝑝p+italic_p +Pb collisions and peripheral Pb +++ Pb collisions, three scenarios of lists are used as training inputs, i.e., two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT), normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT), and normalized meanwhile randomly rotated two-momentum (pxnorm,randsuperscriptsubscript𝑝xnormrandp_{\rm x}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT, pynorm,randsuperscriptsubscript𝑝ynormrandp_{\rm y}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT), respectively. The normalized two-momentum pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT and pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT are defined as,

pxnormsuperscriptsubscript𝑝xnorm\displaystyle p_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT =\displaystyle== pxpT,subscript𝑝xsubscript𝑝T\displaystyle\frac{p_{\rm x}}{p_{\rm T}},divide start_ARG italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT end_ARG ,
pynormsuperscriptsubscript𝑝ynorm\displaystyle p_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT =\displaystyle== pypT,subscript𝑝ysubscript𝑝T\displaystyle\frac{p_{\rm y}}{p_{\rm T}},divide start_ARG italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT end_ARG , (5)

where pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT is the transverse momentum of each particle. The randomly rotated two-momentum pxrandsuperscriptsubscript𝑝xrandp_{\rm x}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT and pyrandsuperscriptsubscript𝑝yrandp_{\rm y}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT are,

pxrandsuperscriptsubscript𝑝xrand\displaystyle p_{\rm x}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT =\displaystyle== px×cosϕrandpy×sinϕrand,subscript𝑝xsubscriptitalic-ϕrandsubscript𝑝ysubscriptitalic-ϕrand\displaystyle p_{\rm x}\times\cos{\phi_{\rm rand}}-p_{\rm y}\times\sin{\phi_{% \rm rand}},italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT × roman_cos italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT × roman_sin italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT ,
pyrandsuperscriptsubscript𝑝yrand\displaystyle p_{\rm y}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT =\displaystyle== px×sinϕrandpy×cosϕrand,subscript𝑝xsubscriptitalic-ϕrandsubscript𝑝ysubscriptitalic-ϕrand\displaystyle p_{\rm x}\times\sin{\phi_{\rm rand}}-p_{\rm y}\times\cos{\phi_{% \rm rand}},italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT × roman_sin italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT × roman_cos italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT , (6)

where ϕrandsubscriptitalic-ϕrand\phi_{\rm rand}italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT is a random angle between 0 and 2π𝜋\piitalic_π. While, the normalized and randomly rotated two-momentum pxnorm,randsuperscriptsubscript𝑝xnormrandp_{\rm x}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT and pynorm,randsuperscriptsubscript𝑝ynormrandp_{\rm y}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT are,

pxnorm,randsuperscriptsubscript𝑝xnormrand\displaystyle p_{\rm x}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT =\displaystyle== px×cosϕrandpy×sinϕrandpT,subscript𝑝xsubscriptitalic-ϕrandsubscript𝑝ysubscriptitalic-ϕrandsubscript𝑝T\displaystyle\frac{p_{\rm x}\times\cos{\phi_{\rm rand}}-p_{\rm y}\times\sin{% \phi_{\rm rand}}}{p_{\rm T}},divide start_ARG italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT × roman_cos italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT × roman_sin italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT end_ARG ,
pynorm,randsuperscriptsubscript𝑝ynormrand\displaystyle p_{\rm y}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT =\displaystyle== px×sinϕrandpy×cosϕrandpT.subscript𝑝xsubscriptitalic-ϕrandsubscript𝑝ysubscriptitalic-ϕrandsubscript𝑝T\displaystyle\frac{p_{\rm x}\times\sin{\phi_{\rm rand}}-p_{\rm y}\times\cos{% \phi_{\rm rand}}}{p_{\rm T}}.divide start_ARG italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT × roman_sin italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT - italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT × roman_cos italic_ϕ start_POSTSUBSCRIPT roman_rand end_POSTSUBSCRIPT end_ARG start_ARG italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT end_ARG . (7)
Refer to caption
Figure 4: (Color online) Test accuracy as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning with input to be two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT) (a) and the normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) (b), without (solid curve) or with (dashed curve) random rotations.

Figure 4(a) shows test accuracy as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT with input to be two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT) and its random rotation (pxrandsuperscriptsubscript𝑝xrandp_{\rm x}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT and pyrandsuperscriptsubscript𝑝yrandp_{\rm y}^{\rm rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_rand end_POSTSUPERSCRIPT) of final particles. Test accuracy of two-momentum input is over 60%percent6060\%60 %, and it will drop less than 2%percent22\%2 %, if two-momentum are randomly rotated. It indicates that besides anisotropic flow there is other discrepancy between the two colliding systems, like in pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT distributions. To eliminate the effect from different pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT distribution of hadron, we train the PCN with normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) and its random rotation (pxnorm,randsuperscriptsubscript𝑝xnormrandp_{\rm x}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT and pynorm,randsuperscriptsubscript𝑝ynormrandp_{\rm y}^{\rm norm,rand}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm , roman_rand end_POSTSUPERSCRIPT) of final particles. The test accuracy as a function of the number of final charged particles is shown in Fig.4(b). Compared to the normal two-momentum training, there is a more than 10% decrease in test accuracy after normalization. This indicates that pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT distribution of hadrons is the main feature that distinguishes the two colliding systems, if we train the PCN by two-momentum. Furthermore, the PCN can not distinguish the two colliding systems by learning normalized two-momentum with random rotations of final hadrons, because these operations eliminate the information about the magnitude of pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT and anisotropic flow. Compared to the training by the normalized two-momentum with random rotations, the test accuracy is improved by a few percentages by learning two-momentum with the normalization only. This discrepancy comes from the difference in anisotropic flow between two colliding systems. However, because the anisotropic flows between two colliding systems are very similar, the PCN can not distinguish the two different colliding systems very well, even though it is better than the case with random rotations.

Refer to caption
Figure 5: (Color online) Test accuracy as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT) of final hadrons in the AMPT model with different parton cross sections.

Figure 5 shows the test accuracy as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning two-momentum (pxsubscript𝑝xp_{\rm x}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT, pysubscript𝑝yp_{\rm y}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT) of final particles with different parton cross sections. It can be seen that all test accuracies increase with the increase of Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT and parton cross section. In addition, the PCN can not distinguish the two colliding systems with 0 mb parton cross section at low Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT. These results can basically be explained by Fig. 6, since the average transverse momentum pTdelimited-⟨⟩subscript𝑝T\left\langle p_{\rm T}\right\rangle⟨ italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT ⟩ can quantify the feature of pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra shape.

Refer to caption
Figure 6: (Color online) Average transverse momenta pTdelimited-⟨⟩subscript𝑝T\left\langle p_{\rm T}\right\rangle⟨ italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT ⟩ in Pb +++ Pb collisions (solid curve) and p +++ Pb collisions (dashed curve) as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT with parton cross section of 0 mb (a), 3 mb (b), and 10 mb (c).

Figure 6 shows average transverse momenta pTdelimited-⟨⟩subscript𝑝T\left\langle p_{\rm T}\right\rangle⟨ italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT ⟩ as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT with different parton cross sections. We can see that the pTdelimited-⟨⟩subscript𝑝T\left\langle p_{\rm T}\right\rangle⟨ italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT ⟩ in p𝑝pitalic_p +++ Pb collisions is larger than that in Pb +++ Pb collisions, and the difference is more significant with a larger parton cross section and higher multiplicity. Although we observe that the difference of pTdelimited-⟨⟩subscript𝑝T\left\langle p_{\rm T}\right\rangle⟨ italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT ⟩ is almost zero for 0 mb parton cross section, some test accuracies are larger than 0.5 for 0 mb in Fig. 5. It indicates that there is a difference in non-flow effect between small and large colliding systems. It could be attributed to jets, because the impact of the jet transverse momentum broadening and multiple scatterings Leonidov et al. (1997) has been found to be stronger in p𝑝pitalic_p +++ Pb collisions than in peripheral Pb +++ Pb collisions Abelev et al. (2009).

Refer to caption
Figure 7: (Color online) Test accuracy as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) of final hadrons in the AMPT model with different parton cross sections.

Figure 7 shows test accuracy as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT by learning normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) of final particles with different parton cross sections. It can be seen that the test accuracy with 0 mb is close to the corresponding result in Fig. 5, and both of them increase with Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT, which indicates that the discrepancy of non-flow effect between the two systems is larger for higher multiplicity. However, the discrepancy between 0 mb and nonzero parton cross sections must come from anisotropic flow. The discrepancy increases with parton cross section, indicating a more significant discrepancy of anisotropic flow between two colliding systems due to more parton scatterings.

Refer to caption
Figure 8: (Color online) Averaged output y^delimited-⟨⟩^𝑦\left\langle\widehat{y}\right\rangle⟨ over^ start_ARG italic_y end_ARG ⟩ of the ensembles of two systems identified by the model trained by normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) of final hadrons in the AMPT model with different parton cross sections.

If an event is identified as p𝑝pitalic_p +++ Pb or Pb +++ Pb, it is marked by the output of y^^𝑦\widehat{y}over^ start_ARG italic_y end_ARG=0 or 1, respectively, in our analysis. Thus, the averaged output y^delimited-⟨⟩^𝑦\left\langle\widehat{y}\right\rangle⟨ over^ start_ARG italic_y end_ARG ⟩ of our model can represent the probability of Pb +++ Pb of an event. In other words, The closer the y^delimited-⟨⟩^𝑦\left\langle\widehat{y}\right\rangle⟨ over^ start_ARG italic_y end_ARG ⟩ is to 0, the more likely the events are p𝑝pitalic_p +++ Pb collisions, and vice versa for Pb +++ Pb collisions. Figure 8 shows the averaged output y^delimited-⟨⟩^𝑦\left\langle\widehat{y}\right\rangle⟨ over^ start_ARG italic_y end_ARG ⟩ of the ensembles of each system identified by the model trained by normalized two-momentum (pxnormsuperscriptsubscript𝑝xnormp_{\rm x}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_x end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT, pynormsuperscriptsubscript𝑝ynormp_{\rm y}^{\rm norm}italic_p start_POSTSUBSCRIPT roman_y end_POSTSUBSCRIPT start_POSTSUPERSCRIPT roman_norm end_POSTSUPERSCRIPT) of final hadrons in the AMPT model with different parton cross sections. It can be seen that y^delimited-⟨⟩^𝑦\left\langle\widehat{y}\right\rangle⟨ over^ start_ARG italic_y end_ARG ⟩ of the ensembles of Pb +++ Pb events is larger than that for p𝑝pitalic_p +++ Pb events in every situation, which indicates that the PCN is able to distinguish two different ensembles of systems, according to the difference of averaged anisotropic flow between two systems.

To investigate the relationship between anisotropic flow and parton cross section, we further calculate the harmonic flow coefficients, e.g., elliptic flow v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, by fitting the long-range part of the two-particle azimuthal correlation function C(Δϕ)𝐶Δitalic-ϕC(\Delta\phi)italic_C ( roman_Δ italic_ϕ ), which defined as

C(Δϕ)=Ysame(Δϕ)Ymixed(Δϕ)×Ymixed(Δϕ)𝑑ΔϕYsame(Δϕ)𝑑Δϕ,𝐶Δitalic-ϕsubscript𝑌𝑠𝑎𝑚𝑒Δitalic-ϕsubscript𝑌𝑚𝑖𝑥𝑒𝑑Δitalic-ϕsubscript𝑌𝑚𝑖𝑥𝑒𝑑Δitalic-ϕdifferential-dΔitalic-ϕsubscript𝑌𝑠𝑎𝑚𝑒Δitalic-ϕdifferential-dΔitalic-ϕC(\Delta\phi)=\frac{Y_{same}(\Delta\phi)}{Y_{mixed}(\Delta\phi)}\times\frac{% \int Y_{mixed}(\Delta\phi)d\Delta\phi}{\int Y_{same}(\Delta\phi)d\Delta\phi},italic_C ( roman_Δ italic_ϕ ) = divide start_ARG italic_Y start_POSTSUBSCRIPT italic_s italic_a italic_m italic_e end_POSTSUBSCRIPT ( roman_Δ italic_ϕ ) end_ARG start_ARG italic_Y start_POSTSUBSCRIPT italic_m italic_i italic_x italic_e italic_d end_POSTSUBSCRIPT ( roman_Δ italic_ϕ ) end_ARG × divide start_ARG ∫ italic_Y start_POSTSUBSCRIPT italic_m italic_i italic_x italic_e italic_d end_POSTSUBSCRIPT ( roman_Δ italic_ϕ ) italic_d roman_Δ italic_ϕ end_ARG start_ARG ∫ italic_Y start_POSTSUBSCRIPT italic_s italic_a italic_m italic_e end_POSTSUBSCRIPT ( roman_Δ italic_ϕ ) italic_d roman_Δ italic_ϕ end_ARG , (8)

where Ysame(Δϕ=ϕ2ϕ1)subscript𝑌𝑠𝑎𝑚𝑒Δitalic-ϕsubscriptitalic-ϕ2subscriptitalic-ϕ1Y_{same}(\Delta\phi=\phi_{2}-\phi_{1})italic_Y start_POSTSUBSCRIPT italic_s italic_a italic_m italic_e end_POSTSUBSCRIPT ( roman_Δ italic_ϕ = italic_ϕ start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT - italic_ϕ start_POSTSUBSCRIPT 1 end_POSTSUBSCRIPT ) and Ymixed(Δϕ)subscript𝑌𝑚𝑖𝑥𝑒𝑑Δitalic-ϕY_{mixed}(\Delta\phi)italic_Y start_POSTSUBSCRIPT italic_m italic_i italic_x italic_e italic_d end_POSTSUBSCRIPT ( roman_Δ italic_ϕ ) are the number of particle pairs at a given ΔϕΔitalic-ϕ\Delta\phiroman_Δ italic_ϕ and within a given pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT range for the same and mixed events. This definition of C(Δϕ)𝐶Δitalic-ϕC(\Delta\phi)italic_C ( roman_Δ italic_ϕ ) removes a trivial dependence on the number of produced particles Adler et al. (2006); Adare et al. (2015); Bzdak and Ma (2014).

Refer to caption
Figure 9: (Color online) Integrated elliptic v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and triangular v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT flow coefficient in Pb +++ Pb collisions (solid curve) and p +++ Pb collisions (dashed curve) as a function of the number of charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT with parton cross section of 0 mb (a), 3 mb (b), and 10 mb (c).

Figure 9 shows v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT with different parton cross sections. It can be observed that v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT increases with parton cross section for Pb +++ Pb collisions. But the dependence on parton cross section is non-monotonous for p𝑝pitalic_p +++ Pb collisions, which has already been found in Ref. Zhao et al. (2023). The most significant difference of v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT between the two systems appears when parton cross section is taken as 10 mb for high-multiplicity events, because there are the largest v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in Pb +++ Pb collisions and relatively small v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT in p𝑝pitalic_p +++ Pb collisions. On the other hand, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT of the two systems are similar for the two parton cross sections of 0 mb and 10 mb. However, there is a more obvious difference of v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT between the two systems with parton cross section of 3 mb, because there are the largest v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT in p𝑝pitalic_p +++ Pb collisions and relatively small v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT in Pb +++ Pb collisions. Surprisingly, the discrepancy of v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT between two ensembles of each system can be captured by the PCN, even the similar Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT and parton cross section dependences are obtained in Figs. 7 and  8, although the test accuracies are not high enough for the PCN to distinguish two systems due to similar EbyE flow distributions which will be shown next.

Refer to caption
Figure 10: (Color online) The EbyE 2-D distributions of v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT vs v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT [P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT)] of Pb +++ Pb (red) and p𝑝pitalic_p +++ Pb (blue) collisions in the AMPT model with the parton cross section of 3 mb (first row) and 10 mb (second row) for the Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT bin of 90<Nch<absentsubscript𝑁𝑐absent<N_{ch}<< italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT <120 (first column), 120<Nch<absentsubscript𝑁𝑐absent<N_{ch}<< italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT <150 (second column), 150<Nch<absentsubscript𝑁𝑐absent<N_{ch}<< italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT <185 (third column), and 185<Nch<absentsubscript𝑁𝑐absent<N_{ch}<< italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT <230 (fourth column).
Refer to caption
Figure 11: (Color online) The percentage of the 2-D overlapping volume between the distribution P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT,v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT) for Pb +++ Pb collisions and that for p𝑝pitalic_p +++ Pb collisions as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT in the AMPT model with the parton cross section of 0 mb (solid curve), 3 mb (dashed curve), and 10 mb (dotted curve).

Figure 10 shows the EbyE 2-D distributions of v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT vs v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT [P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT)] in Pb +++ Pb collisions and p𝑝pitalic_p +++ Pb collisions with parton cross sections of 3 and 10 mb for different Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT classes 888In principle, the EbyE distribution of vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT, P(vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT), should be obtained by an unfolding method to suppress the nonflow contribution. To our knowledge, the response matrix in the unfolding method cannot be reliably obtained for small colliding systems. Therefore, for consistency, we did not use the unfolding method for both large and small colliding systems.. Based on P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT), the percentage of the overlapping volume of P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT) between Pb +++ Pb collisions and p𝑝pitalic_p +++ Pb collisions as a function of the number of final charged particles Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT can be calculated, which is shown in Fig. 11. Note that the result for 0 mb parton cross section is not shown, since vnsubscript𝑣𝑛v_{n}italic_v start_POSTSUBSCRIPT italic_n end_POSTSUBSCRIPT come from non-flow for this case. We can see that the overlapping percentage decreases with parton cross section, which indicates that v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT between two systems are more different with a larger parton cross section. This is also consistent with the result of test accuracy in Fig. 7. It suggests that a more pronounced difference in collective flow between the two colliding systems is produced by more parton collisions. On the other hand, the large overlapping volume percentage (over 80%percent8080\%80 %) indicates that the P(v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT, v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT) distributions between the two systems are so similar that they are difficult to distinguish. This is also in line with the observation that the PCN can not distinguish the two colliding systems very well in EbyE manner, when we train the PCN with input to be the normalized two-momentum of final particles, as shown in Fig. 7.

V Summary and outlook

In summary, we employ the point cloud network to identify the events of p𝑝pitalic_p +++ Pb and peripheral Pb +++ Pb collisions from a multiphase transport model. We reduce the input information for the PCN and observe the resulting changes in accuracy to verify the specific physical features learned by the PCN. Many different features between the two systems are learned and captured by the PCN, such as pseudorapidity distribution, pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra, and anisotropic flow. In four-dimensional momentum space, the point cloud network can well identify the two different colliding systems. In the transverse momentum plane, the point cloud network can learn the different features of pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra that can classify two different colliding systems. After normalizing the transverse momentum of final hadrons, the point cloud network finally distinguishes two different colliding systems according to the feature of collective flow. In this big data and ML approach, by changing the different input types, we confirm that the discrepancy between the two systems is more reflected in the pseudorapidity distribution and the pTsubscript𝑝Tp_{\rm T}italic_p start_POSTSUBSCRIPT roman_T end_POSTSUBSCRIPT spectra than in the anisotropic flow.

Despite these successes, the PCN faces challenges in distinguishing the two systems solely based on event-by-event (EbyE) collective flow, as the EbyE distributions of collective flow parameters P(v2,v3)𝑃subscript𝑣2subscript𝑣3P(v_{2},v_{3})italic_P ( italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT , italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT ) are quite similar between p𝑝pitalic_p +++ Pb and Pb +++ Pb collisions. However, the PCN could differentiate between ensembles of each system through features related to v2subscript𝑣2v_{2}italic_v start_POSTSUBSCRIPT 2 end_POSTSUBSCRIPT and v3subscript𝑣3v_{3}italic_v start_POSTSUBSCRIPT 3 end_POSTSUBSCRIPT, and it also revealed the dependence of these discrepancies on Nchsubscript𝑁𝑐N_{ch}italic_N start_POSTSUBSCRIPT italic_c italic_h end_POSTSUBSCRIPT and parton cross section. Notably, our findings indicate that the differences in collective flow between p𝑝pitalic_p +++ Pb and Pb +++ Pb collisions become more pronounced with larger parton scattering cross sections, consistent with the escape mechanism characteristics for collective flow in the transport model He et al. (2016); Lin et al. (2016).

While our PCN has shown some ability to distinguish between p𝑝pitalic_p +++ Pb and peripheral Pb +++ Pb collisions using event-by-event (EbyE) collective flow analysis, its current effectiveness is somewhat limited. Nevertheless, the application of machine learning has yielded some promising results. This study should be considered a preliminary analysis that lays the foundation for future work involving diverse models and methodologies. However, we emphasize that further research is essential to achieving our ultimate goal, i.e., utilizing machine learning to assist experiments in determining whether small systems also produce the QGP and investigating whether these systems exhibit different characteristics of collective flow compared to large systems. On the other hand, we hope that our PCN study can be applied to classify the origin of atmospheric particle showers, which could improve our understanding of the origins of cosmic rays. Additionally, we also have some prospects for further development of PCN, which holds potential as a valuable tool for various other research interests. For instance, it could be used for searching for the chiral magnetic effect Fukushima et al. (2008); Zhao et al. (2022) and exploring the nuclear deformation structure Jia et al. (2023); Zhao and Ma (2022) in high-energy heavy-ion collisions.

Acknowledgements.
We thank Dr. Ling-Xiao Wang for helpful discussions. This work is supported in part by the National Natural Science Foundation of China under Grants No. 12325507, No.12147101, No. 11890714, No. 11835002, No. 11961131011, No. 11421505, No. 12105054, the National Key Research and Development Program of China under Grant No. 2022YFA1604900, the Strategic Priority Research Program of Chinese Academy of Sciences under Grant No. XDB34030000, and the Guangdong Major Project of Basic and Applied Basic Research under Grant No. 2020B0301030008 (S. G., H.-S.W.and G.-L.M.), the CUHK-Shenzhen university development fund under grant No. UDF01003041 and the BMBF funded KISS consortium (05D23RI1) in the ErUM-Data action plan (K. Z.).

References

  • Qi et al. (2016) C. R. Qi, H. Su, K. Mo, and L. J. Guibas (2016), eprint 1612.00593.
  • Kashefi et al. (2021) A. Kashefi, D. Rempe, and L. J. Guibas, Physics of Fluids 33 (2021).
  • Kashefi and Mukerji (2022) A. Kashefi and T. Mukerji, Journal of Computational Physics 468, 111510 (2022).
  • Baek et al. (2022) H. Baek, W. J. Yun, and J. Kim (2022), eprint 2210.09728.
  • Chen et al. (2021) L. Chen, X. Yao, P. Xu, S. K. Moon, and G. Bi, Virtual and Physical Prototyping 16, 50 (2021).
  • Pang et al. (2018) L.-G. Pang, K. Zhou, N. Su, H. Petersen, H. Stöcker, and X.-N. Wang, Nature Commun. 9, 210 (2018), eprint 1612.04262.
  • Zhou et al. (2019) K. Zhou, G. Endrődi, L.-G. Pang, and H. Stöcker, Phys. Rev. D 100, 011501 (2019), eprint 1810.12879.
  • Graczykowski et al. (2022) L. K. Graczykowski, M. Jakubowska, K. R. Deja, and M. Kabus (ALICE), JINST 17, C07016 (2022), eprint 2204.06900.
  • ZHOU et al. (2022) M. ZHOU, Y. LUO, and H. SONG, Sci. China Phys. Mech. Astron. 52, 252002 (2022).
  • WANG et al. (2022) L. WANG, L. PANG, and K. ZHOU, Sci. China Phys. Mech. Astron. 52, 252003 (2022).
  • HE et al. (2022) W. HE, J. HE, R. Wang, and Y. MA, Sci. China Phys. Mech. Astron. 52, 252004 (2022).
  • DU et al. (2022) Y.-L. DU, D. PABLOS, and K. Tywoniuk, Sci. China Phys. Mech. Astron. 52, 252017 (2022).
  • LI et al. (2023) F. LI, L. PANG, and X. WANG, Nucl. Sci. Tech. 46, 040014 (2023).
  • He et al. (2023) W. He, Q. Li, Y. Ma, Z. Niu, J. Pei, and Y. Zhang, Sci. China Phys. Mech. Astron. 66, 282001 (2023), eprint 2301.06396.
  • Ma et al. (2023) Y.-G. Ma, L.-G. Pang, R. Wang, and K. Zhou, Chin. Phys. Lett. 40, 122101 (2023), eprint 2311.07274.
  • Zhou et al. (2024) K. Zhou, L. Wang, L.-G. Pang, and S. Shi, Prog. Part. Nucl. Phys. 135, 104084 (2024), eprint 2303.15136.
  • Boehnlein et al. (2022) A. Boehnlein et al., Rev. Mod. Phys. 94, 031003 (2022), eprint 2112.02309.
  • Steinheimer et al. (2019) J. Steinheimer, L. Pang, K. Zhou, V. Koch, J. Randrup, and H. Stoecker, JHEP 12, 122 (2019), eprint 1906.06562.
  • Omana Kuttan et al. (2020a) M. Omana Kuttan, J. Steinheimer, K. Zhou, A. Redelbach, and H. Stoecker, Phys. Lett. B 811, 135872 (2020a), eprint 2009.01584.
  • Omana Kuttan et al. (2021) M. Omana Kuttan, J. Steinheimer, K. Zhou, A. Redelbach, and H. Stoecker, Particles 4, 47 (2021).
  • Omana Kuttan et al. (2020b) M. Omana Kuttan, K. Zhou, J. Steinheimer, A. Redelbach, and H. Stoecker, JHEP 21, 184 (2020b), eprint 2107.05590.
  • Huang et al. (2022) Y. Huang, L.-G. Pang, X. Luo, and X.-N. Wang, Phys. Lett. B 827, 137001 (2022), eprint 2107.11828.
  • Adams et al. (2005) J. Adams et al. (STAR), Nucl. Phys. A 757, 102 (2005), eprint nucl-ex/0501009.
  • Adcox et al. (2005) K. Adcox et al. (PHENIX), Nucl. Phys. A 757, 184 (2005), eprint nucl-ex/0410003.
  • Aamodt et al. (2008) K. Aamodt et al. (ALICE), JINST 3, S08002 (2008).
  • Bzdak et al. (2020) A. Bzdak, S. Esumi, V. Koch, J. Liao, M. Stephanov, and N. Xu, Phys. Rept. 853, 1 (2020), eprint 1906.00936.
  • Luo and Xu (2017) X. Luo and N. Xu, Nucl. Sci. Tech. 28, 112 (2017), eprint 1701.02105.
  • Heinz and Snellings (2013) U. Heinz and R. Snellings, Ann. Rev. Nucl. Part. Sci. 63, 123 (2013), eprint 1301.2826.
  • Song and Heinz (2008) H. Song and U. W. Heinz, Phys. Rev. C 77, 064901 (2008), eprint 0712.3715.
  • Jeon and Heinz (2015) S. Jeon and U. Heinz, Int. J. Mod. Phys. E 24, 1530010 (2015), eprint 1503.03931.
  • Shen and Yan (2020) C. Shen and L. Yan, Nucl. Sci. Tech. 31, 122 (2020), eprint 2010.12377.
  • Gale et al. (2013) C. Gale, S. Jeon, and B. Schenke, Int. J. Mod. Phys. A 28, 1340011 (2013), eprint 1301.5893.
  • Yan (2018) L. Yan, Chin. Phys. C 42, 042001 (2018), eprint 1712.04580.
  • Alver and Roland (2010) B. Alver and G. Roland, Phys. Rev. C 81, 054905 (2010), [Erratum: Phys.Rev.C 82, 039903 (2010)], eprint 1003.0194.
  • Ma and Wang (2011) G.-L. Ma and X.-N. Wang, Phys. Rev. Lett. 106, 162301 (2011), eprint 1011.5249.
  • Yan et al. (2014) L. Yan, J.-Y. Ollitrault, and A. M. Poskanzer, Phys. Rev. C 90, 024903 (2014), eprint 1405.6595.
  • Qin et al. (2010) G.-Y. Qin, H. Petersen, S. A. Bass, and B. Muller, Phys. Rev. C 82, 064903 (2010), eprint 1009.1847.
  • Ollitrault (1992) J.-Y. Ollitrault, Phys. Rev. D 46, 229 (1992).
  • Stoecker (2005) H. Stoecker, Nucl. Phys. A 750, 121 (2005), eprint nucl-th/0406018.
  • Khachatryan et al. (2010) V. Khachatryan et al. (CMS), JHEP 09, 091 (2010), eprint 1009.4122.
  • Chatrchyan et al. (2013a) S. Chatrchyan et al. (CMS), Phys. Lett. B 718, 795 (2013a), eprint 1210.5482.
  • Abelev et al. (2013a) B. Abelev et al. (ALICE), Phys. Lett. B 719, 29 (2013a), eprint 1212.2001.
  • Aad et al. (2013a) G. Aad et al. (ATLAS), Phys. Rev. Lett. 110, 182302 (2013a), eprint 1212.5198.
  • Sirunyan et al. (2018) A. M. Sirunyan et al. (CMS), Phys. Rev. C 98, 044902 (2018), eprint 1710.07864.
  • Acharya et al. (2019) S. Acharya et al. (ALICE), Phys. Rev. Lett. 123, 142301 (2019), eprint 1903.01790.
  • Adare et al. (2015) A. Adare et al. (PHENIX), Phys. Rev. Lett. 114, 192301 (2015), eprint 1404.7461.
  • Adamczyk et al. (2015) L. Adamczyk et al. (STAR), Phys. Lett. B 747, 265 (2015), eprint 1502.07652.
  • Aidala et al. (2019) C. Aidala et al. (PHENIX), Nature Phys. 15, 214 (2019), eprint 1805.02973.
  • Acharya et al. (2022) U. A. Acharya et al. (PHENIX), Phys. Rev. C 105, 024901 (2022), eprint 2107.06634.
  • Abdulameer et al. (2023) N. J. Abdulameer et al. (PHENIX), Phys. Rev. C 107, 024907 (2023), eprint 2203.09894.
  • Dusling et al. (2016) K. Dusling, W. Li, and B. Schenke, Int. J. Mod. Phys. E 25, 1630002 (2016), eprint 1509.07939.
  • Loizides (2016) C. Loizides, Nucl. Phys. A 956, 200 (2016), eprint 1602.09138.
  • Nagle and Zajc (2018) J. L. Nagle and W. A. Zajc, Ann. Rev. Nucl. Part. Sci. 68, 211 (2018), eprint 1801.03477.
  • Bozek (2012) P. Bozek, Phys. Rev. C 85, 014911 (2012), eprint 1112.0915.
  • Bzdak et al. (2013) A. Bzdak, B. Schenke, P. Tribedy, and R. Venugopalan, Phys. Rev. C 87, 064906 (2013), eprint 1304.3403.
  • Shuryak and Zahed (2013) E. Shuryak and I. Zahed, Phys. Rev. C 88, 044915 (2013), eprint 1301.4470.
  • Qin and Müller (2014) G.-Y. Qin and B. Müller, Phys. Rev. C 89, 044902 (2014), eprint 1306.3439.
  • Bozek and Broniowski (2013) P. Bozek and W. Broniowski, Phys. Rev. C 88, 014903 (2013), eprint 1304.3044.
  • Bozek et al. (2015) P. Bozek, A. Bzdak, and G.-L. Ma, Phys. Lett. B 748, 301 (2015), eprint 1503.03655.
  • Song et al. (2017) H. Song, Y. Zhou, and K. Gajdosova, Nucl. Sci. Tech. 28, 99 (2017), eprint 1703.00670.
  • Lin et al. (2005) Z.-W. Lin, C. M. Ko, B.-A. Li, B. Zhang, and S. Pal, Phys. Rev. C 72, 064901 (2005), eprint nucl-th/0411110.
  • Chen et al. (2006) J. H. Chen, Y. G. Ma, G. L. Ma, X. Z. Cai, Z. J. He, H. Z. Huang, J. L. Long, W. Q. Shen, C. Zhong, and J. X. Zuo, Phys. Rev. C 74, 064902 (2006).
  • Lin (2014) Z.-W. Lin, Phys. Rev. C 90, 014904 (2014), eprint 1403.6321.
  • Ma and Lin (2016) G.-L. Ma and Z.-W. Lin, Phys. Rev. C 93, 054911 (2016), eprint 1601.08160.
  • Bzdak and Ma (2014) A. Bzdak and G.-L. Ma, Phys. Rev. Lett. 113, 252301 (2014), eprint 1406.2804.
  • Orjuela Koop et al. (2015) J. D. Orjuela Koop, A. Adare, D. McGlinchey, and J. L. Nagle, Phys. Rev. C 92, 054903 (2015), eprint 1501.06880.
  • Ma and Bzdak (2016) G.-L. Ma and A. Bzdak, Nucl. Phys. A 956, 745 (2016).
  • He et al. (2016) L. He, T. Edmonds, Z.-W. Lin, F. Liu, D. Molnar, and F. Wang, Phys. Lett. B 753, 506 (2016), eprint 1502.05572.
  • Lin et al. (2016) Z.-W. Lin, L. He, T. Edmonds, F. Liu, D. Molnar, and F. Wang, Nucl. Phys. A 956, 316 (2016), eprint 1512.06465.
  • Ma et al. (2021) L. Ma, G.-L. Ma, and Y.-G. Ma, Phys. Rev. C 103, 014908 (2021), eprint 2102.01872.
  • Wang and Ma (2022) H.-S. Wang and G.-L. Ma, Phys. Rev. C 106, 064907 (2022), eprint 2208.06854.
  • Aad et al. (2013b) G. Aad et al. (ATLAS), JHEP 11, 183 (2013b), eprint 1305.2942.
  • Adams et al. (2006) J. Adams et al. (STAR), Phys. Rev. D 74, 032006 (2006), eprint nucl-ex/0606028.
  • Abelev et al. (2014) B. B. Abelev et al. (ALICE), Phys. Lett. B 728, 25 (2014), eprint 1307.6796.
  • Adam et al. (2015) J. Adam et al. (ALICE), Phys. Rev. C 91, 064905 (2015), eprint 1412.6828.
  • Acharya et al. (2020) S. Acharya et al. (ALICE), Eur. Phys. J. C 80, 693 (2020), eprint 2003.02394.
  • Abelev et al. (2009) B. I. Abelev et al. (STAR), Phys. Rev. C 79, 034909 (2009), eprint 0808.2041.
  • Abelev et al. (2013b) B. B. Abelev et al. (ALICE), Phys. Lett. B 727, 371 (2013b), eprint 1307.1094.
  • Adcox et al. (2004) K. Adcox et al. (PHENIX), Phys. Rev. C 69, 024904 (2004), eprint nucl-ex/0307010.
  • Erdmann et al. (2018) M. Erdmann, J. Glombitza, and D. Walz, Astropart. Phys. 97, 46 (2018), eprint 1708.00647.
  • Erdmann et al. (2019) M. Erdmann, F. Schlüter, and R. Smida, JINST 14, P04005 (2019), eprint 1901.04079.
  • Lingenfelter (2019) R. E. Lingenfelter, Astrophys. J. Suppl. 245, 30 (2019), eprint 1903.06330.
  • Tokuno et al. (2012) H. Tokuno et al., Nucl. Instrum. Meth. A 676, 54 (2012), eprint 1201.0002.
  • Abraham et al. (2004) J. Abraham et al. (Pierre Auger), Nucl. Instrum. Meth. A 523, 50 (2004).
  • Gyulassy and Wang (1994) M. Gyulassy and X.-N. Wang, Comput. Phys. Commun. 83, 307 (1994), eprint nucl-th/9502021.
  • Lin and Ko (2002) Z.-w. Lin and C. M. Ko, Phys. Rev. C 65, 034904 (2002), eprint nucl-th/0108039.
  • Zhang (1998) B. Zhang, Comput. Phys. Commun. 109, 193 (1998), eprint nucl-th/9709009.
  • He and Lin (2017) Y. He and Z.-W. Lin, Phys. Rev. C 96, 014910 (2017), eprint 1703.02673.
  • Lin and Zheng (2021) Z.-W. Lin and L. Zheng, Nucl. Sci. Tech. 32, 113 (2021), eprint 2110.02989.
  • Li and Ko (1995) B.-A. Li and C. M. Ko, Phys. Rev. C 52, 2037 (1995), eprint nucl-th/9505016.
  • Voloshin and Zhang (1996) S. Voloshin and Y. Zhang, Z. Phys. C 70, 665 (1996), eprint hep-ph/9407282.
  • Ioffe and Szegedy (2015) S. Ioffe and C. Szegedy (2015), eprint 1502.03167.
  • Kingma and Ba (2014) D. P. Kingma and J. Ba (2014), eprint 1412.6980.
  • Srivastava et al. (2014) N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov, Journal of Machine Learning Research 15, 1929 (2014).
  • Ng (2004) A. Y. Ng, in Proceedings of the Twenty-First
    International Conference on Machine Learning
    (Association for Computing Machinery, New York, NY, USA, 2004), ICML ’04, p. 78, ISBN 1581138385.
  • Chatrchyan et al. (2013b) S. Chatrchyan et al. (CMS), Phys. Lett. B 724, 213 (2013b), eprint 1305.0609.
  • Leonidov et al. (1997) A. Leonidov, M. Nardi, and H. Satz, Z. Phys. C 74, 535 (1997).
  • Adler et al. (2006) S. S. Adler et al. (PHENIX), Phys. Rev. Lett. 97, 052301 (2006), eprint nucl-ex/0507004.
  • Zhao et al. (2023) X.-L. Zhao, Z.-W. Lin, L. Zheng, and G.-L. Ma, Phys. Lett. B 839, 137799 (2023), eprint 2112.01232.
  • Fukushima et al. (2008) K. Fukushima, D. E. Kharzeev, and H. J. Warringa, Phys. Rev. D 78, 074033 (2008), eprint 0808.3382.
  • Zhao et al. (2022) Y.-S. Zhao, L. Wang, K. Zhou, and X.-G. Huang, Phys. Rev. C 106, L051901 (2022), eprint 2105.13761.
  • Jia et al. (2023) J. Jia, G. Giacalone, and C. Zhang, Phys. Rev. Lett. 131, 022301 (2023), eprint 2206.10449.
  • Zhao and Ma (2022) X.-L. Zhao and G.-L. Ma, Phys. Rev. C 106, 034909 (2022), eprint 2203.15214.