Abstract
New sets of parameter tunes for two of the colour reconnection models, quantum chromodynamics-inspired and gluon-move, implemented in the pythia 8 event generator, are obtained based on the default CMS pythia 8 underlying-event tune, CP5. Measurements sensitive to the underlying event performed by the CMS experiment at centre-of-mass energies \(\sqrt{s}=7\) and 13\(\,\text {Te\hspace{-.08em}V}\), and by the CDF experiment at 1.96\(\,\text {Te\hspace{-.08em}V}\) are used to constrain the parameters of colour reconnection models and multiple-parton interactions simultaneously. The new colour reconnection tunes are compared with various measurements at 1.96, 7, 8, and 13\(\,\text {Te\hspace{-.08em}V}\) including measurements of the underlying-event, strange-particle multiplicities, jet substructure observables, jet shapes, and colour flow in top quark pair (\({{\text {t}} {}{\bar{\text {t}}}}\)) events. The new tunes are also used to estimate the uncertainty related to colour reconnection modelling in the top quark mass measurement using the decay products of \({{\text {t}} {}{\bar{\text {t}}}}\) events in the semileptonic channel at 13\(\,\text {Te\hspace{-.08em}V}\).
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Avoid common mistakes on your manuscript.
1 Introduction
Monte Carlo (MC) event generators, such as pythia 8 [1], are indispensable tools for measurements at the LHC proton–proton (pp) collider. To provide an accurate description of high-energy collisions, both the hard scattering and the so-called underlying event (UE) are computed for each simulated event. In the hard scattering process, two initial partons interact with a large exchange of transverse momentum, \(p_{\textrm{T}} > \mathcal {O}(\text {Ge\hspace{-.08em}V} {})\) (we use natural units with \(c=1\) throughout the paper). The UE represents additional activity occurring at lower energy scales that accompany the hard scattering. It consists of multiple-parton interactions (MPIs), initial- and final-state radiation (ISR and FSR), and beam-beam remnants (BBR). According to Quantum Chromodynamics (QCD), strong interactions are affected by colour charges that are carried by quarks and gluons. All of the coloured partons produced by these components are finally combined to form colourless hadrons through the hadronisation process.
Particularly relevant for the characterisation of the UE are the MPIs, which consist of additional 2-to-2 parton–parton interactions occurring within the single collision event. With increasing collision energy, the interaction probability for partons with small longitudinal momentum fractions also increases, which enhances MPI contributions.
The pythia 8 generator regularises the cross sections of the primary hard scattering processes and MPIs with respect to the perturbative 2-to-2 parton–parton differential cross section through an energy-dependent dampening parameter \(p_\textrm{T0}\), which depends on the centre-of-mass energy \(\sqrt{s}\). The energy dependence of the \(p_\textrm{T0}\) parameter in pythia 8 is described with a power law function of the form
where \(p_\textrm{T0} ^{\text {ref}}\) is the value of \(p_\textrm{T0}\) at a reference energy \(\sqrt{s_0}\), and \(\epsilon \) is a tunable parameter that determines the energy dependence. At a given \(\sqrt{s}\), the mean number of additional interactions from MPI depends on \(p_\textrm{T0}\), the parton distribution functions (PDFs), and the overlap of the matter distributions of the two colliding hadrons [2].
To track the colour information during the development of the parton shower, partons are represented and also connected by colour lines. Quarks and antiquarks are represented by colour lines with arrows pointing in the direction of the colour flow, and gluons are represented by a pair of colour lines with opposite arrows. Rules for colour propagation are shown in Fig. 1. Because each MPI system adds coloured partons to the final state, a dense net of colour lines that overlap with the coloured parton fields of the hard scattering and with each other is created. Parton shower algorithms, in general, use the leading colour (LC) approximation [3, 4] in which each successively emitted parton is colour connected only to its parent emitters in the limit of infinite number of colours. Colour reconnection (CR) models allow colour lines to be formed between partons also from different interactions and thus allow different colour topologies compared with a simple LC approach.
The CR was first included in minimum-bias (MB) simulations (see Sect. 4.1) to reproduce the increase of average transverse momentum \(\langle p_{\textrm{T}} \rangle \) of charged particles as a function of the measured multiplicity of the charged particles, \(N_{\textrm{ch}}\), and also to describe the \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) distribution [6, 7]. The pseudorapidity is defined as \(\eta =-\ln [\tan (\theta /2)]\), where the polar angle \(\theta \) is defined with respect to the anticlockwise-beam direction. Introducing correlations between partons, including those also resulting from MPIs, generally changes the number of charged particles in an event and allows a more realistic simulation of \(N_{\textrm{ch}}\), and \(\langle p_{\textrm{T}} \rangle \) vs \(N_{\textrm{ch}}\) distributions than in an event scenario without CR [7].
The CR effects are also important for processes occurring at larger scales in pp collisions. For example, in \({{\text {t}} {}{\bar{\text {t}}}}\) events, the inclusion of CR effects can lead to a significant improvement in the description of UE variables [8]. The effects of CR may become more prominent in precision measurements, such as the top quark mass \(m_{{\text {t}}}\). Uncertainties in \(m_{{\text {t}}}\) related to CR are usually estimated from comparing the prediction of a given model with and without CR, which might underestimate their effect [9]. A better way to approach the uncertainty estimation would be to consider a variety of CR models and variations of their parameters [10] that probe the effects of the underlying soft physics of pp collisions on the relevant observable.
Various phenomenological models for CR have been developed and are included in pythia 8. In these models, the general idea is to determine the partonic configuration that reproduces the minimal total string length. In the Lund string fragmentation model [11] used in pythia 8, the confining colour field between two partons is approximated by a one-dimensional string stretched between the partons according to the colour flow. The fragmentation of a string with a probability given by the fragmentation function produces a set of hadrons. Thus, the colour flow of an event determines the string configuration and therefore hadronic production.
None of the MPI processes or the CR models are completely determined from first principles, and they all include free parameters. A specified set of such parameters that is adjusted to better fit some aspects of the data is referred to as a “tune”. It is possible to derive a tune that describes the data at a particular \(\sqrt{s}\). However, such a model, without energy dependence, will be biased and cannot provide any reliable information about other \(\sqrt{s}\). Thus, whenever the collision energy (\(\sqrt{s}\)) has changed, additional constraints on the models must be applied using the information obtained from the new measurements. This is not a straightforward procedure since no single tune can describe all the data with the same precision. The default CMS pythia 8 tune CUETP8M1 for 7\(\,\text {Te\hspace{-.08em}V}\) [12] was derived using the inputs from the 0.9, 1.96 and 7\(\,\text {Te\hspace{-.08em}V}\) measurements, and it describes the data at 7\(\,\text {Te\hspace{-.08em}V}\) quite well. The default CMS pythia 8 tune CP5, where CP stands for “CMS pythia 8 ” for 13\(\,\text {Te\hspace{-.08em}V}\) [13] was derived using the inputs from the 1.96, 7 and 13\(\,\text {Te\hspace{-.08em}V}\) measurements. The CUETP8M1 describes data at 7\(\,\text {Te\hspace{-.08em}V}\) better than CP5, but the overall performance of CP5 is much better than CUETP8M1 when 13\(\,\text {Te\hspace{-.08em}V}\) data are also included.
This paper presents results from two tunes, which make use of the QCD-inspired [14] and the gluon-move [9] CR models. The new CR tunes presented are based on the default CMS pythia 8 tune CP5. Along with the CP5 tune, which is derived from the MPI-based CR model, the performance of the new CR tunes (CP5-CR1 and CP5-CR2 defined below) is studied using several observables. These tunes can be used for the evaluation of the uncertainties due to CR effects, and deepening the understanding of the CR mechanism.
The paper is organised as follows. In Sect. 2, the different colour reconnection models implemented in pythia 8 and used in this study are introduced. In Sect. 3, the tuning strategy is explained in detail and the parameters of the new tunes are presented. Section 4 shows a selection of validation plots related to observables measured at \(\sqrt{s}=1.96\), 7, 8, and 13\(\,\text {Te\hspace{-.08em}V}\) by various experiments compared with the predictions of the new tunes. In Sect. 5 a study of the uncertainty in the top quark mass \(m_{{\text {t}}}\) measurement because of the CR modelling is presented before summarising the results in Sect. 6.
2 Colour reconnection models
The MPI-based CR model was the only CR model implemented in pythia 8 until pythia 8.2, which was released with two additional CR models. The models implemented in pythia 8.2, referred to as the “MPI-based”, “QCD-inspired”, and “gluon-move” CR models, are briefly described in the following:
-
MPI-based model (CP5): The simplest model [6, 15] implemented in MC event generators introduces only one tunable parameter. In this model, the partons are classified according to the MPI system to which they belong. Each parton interaction is originally a \(2\rightarrow 2\) scattering. For an MPI system with a hardness scale \(p_{\textrm{T}}\) of the \(2\rightarrow 2\) interaction, a CR probability is defined as:
$$\begin{aligned} P = \frac{p_{\textrm{T}_{\textrm{Rec}}} ^2}{(p_{\textrm{T}_{\textrm{Rec}}} ^2+p_{\textrm{T}} ^2)}, \end{aligned}$$(2)with \(p_{\textrm{T}_{\textrm{Rec}}} = r p_\textrm{T0} \), where r is a tunable parameter and \(p_\textrm{T0}\) is the energy-dependent dampening parameter defined in Eq. (1). The parameter \(p_\textrm{T0}\) avoids a divergence of the partonic cross section at low \(p_{\textrm{T}}\). According to Eq. (2), MPI systems at high \(p_{\textrm{T}}\) would tend to escape from the interaction point, without being colour reconnected to the hard scattering system. Colour fields originating from a low-\(p_{\textrm{T}}\) MPI system would instead more likely exchange colour. Once the systems to be connected are determined, partons of low-\(p_{\textrm{T}}\) systems are added to strings defined by the highest \(p_{\textrm{T}}\) system to achieve a minimal total string length.
-
QCD-inspired model (CP5-CR1): The QCD-inspired model [14] implemented in pythia 8 adds the QCD colour rules on top of the minimisation of the string length. The model constructs all pairs of QCD dipoles allowed to be reconnected by QCD colour rules that determine the colour compatibility of two strings. This is done iteratively until none of the allowed reconnection possibilities result in a shortening of the total string length. It uses a simple picture to causally connect the produced strings in spacetime through a string length measure \(\lambda \) to determine favoured reconnections. The default parametrisation for \(\lambda \) is
$$\begin{aligned} \lambda = \ln \left( 1+\sqrt{2}\frac{E_1}{m_0}\right) + \ln \left( 1+\sqrt{2}\frac{E_2}{m_0}\right) , \end{aligned}$$(3)where \(E_1\) and \(E_2\) represent the energies of the coloured partons in the rest frame of the QCD dipole, and \(m_0\) is a constant with the dimension of energy [14]. In addition, the QCD-inspired model allows us to create junction structures. A junction is a topological structure and is formed when three colour lines meet at a single point. The presence of junctions reduces the number of colour lines that need to be connected to the beam remnant, which in turn can affect the number of particles produced in a collision. Since the QCD-inspired CR model allows for different color topologies beyond LC, it can successfully describe the baryon production measured at the CMS experiment [14, 16], which is not the case for previously available pythia 8 tunes.
-
Gluon-move model (CP5-CR2): In this scheme [9], final-state gluons are identified along with all the colour-connected pairs of partons. Then an iterative process starts. The difference between string lengths when a final-state gluon belonging to two connected partons is moved to another connected two-parton system is calculated. The gluon is moved to the string for which the move gives the largest reduction in total string length. This procedure can be repeated for all or a fraction of the gluons in the final state, which is controlled by the pythia 8 parameter ColourReconnection:fracGluon. In this scheme, quarks would not be reconnected, i.e. they would remain in the same position without any colour exchange. To improve this picture, the flip mechanism of the gluon-move model can be included. The flip mechanism basically allows reconnection of two different string systems, i.e. a quark can connect to a different antiquark. Junctions (Y-shaped three-quark configurations) are allowed to take part in the flip step as well, but no considerable differences are expected due to the limitation of the junction formation in this model. The flip mechanism has not been extensively studied and its effect on diffractive events is not known. For this reason the flip mechanism is switched off in pythia 8 and not used in this paper. The main free parameters of the gluon-move model account for the lower limit of the string length allowed for colour reconnection, the fraction of gluons allowed to move, and the lower limit of the allowed reduction of the string lengths.
In addition to these models, the effects of early resonance decay (ERD) [9] in top quark decays are also studied. With this option, top quark decay products are allowed to participate directly in CR. Normally the ERD option is switched off in pythia 8 but in Sect. 4.5 we investigate the ERD effects.
Usually, MPI and CR effects are investigated and constrained using fits to measurements sensitive to the UE in hadron collisions. The UE measurements have been performed at various collision energies by ATLAS, CMS, and CDF Collaborations [17,18,19,20,21]. The measurements are typically performed by studying the multiplicity and the scalar \(p_{\textrm{T}}\) sum of the charged particles (\(p_{\textrm{T}} ^{\text {sum}}\)), measured as a function of the \(p_{\textrm{T}}\) of the leading charged particle in the event, \(p_{\textrm{T}} ^{\text {max}}\).
Different regions of the plane transverse to the direction of the beams are defined by the direction of the leading charged particle. A sketch of the different regions is shown in Fig. 2. A “toward” region includes mainly the products of the hard scattering, whereas the “away” region includes the recoiling objects belonging to the hard scattering. The two “transverse” regions contain the products of MPIs and are affected by contributions from ISR and FSR.
In Refs. [17, 18, 21], the transverse region is further subdivided into “transMIN” and “transMAX”, defined to be the regions with the minimum and maximum number of particles between the two transverse regions. This is done to disentangle contributions from MPI, ISR, and FSR. For events with large ISR or FSR, the transMAX region contains at least one “transverse-side” jet, whereas both the transMAX and transMIN regions contain particles from the MPI and BBR. Thus, the transMIN region is sensitive to MPI and BBR, whereas the difference between transMAX and transMIN (referred to as the transDIFF region) is sensitive to ISR and FSR.
The CMS Collaboration showed that a consistent description of the \(N_{\textrm{ch}}\) and the \(p_{\textrm{T}} ^{\text {sum}}\) distributions is not possible using only the pythia 8 hadronisation model without taking into account the CR effects [12]. In general, the largest difference between the predictions from tunes and the data is observed in the soft region (\(p_{\textrm{T}} \sim 2\text {--}5\,\text {Ge\hspace{-.08em}V} \)), where CR effects are expected to be more relevant.
The new CR models, QCD-inspired and gluon-move, were implemented in pythia 8.226 after tuning the model parameters to the existing data at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) and at lower centre-of-mass energies [9, 14]. The models were tuned to different data sets starting from different baseline tune settings. The model predictions, with their default parameter settings in pythia 8.226 and CP5, are given in Fig. 3 for \(N_{\textrm{ch}}\) and \(p_{\textrm{T}} ^{\text {sum}}\) densities measured by the CMS experiment at 13\(\,\text {Te\hspace{-.08em}V}\) [17] in the transMIN and transMAX regions, and in Fig. 4 for the \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) distribution measured by CMS at 13\(\,\text {Te\hspace{-.08em}V}\) [22]. In these figures, the data points, shown in black, are well described by CP5. The predictions for CP5-“QCD-inspired” and CP5-“gluon-move” were obtained by replacing the MPI-based CR model in CP5 with the QCD-inspired and gluon-move CR model, respectively. As mentioned earlier, these models were tuned to data at 7\(\,\text {Te\hspace{-.08em}V}\) and at lower centre-of-mass energies. The comparisons show that the models must be retuned to describe the underlying soft physics of pp collisions at 13\(\,\text {Te\hspace{-.08em}V}\).
3 The new CMS colour reconnection tunes
A new set of event tunes, based on UE data from the CMS and CDF experiments, are derived using the QCD-inspired and the gluon-move CR models, as implemented in the pythia 8.226 event generator. Having tunes for different CR models allows a consistent way of evaluating systematic uncertainties because of colour reconnection effects in specific measurements. The Rivet 2.4.0 [24] routines used as inputs to the fits, as well as the centre-of-mass energy values and the names of the Rivet distributions, the x-axis ranges (fit ranges), and the relative importance (R) of the distributions are displayed in Table 1 for the tunes CP5-CR1 and CP5-CR2. The CP5 tune is used as a baseline for the CR tuning since it is the default pythia 8 tune for most of the new CMS analyses using data at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) published since 2017, and it has explicitly been tested against a large number of different final states (MB, QCD, top quark, and vector boson + jets) and observables [13].
The parameters and their ranges in the fits are shown in Table 2. The minimum and maximum values of the parameters are first taken from pythia 8, then the ranges of the values are further limited using the Professor 1.4.0 software [25]. The ranges are chosen such that the sampled MC space does not destroy the definition of a particular observable in the fits.
Tune CP5 uses the next-to-next-to-leading order (NNLO) NNPDF31_nnlo_as_0118 [26] PDF set, the strong coupling parameter \(\alpha _\textrm{S}\) value of 0.118 for ISR, FSR, and MPI, and the MPI-based CR model. It also uses a double-Gaussian functional form with two tunable parameters, coreRadius and coreFraction, to model the overlap of the matter distribution of the two colliding protons [6]. The tune parameters are documented in Ref. [13] and displayed in Table 3. Also in Figs. 11 and 12 in Ref. [13], predictions of the CP5 tune are compared with event shape observables measured at LEP. The results show that a value of \(\alpha _\textrm{S} ^\textrm{FSR}(m_\text {Z})\) \(\sim \)0.120 better describes the data compared with higher values of \(\alpha _\textrm{S} ^\textrm{FSR}(m_\text {Z})\) which generally overestimates the number of final-state partons. As concluded in the Ref. [13], LEP event shape observables are well described by MadGraph 5_amc@nlo + pythia 8 with CP5.
The new tunes are obtained by constraining simultaneously the parameters controlling the contributions of the MPI and of each of the CR models. The strategy followed to obtain the CP5-CR1 and CP5-CR2 tunes is similar to that used for the CP5 tune, i.e. the same observables sensitive to MPI are considered to constrain the parameters. These are the \(N_{\textrm{ch}}\) and average \(p_{\textrm{T}} ^{\text {sum}}\) as functions of the leading charged particle transverse momentum \(p_{\textrm{T}} ^{\text {max}}\), measured in the transMIN and transMAX regions by the CMS experiment at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) [17] and 7\(\,\text {Te\hspace{-.08em}V}\) [19] and by the CDF experiment at 1.96\(\,\text {Te\hspace{-.08em}V}\) [21]. The \(N_{\textrm{ch}}\) as a function of \(\eta \), measured by CMS at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) [22] is also used in the fit. In Ref. [17], the transMIN and transMAX regions are defined with respect to both the leading charged particles and the leading charged-particle jets as reference objects. The uncertainty in measurements using leading charged particles as reference objects is lower than the uncertainty in measurements using leading charged-particle jets as reference objects. This is one of the reasons why we choose to use leading charged particle observables instead of leading charged-particle jet observables in the fits. Another reason is that we want to use the same observables that were used to derive CP5, and CP5 was derived using leading charged particle observables in the fits. As a cross-check, we also derived another version of the CP5-CR1 tune using leading charged-particle jet observables, such as \(N_{\textrm{ch}}\) and average \(p_{\textrm{T}} ^{\text {sum}}\), as functions of the transverse momentum of the leading charged-particle jets in the fits. The results showed that the use of leading charged-particle jet observables in the fits makes a very small difference, which is negligible when tune uncertainties are taken into account. As for CP5, the region with \(p_{\textrm{T}} ^{\text {max}}\) between 0.5 and 2.0 or 3.0\(\,\text {Ge\hspace{-.08em}V}\) is excluded depending on the distribution from the fit, since this region is affected by diffractive processes whose free parameters are not considered in the tuning procedure.
The MPI-related parameters that are kept free in both the CP5-CR1 and CP5-CR2 tunes are:
-
MultipartonInteractions:pT0Ref, the parameter \(p_\textrm{T0} ^{\text {ref}}\) included in the regularisation of the partonic QCD cross section as described in Eq. (1). It sets the lower cutoff scale for MPIs;
-
MultipartonInteractions:ecmPow, the exponent \(\epsilon \) of the \(\sqrt{s}\) dependence as shown in Eq. (1);
-
MultipartonInteractions:coreRadius, the width of the core when a double-Gaussian matter profile is assumed for the overlap distribution between the two colliding protons [6]. A double-Gaussian form identifies an inner, dense part, which is called core, and an outer, less dense part;
-
MultipartonInteractions:coreFraction, the fraction of quarks and gluons contained in the core when a double-Gaussian matter profile is assumed.
The tunable CR parameters in CP5-CR1 that are considered in the fit are:
-
ColourReconnection:m0, the variable that determines whether a possible reconnection is actually favoured in the \(\lambda \) measure in Eq. (3);
-
ColourReconnection:junctionCorrection, the multiplicative correction for junction formation, applied to the m0 parameter;
-
ColourReconnection:timeDilationPar, the parameter controlling the time dilation that forbids colour reconnection between strings that are not in causal contact.
More details on these parameters are reported in Ref. [1]. For the CP5-CR1 tune, the parameters related to the hadronisation, StringZ:aLund, StringZ:bLund, StringFlav:probQQtoQ, and StringFlav:probStoUD, proposed in Ref. [14], are also used as fixed inputs to the tune. The first two of these parameters govern the longitudinal fragmentation function used in the Lund string model in pythia 8, whereas the latter two are the probability of diquark over quark fragmentation, and the ratio of strange to light quark production, respectively.
For the optimisation of CP5-CR2, the following parameters are considered:
-
ColourReconnection:m2lambda, an approximate hadronic mass-square scale and the parameter used in the calculation of \(\lambda \);
-
ColourReconnection:fracGluon, the probability that a given gluon will be moved. It thus gives the average fraction of gluons being considered.
The remaining parameters of pythia 8 are kept the same as in the CP5 tune.
The fits are performed using the Professor 1.4.0 software, which takes random values for each parameter in the defined multidimensional parameter space, and Rivet, which provides the data points and uncertainties, and produces the individual generator predictions for the considered observables. About 200 different choices of parameters are considered to build a random grid in the parameter space. For each choice of parameters, one million pp inelastic scattering events, including contributions from single-diffractive dissociation (SD), double-diffractive dissociation (DD), central diffraction (CD), and nondiffractive (ND) processes, are generated. The bin-by-bin envelopes of the different MC predictions are checked. After building the grid in the parameter space, Professor performs an interpolation of the bin values for the observables in the parameter space using a third-order polynomial function. We verified that the degree of the polynomial used for the interpolation does not affect the tune results significantly. The function \(f^{b}(\textbf{p})\) models the MC response of each bin b of the observable O as a function of the parameter vector \(\textbf{p}\). The final step is the minimisation of the \(\chi ^{*2}\) function given by:
where \(\mathcal {R}_{b}\) is the data value for each bin b, and \(\varDelta _{b}^2\) expresses the total bin uncertainty of the data.
The \(\chi ^{*2}\) is not a true \(\chi ^2\) function as explained in the following. Treating equally all distributions that are used as inputs to the fit for the CP5-CR2 tune results in a tune that describes the data poorly; in particular, it underestimates the \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) distribution measured in data at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) by about 30%. This is because the \(\chi ^2\) definition treats all bins equally and the importance of \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) may be lost because of its relatively low precision with respect to other observables. The \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) distribution is one of the key observables that is sensitive to a number of processes and, therefore, increasing the importance of this observable in the fit is reasonable.
In Professor, this is done by using weights with a nonstandard \(\chi ^2\) definition. To keep the standard properties of a \(\chi ^2\) fit, we increase the total uncertainties of the other distributions. The total uncertainty in each bin is scaled up by \(1/\sqrt{R}\) with R (relative importance) values displayed in Table 1. Therefore, the total uncertainty of each bin of \(p_{\textrm{T}} ^{\text {sum}}\) in the transMIN and transMAX regions at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) is scaled up by \(\sqrt{2}\) and that of all other distributions by \(\sqrt{10}\). These scale factors ensure that the distributions are well described after the tuning. For the CP5-CR1 model, a good description of the input observables is obtained without scaling, meaning that all distributions are considered equally important.
The experimental uncertainties used in the fit, in general, have bin-to-bin correlations. However, some of the bins of the UE distributions used in the fit, e.g. \(p_{\textrm{T}} ^{\text {max}} > 10\,\text {Ge\hspace{-.08em}V} \), are dominated by statistical uncertainties, which are uncorrelated between bins. In the minimisation procedure, because the correlations between bins are not available for the input measurements, the experimental uncertainties are assumed to be uncorrelated between data points.
The parameters obtained from the CP5-CR1 and CP5-CR2 fits, as well as the value of the goodness of the fit are shown in Table 3. Uncertainties in the parameters of these tunes are discussed in Appendix B. In Ref. [13], the number of degrees of freedom (\(N_\textrm{dof}\)), defined as the sum of the number of bins of fit observables minus the number of fit parameters, for the tune CP5 is given as 63. However, this value of \(N_\textrm{dof}\) corresponds to the case when only 13\(\,\text {Te\hspace{-.08em}V}\) distributions are used. The value of \(N_\textrm{dof}\) for CP5 consistent with our calculation in this paper is 183. The tune CP5 was derived using two additional distributions in the fits; \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) at 13\(\,\text {Te\hspace{-.08em}V}\) with NSD-enhanced selection and SD-enhanced selection. Since these two observables depend on modelling of single diffraction dissociation, which is not well understood, they are not included in the fits for CP5-CR tunes. Therefore, the \(N_\textrm{dof}\) values for CP5-CR tunes are lower than the \(N_\textrm{dof}\) of CP5. The slight difference in the \(N_\textrm{dof}\) values between the CP5-CR1 and CP5-CR2 tunes is due to the difference in the number of fit parameters used in each tune, which are 7 and 6 respectively. Although the fit ranges for the CP5-CR tunes differ slightly, as shown in Table 1, the sum of number of bins of fit observables is the same for both tunes.
A preliminary version of the CP5-CR2 tune was derived including several jet substructure observables [27,28,29] in the fits. This tune, called CP5-CR2-j, has been used in the MC production in the CMS experiment. The CP5-CR2 and CP5-CR2-j tunes have very similar predictions in all final states discussed in this paper, because the tunes differ slightly only in the following parameters, where the listed values are for CP5-CR2-j:
-
MultipartonInteractions:ecmPow = 0.056,
-
MultipartonInteractions:coreRadius = 0.653,
-
MultipartonInteractions:coreFraction = 0.439,
-
ColourReconnection:m2lambda = 4.395,
-
MultipartonInteractions:fracGluon = 0.990.
The CP1 and CP2 are the two tunes in the CPX (X = 1–5) tune family [13] that use an LO PDF set [26]. We also derive CR tunes based on the CP1 and CP2 settings to study the effect of using a leading order (LO) PDF set with alternative CR models, although they are not used in precision measurements. We find that the predictions of the CR tunes based on CP1 and CP2 for the MB and UE observables are similar to the predictions of CR tunes based on CP5. However, CP1-CR1 (i.e. CP1 with the QCD-inspired colour reconnection model) has a different trend in particle multiplicity distributions compared with the predictions of other tunes discussed in this study. This different trend of CP1-CR1 cannot be attributed to the use of LO PDF set, because both CP1 and CP2 use the same LO PDF set and we do not see a different trend with CP2-CR1. The different trend observed with CP1-CR1 in the particle multiplicity distributions may become a collective effect rather than a single parameter effect, and could be an input for further tuning and development of the QCD-inspired model. Therefore, in Appendix A of this paper, we present the tune settings of the CR tunes based on CP1 and CP2, along with their predictions in the particle multiplicity distributions.
4 Performance of the tunes
In Figs. 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18 we show the observables measured at centre-of-mass energies of 1.96, 7, 8, and 13\(\,\text {Te\hspace{-.08em}V}\). The data points are shown in black, and are compared with simulations obtained from the pythia 8 event generator with the tunes CP5 (red), CP5-CR1 (blue), and CP5-CR2 (green). For simplicity, the tunes CP5-CR1 and CP5-CR2 will be referred to as CP5-CR when convenient. The lower panels show the ratios between each MC prediction and the data.
For the plots presented in Figs. 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 in Sects. 4.1 and 4.2, inelastic events (i.e. ND, SD, DD, and CD) are simulated with pythia 8.226 and compared with data at different centre-of-mass energies. The rest of the plots are produced with pythia 8.235. An update to the description of the elastic scattering component in pythia 8.235 led to a slight decrease in the default ND cross section. The default ND cross section in pythia 8.226, which is 55.5\(\text {\,mb}\) at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \), is lowered to 55.1\(\text {\,mb}\) in pythia 8.235. Hence, to reproduce the conditions of pythia 8.226 in pythia 8.235 or in a newer version, one should set the ND cross section manually.
4.1 Underlying-event and minimum-bias observables
MB is a generic term used to describe events collected with a loose selection process that are dominated by relatively soft particles. Although these events generally correspond to inelastic scattering, including ND and SD+DD+CD processes, these contributions may vary depending on the trigger requirements used in the experiments. For example, a sample of non-single-diffractive-enhanced (NSD-enhanced) events is selected by suppressing the SD contribution at the trigger level.
The UE observables measured by the CMS experiment at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) [17], namely \(N_{\textrm{ch}}\) density and the average \(p_{\textrm{T}} ^{\text {sum}}\) in the transMIN and transMAX regions are well described by all tunes in the plateau region as shown in Fig. 5. The region up to \({\approx }5\) \(\,\text {Ge\hspace{-.08em}V}\) of \(p_{\textrm{T}} ^{\text {max}} \) is highly sensitive to diffractive contributions [30]. There is a lack of measurements in this region where the tunes, in general, do not perform well. Although the optimisation of these components is beyond the scope of this study, we have extended the fit range to \({\approx }\)2–3\(\,\text {Ge\hspace{-.08em}V}\) as long as the data are well described. The rising part of the spectrum excluding the region up to \(\approx \)5\(\,\text {Ge\hspace{-.08em}V}\) of the \(N_{\textrm{ch}}\) density distributions is similarly described by all tunes, whereas in the \(p_{\textrm{T}} ^{\text {sum}}\) density distributions the predictions of CP5 differ slightly from the predictions of the CR tunes. These show that the CP5 tune has a harder \(p_{\textrm{T}}\) spectrum at low \(p_{\textrm{T}} ^{\text {max}} \) values. Through tuning the \(N_{\textrm{ch}}\) and average \(p_{\textrm{T}} ^{\text {sum}}\) density in the transMIN and transMAX regions, a satisfactory agreement is obtained for the same observables in the transDIFF region as well. Figure 6 shows the pseudorapidity distribution of charged hadrons in inelastic pp collisions measured by the CMS experiment at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) [22]. This observable is sensitive to the softer part of the MPI spectrum and well described by all tunes.
A crucial test for the performance of UE tunes, and of the CR simulation in particular, is the description of the average \(p_{\textrm{T}}\) of the charged particles as a function of \(N_{\textrm{ch}}\). Comparisons of the mean average \(p_{\textrm{T}}\) to the measurements by the ATLAS Collaboration at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) in the transMAX and transMIN regions [18] are displayed in Fig. 7. The tune CP5 describes the central values of the data perfectly for \(N_{\textrm{ch}} >7\), whereas the CR tunes show an almost constant discrepancy of 5–10% because of the harder \(p_{\textrm{T}}\) spectrum predicted by the tune CP5 for low-\(p_{\textrm{T}}\) particles. All CR tunes show a reasonable agreement with the data, confirming the accuracy of the parameters obtained for the new CR models. The improvement in the tuned CR models and their success in describing the data is seen by comparing Fig. 5 with Fig. 3, and Fig. 6 with Fig. 4. In these figures, CP5 tune predictions are also shown for easier comparison of CR tunes predictions with CP5.
In Fig. 8, charged-particle and \(p_{\textrm{T}} ^{\text {sum}}\) densities measured by the CMS experiment at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) [19] in the transMIN and transMAX regions, as functions of \(p_{\textrm{T}} ^{\text {max}}\), are compared with predictions from the tunes CP5 and CP5-CR. The data are reasonably well described for \(p_{\textrm{T}} ^{\text {max}} >5\,\text {Ge\hspace{-.08em}V} \).
In Fig. 9, charged particle and \(p_{\textrm{T}} ^{\text {sum}}\) densities in the transverse region, as functions of \(p_{\textrm{T}} ^{\text {max}}\), and the average \(p_{\textrm{T}}\) in the transverse region as functions of \(p_{\textrm{T}} ^{\text {max}}\) and of the \(N_{\textrm{ch}}\), measured by the ATLAS experiment at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) [20], are compared with the predictions from the tunes CP5 and CP5-CR. The central values of the average \(p_{\textrm{T}}\) in bins of the leading charged particle \(p_{\textrm{T}}\) and of the \(N_{\textrm{ch}}\) are consistent with the data points within 10%. A similar level of agreement as observed at 13\(\,\text {Te\hspace{-.08em}V}\) is achieved by the new tunes at 7\(\,\text {Te\hspace{-.08em}V}\).
The performance of the new tunes is also checked at 7\(\,\text {Te\hspace{-.08em}V}\) using inclusive measurements of charged-particle pseudorapidity distributions. In Fig. 10, the CMS measurements for \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) at 7\(\,\text {Te\hspace{-.08em}V}\) [31] with at least one charged particle in \(|\eta |<2.4\) are compared with predictions from the tunes CP5 and CP5-CR. The CP5 and CP5-CR1 have similar predictions, while CP5-CR2 predicts about 4% less charged particles than the first two tunes in all \(\eta \) bins of the measurement. Although all tunes provide a reasonable description of \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) with deviations up to \(\approx \)10%, the data and MC simulation show different trends for \(|\eta |>1.2\), where the trend for the data is not described well by the tunes. In the more central region, i.e. \(|\eta |<1.2\), the shape of the predictions agrees well with the data but there is a difference in normalisation. For example, CP5 and CP5-CR1 predict 3–4% and CP5-CR2 predicts about 7% fewer charged particles in all bins for \(|\eta |<1.2\) compared with the data.
In Fig. 11, charged-particle and \(p_{\textrm{T}} ^{\text {sum}}\) densities measured as functions of \(p_{\textrm{T}} ^{\text {max}}\) at \(\sqrt{s}=1.96\,\text {Te\hspace{-.08em}V} \) by the CDF experiment [21] in the transMIN and transMAX regions are compared with predictions from the tunes CP5 and CP5-CR, respectively. All predictions reproduce the UE observables within \(\approx \)10% at \(\sqrt{s}=1.96\), 7, and 13\(\,\text {Te\hspace{-.08em}V}\).
We compare the new CMS tunes also with MB and UE data measured at forward pseudorapidities. The energy density, \(\text{ d }E/\text{d }\eta \), measured in MB events and in NSD events by the CMS experiment at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \), is shown in Fig. 12. The data are well described by CP5-CR2 within uncertainties and for all measured \(|\eta |\) bins. The predictions of CP5 and CP5-CR1 overestimate the data in \(4.2<|\eta |<4.9\).
The pseudorapidity of charged particles, \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \), in the ranges \(|\eta |<2.2\) and \(5.3<|\eta |<6.4\) measured by the CMS and TOTEM experiments at \(\sqrt{s}=8\,\text {Te\hspace{-.08em}V} \) [32] is presented in Fig. 13. The events are required to have at least one charged particle in \(5.3<\eta <6.5\) or \(-6.5<\eta <-5.3\) with \(p_{\textrm{T}} >0\). All tunes describe the data within the uncertainties. Additionally, Fig. 13 shows the pseudorapidity of charged particles, \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \), in \(5.3<|\eta |<6.4\) in events with at least one charged particle with \(p_{\textrm{T}} >40\,\text {Me\hspace{-.08em}V} \), measured by the TOTEM experiment at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) [33]. Both CP5 and CP5-CR1 describe the data within the uncertainties, whereas CP5-CR2 underestimates the data by 15%.
4.2 Particle multiplicities
Figure 14 shows the strange particle production for \(\varLambda \) baryons and \(\text {K}_{\textrm{S}}^{0}\) mesons as a function of rapidity (y) measured by the CMS experiment [16] in NSD events at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \). The rapidity is defined as \(y=\frac{1}{2}\ln {\frac{E+p_L}{E-p_L}}\), where E is the particle energy and \(p_L\) is the particle momentum along the anticlockwise-beam direction. It is shown in Ref. [14] that the new CR models might be beneficial for describing the ratios of strange particle multiplicities, for example \(\varLambda /\) \(\text {K}_{\textrm{S}}^{0}\) in pp collisions. We observe that all CP5 tunes, regardless of the CR model, describe particle production for \(\text {K}_{\textrm{S}}^{0}\) mesons as a function of rapidity very well. However, they underestimate particle production for \(\varLambda \) versus rapidity by about 30%. Therefore, the ratio \(\varLambda /\) \(\text {K}_{\textrm{S}}^{0}\) is not perfectly described but this could be improved by different hadronisation models [35, 36]. Including these observables, as well as the recent measurements of baryon production from the ALICE and LHCb experiments [37, 38], could be beneficial in future tune derivations. This is discussed in Appendix C.
The multiplicities of identified particles are also investigated in simulated MB events (ND+SD+DD+CD). Figure 15 shows the ratio of proton over pion production, as a function of particle \(p_{\textrm{T}}\) [39]. All the tunes predict a similar trend, showing that the new CR models do not lead to a significant improvement in the description of the ratio of proton to pion production. However, it is known that this observable is strongly correlated with event particle multiplicity [39,40,41] and not only CR, since also hadronisation and MPI play a key role in describing the ratios of particle yields.
4.3 Jet substructure observables
The number of charged particles contained in jets is an important observable that makes it possible to distinguish quark-initiated jets from gluon-initiated jets. The average number of charged hadrons with \(p_{\textrm{T}} >500\) \(\,\text {Me\hspace{-.08em}V}\) inside the jets measured by the CMS experiment as a function of the jet \(p_{\textrm{T}}\) is shown in Fig. 16 [27]. The predictions of the CR tunes are comparable, and produce roughly 5% fewer charged particles than the CP5 tune. All predictions show a reasonable description of the data.
Figure 17 presents the distributions of \(F(z)=(1/N_\text {jet})(\text{ d } N_{\textrm{ch}}/\text{d } z)\), where z is the longitudinal momentum fraction, and \(N_{\textrm{ch}}\) is the charged-particle multiplicity in the jet, measured by the ATLAS experiment at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) [28]. The F(z) parameter is related to the fragmentation function and is presented for \(p_{\textrm{T}} ^{\text {jet}} =25\text {--}40\) \(\,\text {Ge\hspace{-.08em}V}\) and \(p_{\textrm{T}} ^{\text {jet}} =400\text {--}500\) \(\,\text {Ge\hspace{-.08em}V}\). The CR tunes describe low-\(p_{\textrm{T}} ^{\text {jet}}\) data better than CP5, and their predictions reasonably agree with the high-\(p_{\textrm{T}} ^{\text {jet}}\) data, except for the last bin. The high-\(p_{\textrm{T}} ^{\text {jet}}\) data are well described by the CP5 tune within the uncertainties, and its central values agree better with the predictions of the CP5 tune than with those of the CP5-CR tunes.
4.4 Drell–Yan events
Drell–Yan (DY) events [42, 43] with the Z boson decaying to \({{\upmu }}^{+} {{\upmu }}^{-} \) were generated with pythia 8 and compared with CMS data at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \). Figure 18 shows the \(N_{\textrm{ch}}\) and \(p_{\textrm{T}}\) flow as a function of the Z boson \(p_{\textrm{T}}\) (in the invariant \({{\upmu }}^{+} {{\upmu }}^{-} \) mass window of 81–101\(\,\text {Ge\hspace{-.08em}V}\)) in the region transverse to the boson momentum [44], which is expected to be dominated by the UE.
The CP5 tunes predict up to 15% too many charged particles at low Z boson \(p_{\textrm{T}}\), where additional effects, such as the intrinsic transverse momentum of the interacting partons (i.e. primordial \(k_{\textrm{T}}\)) are expected to play a role. Higher-order corrections, as implemented in MadGraph 5_amc@nlo v2.4.2 [45] with FxFx merging [46], are necessary to describe the total \(p_{\textrm{T}}\) flow. The impact of the different CR models is negligible in DY events.
4.5 Top quark observables
4.5.1 Jet substructure in \({{\text {t}} {}{\bar{\text {t}}}}\) events
A study of the UE in \({{\text {t}} {}{\bar{\text {t}}}}\) events [8] also estimated the effects of the CR on the top quark decay products by investigating the differences between predictions using pythia 8 with the ERD off and on options. In Ref. [8], in addition to the QCD-inspired and gluon-move models, predictions of the rope hadronisation model [47, 48] are also compared with the data. In the rope hadronisation model, overlapping strings are treated to act coherently as a “rope”. The interactions between overlapping strings are described by an interaction potential inspired by the phenomenology of superconductors [35, 36, 47,48,49,50,51,52]. The ERD off and on options allow the CR to take place before or after the top quark decay, respectively. In particular, the ERD option allows the top quark decay products to be colour reconnected with the partons from MPI systems. Ref. [8] showed that these different models and options produce similar predictions for UE observables in \({{\text {t}} {}{\bar{\text {t}}}}\) events. However, some jet-shape distributions in \({{\text {t}} {}{\bar{\text {t}}}}\) events display a more significant effect [53], e.g. in the number of charged particles in jets. In the following, we investigate how the pythia 8 CR tunes describe the CMS \({{\text {t}} {}{\bar{\text {t}}}}\) jet substructure data [53]. In the CMS measurement, jets reconstructed using the anti-\(k_{\textrm{T}}\) algorithm [54] with a distance parameter of \(R=0.4\) as implemented in FASTJET 3.1 [55] are used. Jets with \(p_{\textrm{T}} >30\,\text {Ge\hspace{-.08em}V} \) within \(|\eta |<2\) are selected. Jet pairs (\(\mathrm {j_1}\) and \(\mathrm {j_2}\)) are required to be far from each other in \(\eta \)-\(\phi \) space, \(\varDelta R(\mathrm {j_1},\mathrm {j_2}) = \sqrt{\smash [b]{(\eta _\mathrm {j_1}-\eta _\mathrm {j_2})^2 - (\phi _\mathrm {j_1}-\phi \mathrm {j_2})^2}}>0.8\). Jet substructure observables are calculated from jet constituents with \(p_{\textrm{T}} >1\,\text {Ge\hspace{-.08em}V} \), e.g. in the plateau region of high track finding efficiency and low misidentification rate. Here we focus on two variables, (i) \(\lambda _0^0(N)\), which is the number of charged particles with \(p_{\textrm{T}} > 1\,\text {Ge\hspace{-.08em}V} \) in the jet, and (ii) the separation between two groomed subjets, \(\varDelta R_g\), that are shown in Fig. 19 for gluon jets and inclusive jets, respectively. A “groomed jet” refers to a jet with soft and wide-angle radiation removed by a dedicated grooming algorithm [56, 57].
The compatibility of data and MC predictions is evaluated using a measure defined as \(\chi ^2 = \varDelta ^T C^{-1} \varDelta \), where \(\varDelta \) is the difference vector between measured and predicted values, and C is the total covariance matrix of the measurement. Since the measured distribution is normalised to unity, its covariance matrix is singular, i.e. not invertible. To render C invertible, the vector entry and matrix row/column corresponding to one measured bin need to be discarded; we choose to remove the last bin. The results are displayed in Table 4 for all jets inclusively as well as for each jet flavour separately. We observe that none of the tunes describe the \(\lambda _0^0(N)\) data well for all jet flavours. As concluded in Ref. [53], flavour-dependent improvements in the nonperturbative physics modelling may be required for a better description of the data. The angle between the groomed subjets, on the other hand, is infrared and collinear safe and can be described very well by an increase in the \(\alpha _\textrm{S} ^\textrm{FSR}(m_\text {Z})\), which corresponds to a decrease in the FSR renormalisation scale \(\mu _R^\textrm{FSR}\). Table 4 shows the results obtained by varying \(\mu _R^\textrm{FSR}\) by factors of 0.5 and 2.
4.5.2 Pull angle in \({{\text {t}} {}{\bar{\text {t}}}}\) events
Figure 20 displays the normalised \({{\text {t}} {}{\bar{\text {t}}}}\) differential cross section for the jet pull angle [59] defined using the jets originating from the decay of a W boson in \({{\text {t}} {}{\bar{\text {t}}}}\) events, as measured by the ATLAS experiment [58]. The observable is shown for the case where only the charged constituents of the jet are used in the calculation. The data are compared with predictions from POWHEG v2 [60]+pythia 8 using the CP5 tunes or the corresponding CR tunes. The \(\chi ^2\) values are calculated as described in Sect. 4.5.1 and are shown in the last column of Table 4. The pull angle is particularly sensitive to the setting of the ERD option. With ERD turned off, the decay products of the W boson in \({{\text {t}} {}{\bar{\text {t}}}}\) events are not included in CR, and the predictions using the tunes with the various CR models are similar to each other. With ERD enabled, CR now modifies the pull angle between the two jets, which is observed in Fig. 20. The predictions of each tune also show significant changes when ERD is enabled. For both the nominal and CP5-CR1 (QCD-inspired) tunes, the prediction with ERD improves the description of the data, and the difference between the predictions with or without ERD is larger for the CP5-CR1-based tune. We observe the opposite for the CP5-CR2-based (gluon-move) tunes, for which the choice without ERD is preferable. This picture might be different if the flip mechanism had been added in the tuning of the gluon-move model. The move step in the gluon-move model is more restrictive because it allows only gluons to move between the string end-points. The inclusion of the flip mechanism would also allow the string end-points to be mixed with each other and, therefore, could further reduce the total string length in an event. However, as indicated earlier, the effect of the flip mechanism on diffractive events is not well understood and, therefore, this mechanism is not used in this paper.
Overall, the QCD-inspired model with ERD provides the best description of the jet pull angle. The differences between the predictions using the different tunes observed here indicate that the inclusion of observables, such as the jet pull angle and other jet substructure observables, could be beneficial in future tune derivations.
5 Uncertainty in the top quark mass due to colour reconnection
The top quark mass has been measured with high precision using the 7, 8, and 13\(\,\text {Te\hspace{-.08em}V}\) \({{\text {t}} {}{\bar{\text {t}}}}\) data at the LHC [10, 61,62,63,64,65,66,67,68,69,70,71,72,73]. The most precise value of \(m_{{\text {t}}} =172.44\pm 0.13\,\text {(stat)} \pm 0.47\,\text {(syst)} \) \(\,\text {Ge\hspace{-.08em}V}\) was measured by the CMS Collaboration combining 7 and 8\(\,\text {Te\hspace{-.08em}V}\) data [67]. To further improve the precision of \(m_{{\text {t}}}\) measurements, a complete analysis of the systematic uncertainties in the measurement is crucial. One of the dominant systematic uncertainties is due to the modelling of CR in top quark decays [67]. The procedure for estimating this uncertainty used for the LHC Run 1 (years 2009–2013) analyses at \(\sqrt{s}=7\) and 8\(\,\text {Te\hspace{-.08em}V}\) was based on a comparison of two values of \(m_{{\text {t}}}\), calculated by using predictions with the same UE tune with and without CR effects. In Ref. [67], this is done using the tune “Perugia 2011” with and without CR effects included. The “Perugia 2011” tunes family is the updated version of the “Perugia (Tevatron)” tunes family and also takes into account lessons learned from LHC MB and UE data at 0.9 and 7\(\,\text {Te\hspace{-.08em}V}\) [74]. The new CMS tunes, presented in Sect. 2, which use different CR models, can be used to give a better evaluation of the CR uncertainty. In particular, the uncertainty is now calculated by comparing results for \(m_{{\text {t}}}\) values obtained from different realistic CR models, such as CP5, CP5-CR1, and CP5-CR2.
Additionally, one can also estimate the effects of the CR on the top quark decay products by investigating the differences between predictions using pythia 8 with the option ERD off and on, which was done for the UE observables [8].
A determination of \(m_{{\text {t}}}\) using a kinematic reconstruction of the decay products in semileptonic \({{\text {t}} {}{\bar{\text {t}}}}\) events at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) is reported in Ref. [10]. In these events, one of the W bosons from the top quark decays into a muon or electron and a neutrino, and the other into a quark–antiquark pair. In this analysis, \(m_{{\text {t}}}\) and the jet energy scale factor were determined simultaneously through a joint-likelihood fit to the selected events. The results with the QCD-inspired and gluon-move models were also compared. The pythia 8 CUETP8M2T4 [75] UE tune was used, and the parameters of the CR models were tuned to UE and MB data at \(\sqrt{s}=13\,\text {Te\hspace{-.08em}V} \) [10]. They found that the gluon-move model results in a 0.31\(\,\text {Ge\hspace{-.08em}V}\) shift from the \(m_{{\text {t}}}\) value obtained with the default simulation. This shift, which is larger than the shifts caused by the other CR models, is assumed to be the uncertainty due to the modelling of CR in the measured \(m_{{\text {t}}}\). It is much larger than the shift, 0.01\(\,\text {Ge\hspace{-.08em}V}\), due to the CR modelling in the Run 1 measurement [67]. This is the largest source of uncertainty in the measured \(m_{{\text {t}}}\), where the total uncertainty is 0.62\(\,\text {Ge\hspace{-.08em}V}\) [10]. Similar studies using single top quark final states are reported in Refs. [71, 76].
We compare the \(m_{{\text {t}}}\) and W boson mass values obtained with different tune configurations based on our new tunes in Table 5. Top quark candidates are constructed by a Rivet routine in a sample of simulated semileptonic \({{\text {t}} {}{\bar{\text {t}}}}\) events. Events must contain exactly one lepton with \(p_{\textrm{T}} >30\,\text {Ge\hspace{-.08em}V} \) and \(|\eta |<2.1\). Leptons are “dressed” with the surrounding photons within a cone of \(\varDelta R = 0.1\) and are required to yield an invariant mass window of 5\(\,\text {Ge\hspace{-.08em}V}\) centred at 80.4\(\,\text {Ge\hspace{-.08em}V}\), when combined with a neutrino in the event. The events must also contain at least four jets, reconstructed with the anti-\(k_{\textrm{T}}\) algorithm, with \(p_{\textrm{T}} >30\,\text {Ge\hspace{-.08em}V} \) within \(|\eta |<2.4\). At least two of the jets are required to originate from the fragmentation of bottom quarks, and at least two other jets, referred to as light-quark jets, must not originate from bottom quarks. One jet originating from a bottom quark is combined with the lepton and neutrino to form a leptonically decaying top quark candidate, whereas the other jet originating from a bottom quark is combined with two other jets to form a hadronically decaying top quark candidate. The difference in the invariant mass window of the two top quark candidates is required to be less than 20\(\,\text {Ge\hspace{-.08em}V}\), and the invariant mass of the two light-quark jets is within a window of 10\(\,\text {Ge\hspace{-.08em}V}\) centred at 80.4\(\,\text {Ge\hspace{-.08em}V}\). If more than one combination of jets satisfy these criteria when combined with the lepton and neutrino, then only one combination is chosen based on how similar the invariant masses of the two top quark candidates are to each other and on how close the invariant mass of the light-quark jets is to \(80.4\,\text {Ge\hspace{-.08em}V} \). The invariant mass of the hadronically decaying top quark candidates constructed in this way for each of the different tune configurations is shown in Fig. 21. The top quark and W boson mass values are obtained from these hadronically decaying top quark candidates by fitting a Gaussian function within an 8\(\,\text {Ge\hspace{-.08em}V}\) mass window around the corresponding mass peak. Table 5 also contains the differences from the nominal \(m_{{\text {t}}}\) and \(m_{\text {W}} \) values (\(\varDelta m_{{\text {t}}} \), and \(\varDelta m_{\text {W}} \)) and the difference in \(\varDelta m_{{\text {t}}} ^\text {hyb}\), a quantity that was introduced in Ref. [67] to incorporate both an in situ jet scale factor determined from the reconstructed \(m_{\text {W}}\) as well as prior knowledge about the jet energy scale in a hybrid approach to extract \(m_{{\text {t}}}\). Here, \(\varDelta m_{{\text {t}}} ^\text {hyb}\) is approximated as \(\varDelta m_{{\text {t}}}-0.5\varDelta m_{\text {W}} \). From Table 5, we observe that the largest deviation from the predictions of CP5 is CP5-CR2 ERD (0.32\(\,\text {Ge\hspace{-.08em}V}\)) similar to the largest shift found in Ref. [10] using CUETP8M2T4. However, CP5-CR2 ERD is not able to describe the available colour flow data, and can therefore be excluded from the list of modelling uncertainties.
6 Summary and conclusion
New sets of parameters for two of the colour reconnection (CR) models implemented in the pythia 8 event generator, QCD-inspired and gluon-move, are obtained, based on the default CMS pythia 8 tune CP5. Measurements sensitive to underlying-event (UE) contributions performed at hadron-colliders at \(\sqrt{s}=1.96\), 7, and 13\(\,\text {Te\hspace{-.08em}V}\) are used to constrain the parameters for the CR and for the multiple-parton interactions simultaneously. Various measurements at 1.96, 7, 8, and 13\(\,\text {Te\hspace{-.08em}V}\) are used to evaluate the performance of the new tunes. The central values predicted by the new CR tunes for the UE and minimum-bias events describe the data significantly better than the CR models with their default parameters before tuning. The predictions of the new tunes achieve a reasonable agreement in many UE observables, including the ones measured at forward pseudorapidities. However, the models after tuning do not generally perform better than the CP5 tune for the observables presented in this study. Although the new CR tunes presented in this work are not intended to improve the description of the measurements of strange particle multiplicities for \(\varLambda \) baryons and \(\text {K}_{\textrm{S}}^{0}\) mesons, we test the new tunes against them. We find that the new CR models, when tuned using only measurements that are sensitive to the UE, do not provide a better description of the distribution of strange particle production as a function of rapidity for \(\varLambda \) baryons. However, we observe that all CP5 tunes, irrespective of the CR model, describe particle production for \(\text {K}_{\textrm{S}}^{0}\) as a function of rapidity well. Including these observables in the fits, along with the latest measurements of baryon/meson production, could be beneficial for future tune derivations.
The predictions of the new tunes for jet shapes and colour flow measurements done with top quark pair events are also compared with data. All tunes give similar predictions, but none of the tunes describe the jet shape distributions well. Some differences are also observed with respect to the colour flow data, which is particularly sensitive to the early resonance decay option in the CR models. The differences between the predictions using the different tunes observed here indicate that the inclusion of observables, such as the jet pull angle and other jet substructure observables, could be beneficial in tuning studies. A study of the uncertainty in the top quark mass measurement due to CR effects is also presented. The new CR tunes will play a role in the evaluation of systematic uncertainties associated with the modelling of colour reconnection.
Data Availability
This manuscript has no associated data or the data will not be deposited. [Authors’ comment: Release and preservation of data used by the CMS Collaboration as the basis for publications is guided by the CMS policy as stated in https://cms-docdb.cern.ch/cgi-bin/PublicDocDB/RetrieveFile?docid=6032 &filename=CMSDataPolicyV1.2.pdf &version=2 CMS data preservation, re-use and open access policy.]
References
T. Sjöstrand et al., An introduction to PYTHIA 8.2. Comput. Phys. Commun. 191, 159 (2015). https://doi.org/10.1016/j.cpc.2015.01.024. arXiv:1410.3012
R. Corke, T. Sjöstrand, Multiparton interactions with an x-dependent proton size. JHEP 05, 009 (2011). https://doi.org/10.1007/JHEP05(2011)009. arXiv:1101.5953
A. Buckley et al., General-purpose event generators for LHC physics. Phys. Rep. 504, 145 (2011). https://doi.org/10.1016/j.physrep.2011.03.005. arXiv:1101.2599
G. Gustafson, Dual description of a confined color field. Phys. Lett. B 175, 453 (1986). https://doi.org/10.1016/0370-2693(86)90622-2
Particle Data Group Collaboration, Review of Particle Physics. PTEP 2020(8), 083C01 (2020). https://doi.org/10.1093/ptep/ptaa104
T. Sjöstrand, M. van Zijl, A multiple interaction model for the event structure in hadron collisions. Phys. Rev. D 36, 2019 (1987). https://doi.org/10.1103/PhysRevD.36.2019
T. Sjöstrand, Colour reconnection and its effects on precise measurements at the LHC (2013). arXiv:1310.8073
CMS Collaboration, Study of the underlying event in top quark pair production in pp collisions at 13 TeV. Eur. Phys. J. C 79, 123 (2019). https://doi.org/10.1140/epjc/s10052-019-6620-z. arXiv:1807.02810
S. Argyropoulos, T. Sjöstrand, Effects of color reconnection on \(\rm {t\bar{t}}\) final states at the LHC. JHEP 11, 043 (2014). https://doi.org/10.1007/JHEP11(2014)043. arXiv:1407.6653
CMS Collaboration, Measurement of the top quark mass with lepton+jets final states using pp collisions at \(\sqrt{s}=13\) TeV. Eur. Phys. J. C 78, 891 (2018). https://doi.org/10.1140/epjc/s10052-018-6332-9. arXiv:1805.01428
B. Andersson, G. Gustafson, G. Ingelman, T. Sjostrand, Parton fragmentation and string dynamics. Phys. Rep. 97, 31 (1983). https://doi.org/10.1016/0370-1573(83)90080-7
CMS Collaboration, Event generator tunes obtained from underlying event and multiparton scattering measurements. Eur. Phys. J. C 76, 155 (2016). https://doi.org/10.1140/epjc/s10052-016-3988-x. arXiv:1512.00815
CMS Collaboration, Extraction and validation of a new set of CMS PYTHIA8 tunes from underlying-event measurements. Eur. Phys. J. C 80, 4 (2020). https://doi.org/10.1140/epjc/s10052-019-7499-4. arXiv:1903.12179
J.R. Christiansen, P.Z. Skands, String formation beyond leading colour. JHEP 08, 003 (2015). https://doi.org/10.1007/JHEP08(2015)003. arXiv:1505.01681
T. Sjöstrand, P.Z. Skands, Multiple interactions and the structure of beam remnants. JHEP 03, 053 (2004). https://doi.org/10.1088/1126-6708/2004/03/053. arXiv:hep-ph/0402078
CMS Collaboration, Strange particle production in pp collisions at \(\sqrt{s}=0.9\) and 7 TeV. JHEP 05, 064 (2011). https://doi.org/10.1007/JHEP05(2011)064. arXiv:1102.4282
CMS Collaboration, Underlying event measurements with leading particles and jets in pp collisions at \(\sqrt{s}=13\) TeV. CMS Physics Analysis Summary CMS-PAS-FSQ-15-007 (2015)
ATLAS Collaboration, Measurement of charged-particle distributions sensitive to the underlying event in \(\sqrt{s}=13\) TeV proton-proton collisions with the ATLAS detector at the LHC. JHEP 03, 157 (2017). https://doi.org/10.1007/JHEP03(2017)157. arXiv:1701.05390
CMS Collaboration, Measurement of the underlying event activity in pp collisions at the LHC at 7 TeV and comparison with 0.9 TeV. CMS Physics Analysis Summary CMS-PAS-FSQ-12-020 (2012)
ATLAS Collaboration, Measurement of underlying event characteristics using charged particles in pp collisions at \(\sqrt{s}=900\) GeV and 7 TeV with the ATLAS detector. Phys. Rev. D 83, 112001 (2011). https://doi.org/10.1103/PhysRevD.83.112001. arXiv:1012.0791
CDF Collaboration, Study of the energy dependence of the underlying event in proton-antiproton collisions. Phys. Rev. D 92, 092009 (2015). https://doi.org/10.1103/PhysRevD.92.092009. arXiv:1508.05340
CMS Collaboration, Pseudorapidity distribution of charged hadrons in proton-proton collisions at \(\sqrt{s}=13\) TeV. Phys. Lett. B 751, 143 (2015). https://doi.org/10.1016/j.physletb.2015.10.004. arXiv:1507.05915
CMS Collaboration, Measurement of pseudorapidity distributions of charged particles in proton-proton collisions at \(\sqrt{s}=13\) TeV by the CMS experiment. CMS Physics Analysis Summary CMS-PAS-FSQ-15-008 (2016)
A. Buckley et al., Rivet user manual. Comput. Phys. Commun. 184, 2803 (2013). https://doi.org/10.1016/j.cpc.2013.05.021. arXiv:1003.0694
A. Buckley et al., Systematic event generator tuning for the LHC. Eur. Phys. J. C 65, 331 (2010). https://doi.org/10.1140/epjc/s10052-009-1196-7. arXiv:0907.2973
NNPDF Collaboration, Parton distributions from high-precision collider data. Eur. Phys. J. C 77, 663 (2017). https://doi.org/10.1140/epjc/s10052-017-5199-5. arXiv:1706.00428
CMS Collaboration, Shape, transverse size, and charged hadron multiplicity of jets in pp collisions at 7 TeV. JHEP 06, 160 (2012). https://doi.org/10.1007/JHEP06(2012)160. arXiv:1204.3170
ATLAS Collaboration, Measurement of the jet fragmentation function and transverse profile in proton-proton collisions at a center-of-mass energy of 7 TeV with the ATLAS detector. Eur. Phys. J. C 71, 1795 (2011). https://doi.org/10.1140/epjc/s10052-011-1795-y. arXiv:1109.5816
ATLAS Collaboration, Measurement of the charged-particle multiplicity inside jets from \(\sqrt{s}=8\) TeV pp collisions with the ATLAS detector. Eur. Phys. J. C 76, 322 (2016). https://doi.org/10.1140/epjc/s10052-016-4126-5. arXiv:1602.00988
CMS and TOTEM Collaborations, Measurement of single-diffractive dijet production in proton-proton collisions at \(\sqrt{s}=8\) TeV with the CMS and TOTEM experiments. Eur. Phys. J. C 80, 1164 (2020). https://doi.org/10.1140/epjc/s10052-020-08562-y. arXiv:2002.12146 [Erratum: https://doi.org/10.1140/epjc/s10052-021-08863-w]
CMS Collaboration, Pseudorapidity distributions of charged particles in pp collisions at \(\sqrt{s}=7\) TeV with at least one central charged particle. CMS Physics Analysis Summary CMS-PAS-QCD-10-024 (2010)
CMS, TOTEM Collaboration, Measurement of pseudorapidity distributions of charged particles in proton-proton collisions at \(\sqrt{s}\) = 8 TeV by the CMS and TOTEM experiments. Eur. Phys. J. C 74(10), 3053 (2014). https://doi.org/10.1140/epjc/s10052-014-3053-6. arXiv:1405.0722
TOTEM Collaboration, Measurement of the forward charged particle pseudorapidity density in \(pp\) collisions at \(\sqrt{s} = 7\) TeV with the TOTEM experiment. Europhys. Lett. 98(3), 31002 (2012). https://doi.org/10.1209/0295-5075/98/31002. arXiv:1205.4105
CMS Collaboration, Measurement of the energy density as a function of pseudorapidity in proton-proton collisions at \(\sqrt{s}=13\) TeV. Eur. Phys. J. C 79, 391 (2019). https://doi.org/10.1140/epjc/s10052-019-6861-x. arXiv:1812.04095
C. Bierlich, G. Gustafson, L. Lönnblad, A shoving model for collectivity in hadronic collisions (2016). arXiv:1612.05132
C. Bierlich, Rope hadronization and strange particle production. Eur. Phys. J. Web Conf. 171, 14003 (2018). https://doi.org/10.1051/epjconf/201817114003. arXiv:1710.04464
ALICE Collaboration, Measurement of prompt D\(^{0}\), \(\Lambda _{c}^{+}\), and \(\Sigma _{c}^{0,++}\)(2455) production in proton-proton collisions at \(\sqrt{s}=13\) TeV. Phys. Rev. Lett. 128, 012001 (2022). https://doi.org/10.1103/PhysRevLett.128.012001. arXiv:2106.08278
LHCb Collaboration, Measurement of \(b\) hadron fractions in 13 TeV pp collisions. Phys. Rev. D 100, 031102 (2019). https://doi.org/10.1103/PhysRevD.100.031102. arXiv:1902.06794
CMS Collaboration, Measurement of charged pion, kaon, and proton production in proton-proton collisions at \(\sqrt{s}=13\) TeV. Phys. Rev. D 96, 112003 (2017). https://doi.org/10.1103/PhysRevD.96.112003. arXiv:1706.10194
CMS Collaboration, Study of the inclusive production of charged pions, kaons, and protons in pp collisions at \(\sqrt{s}=0.9\), 2.76, and 7 TeV. Eur. Phys. J. C 72, 2164 (2012). https://doi.org/10.1140/epjc/s10052-012-2164-1. arXiv:1207.4724
CMS Collaboration, Study of the production of charged pions, kaons, and protons in pPb collisions at \(\sqrt{s_{NN}}=5.02\) TeV. Eur. Phys. J. C 74, 2847 (2014). https://doi.org/10.1140/epjc/s10052-014-2847-x. arXiv:1307.3442
S.D. Drell, T.-M. Yan, Massive lepton pair production in hadron-hadron collisions at high-energies. Phys. Rev. Lett. 25, 316 (1970). https://doi.org/10.1103/PhysRevLett.25.316 [Erratum: Phys. Rev. Lett. 25, 902 (1970)]
J.H. Christenson et al., Observation of massive muon pairs in hadron collisions. Phys. Rev. Lett. 25, 1523 (1970). https://doi.org/10.1103/PhysRevLett.25.1523
CMS Collaboration, Measurement of the underlying event activity in inclusive Z boson production in proton-proton collisions at \(\sqrt{s}=13\) TeV. JHEP 07, 032 (2018). https://doi.org/10.1007/JHEP07(2018)032. arXiv:1711.04299
J. Alwall et al., The automated computation of tree-level and next-to-leading order differential cross sections, and their matching to parton shower simulations. JHEP 07, 079 (2014). https://doi.org/10.1007/JHEP07(2014)079. arXiv:1405.0301
R. Frederix, S. Frixione, Merging meets matching in MC@NLO. JHEP 12, 061 (2012). https://doi.org/10.1007/JHEP12(2012)061. arXiv:1209.6215
C. Bierlich, G. Gustafson, L. Lönnblad, A. Tarasov, Effects of overlapping strings in pp collisions. JHEP 03, 148 (2015). https://doi.org/10.1007/JHEP03(2015)148. arXiv:1412.6259
C. Bierlich, J.R. Christiansen, Effects of color reconnection on hadron flavor observables. Phys. Rev. D 92, 094010 (2015). https://doi.org/10.1103/PhysRevD.92.094010. arXiv:1507.02091
C. Bierlich, G. Gustafson, L. Lönnblad, Collectivity without plasma in hadronic collisions. Phys. Lett. B 779, 58 (2018). https://doi.org/10.1016/j.physletb.2018.01.069. arXiv:1710.09725
V.A. Abramovsky, E.V. Gedalin, E.G. Gurvich, O.V. Kancheli, Long range azimuthal correlations in multiple production processes at high-energies. JETP Lett. 47, 337 (1988)
I. Altsybeev, Mean transverse momenta correlations in hadron-hadron collisions in MC toy model with repulsing strings. AIP Conf. Proc. 1701, 100002 (2016). https://doi.org/10.1063/1.4938711. arXiv:1502.03608
P. Cea, L. Cosmai, F. Cuteri, A. Papa, Flux tubes in the SU(3) vacuum: London penetration depth and coherence length. Phys. Rev. D 89, 094505 (2014). https://doi.org/10.1103/PhysRevD.89.094505. arXiv:1404.1172
CMS Collaboration, Measurement of jet substructure observables in \({{\rm t}}\overline{{\rm t}}\) events from proton-proton collisions at \(\sqrt{s}=\) 13 TeV. Phys. Rev. D 98, 092014 (2018). https://doi.org/10.1103/PhysRevD.98.092014. arXiv:1808.07340
M. Cacciari, G.P. Salam, G. Soyez, The anti-\({k_{\rm T }}\) jet clustering algorithm. JHEP 04, 063 (2008). https://doi.org/10.1088/1126-6708/2008/04/063. arXiv:0802.1189
M. Cacciari, G.P. Salam, G. Soyez, FastJet user manual. Eur. Phys. J. C 72, 1896 (2012). https://doi.org/10.1140/epjc/s10052-012-1896-2. arXiv:1111.6097
M. Dasgupta, A. Fregoso, S. Marzani, G.P. Salam, Towards an understanding of jet substructure. JHEP 09, 029 (2013). https://doi.org/10.1007/JHEP09(2013)029. arXiv:1307.0007
A.J. Larkoski, S. Marzani, G. Soyez, J. Thaler, Soft drop. JHEP 05, 146 (2014). https://doi.org/10.1007/JHEP05(2014)146. arXiv:1402.2657
ATLAS Collaboration, Measurement of colour flow with the jet pull angle in \({\rm t\bar{t}}\) events using the ATLAS detector at \(\sqrt{s}=8\) TeV. Phys. Lett. B 750, 475 (2015). https://doi.org/10.1016/j.physletb.2015.09.051. arXiv:1506.05629
J. Gallicchio, M.D. Schwartz, Seeing in color: jet superstructure. Phys. Rev. Lett. 105, 022001 (2010). https://doi.org/10.1103/PhysRevLett.105.022001. arXiv:1001.5027
S. Frixione, P. Nason, G. Ridolfi, A positive-weight next-to-leading-order Monte Carlo for heavy flavour hadroproduction. JHEP 09, 126 (2007). https://doi.org/10.1088/1126-6708/2007/09/126. arXiv:0707.3088
ATLAS, CDF, CMS, and D0 Collaborations, First combination of Tevatron and LHC measurements of the top-quark mass, Technical Reports ATLAS-CONF-2014-008, CDF-NOTE-11071, CMS-PAS-TOP-13-014, D0-NOTE-6416, FERMILAB-TM-2582-E (2014). arXiv:1403.4427
ATLAS Collaboration, Measurement of the top-quark mass in the fully hadronic decay channel from ATLAS data at \(\sqrt{s}=7\) TeV. Eur. Phys. J. C 75, 158 (2015). https://doi.org/10.1140/epjc/s10052-015-3373-1. arXiv:1409.0832
ATLAS Collaboration, Measurement of the top quark mass in the \({\rm t\bar{t}}\rightarrow \text{lepton+jets} \) and \({\rm t\bar{t}}\rightarrow \text{ dilepton } \) channels using \(\sqrt{s}=7\) tev ATLAS data. Eur. Phys. J. C 75, 330 (2015). https://doi.org/10.1140/epjc/s10052-015-3544-0. arXiv:1503.05427
ATLAS Collaboration, Measurement of the top quark mass in the \({\rm t\bar{t}}\rightarrow \) dilepton channel from \(\sqrt{s}=8\) TeV ATLAS data. Phys. Lett. B 761, 350 (2016). https://doi.org/10.1016/j.physletb.2016.08.042. arXiv:1606.02179
ATLAS Collaboration, Top-quark mass measurement in the all-hadronic \({\rm t\overline{t} }\) decay channel at \(\sqrt{s}=8\) TeV with the ATLAS detector. JHEP 09, 118 (2017). https://doi.org/10.1007/JHEP09(2017)118. arXiv:1702.07546
ATLAS Collaboration, Measurement of the top quark mass in the \({\rm t\bar{t}}\rightarrow \) lepton+jets channel from \(\sqrt{s}=8\) TeV ATLAS data and combination with previous results. Eur. Phys. J. C 79, 290 (2019). https://doi.org/10.1140/epjc/s10052-019-6757-9. arXiv:1810.01772
CMS Collaboration, Measurement of the top quark mass using proton-proton data at \(\sqrt{s}=7\) and 8 TeV. Phys. Rev. D 93, 072004 (2016). https://doi.org/10.1103/PhysRevD.93.072004. arXiv:1509.04044
CMS Collaboration, Measurement of the top-quark mass in \({\rm t}{\bar{\text{ t }}}\) events with lepton+jets final states in pp collisions at \(\sqrt{s}=7\) TeV. JHEP 12, 105 (2012). https://doi.org/10.1007/JHEP12(2012)105. arXiv:1209.2319
CMS Collaboration, Measurement of the top-quark mass in \({\rm t}{\bar{\text{ t }}}\) events with dilepton final states in pp collisions at \(\sqrt{s}=7\) TeV. Eur. Phys. J. C 72, 2202 (2012). https://doi.org/10.1140/epjc/s10052-012-2202-z. arXiv:1209.2393
CMS Collaboration, Measurement of the top-quark mass in all-jets \({\rm t}{\bar{\text{ t }}}\) events in pp collisions at \(\sqrt{s}=7\) TeV. Eur. Phys. J. C 74, 2758 (2014). https://doi.org/10.1140/epjc/s10052-014-2758-x. arXiv:1307.4617
CMS Collaboration, Measurement of the top quark mass using single top quark events in proton-proton collisions at \(\sqrt{s}=8\) TeV. Eur. Phys. J. C 77, 354 (2017). https://doi.org/10.1140/epjc/s10052-017-4912-8. arXiv:1703.02530
CMS Collaboration, Measurement of the \({{\rm t}}\overline{\rm t}\) production cross section, the top quark mass, and the strong coupling constant using dilepton events in pp collisions at \(\sqrt{s}=13\) TeV. Eur. Phys. J. C 79, 368 (2019). https://doi.org/10.1140/epjc/s10052-019-6863-8. arXiv:1812.10505
CMS Collaboration, Measurement of the top quark mass in the all-jets final state at \(\sqrt{s}=13\) TeV and combination with the lepton+jets channel. Eur. Phys. J. C 79(313) (2019). https://doi.org/10.1140/epjc/s10052-019-6788-2. arXiv:1812.10534
P.Z. Skands, Tuning Monte Carlo generators: the Perugia tunes. Phys. Rev. D 82, 074018 (2010). https://doi.org/10.1103/PhysRevD.82.074018. arXiv:1005.3457
CMS Collaboration, Investigations of the impact of the parton shower tuning in PYTHIA8 in the modelling of \({{\rm t}}\overline{{\rm t}}\) at \(\sqrt{s}=8\) and 13 TeV. CMS Physics Analysis Summary CMS-PAS-TOP-16-021 (2016)
CMS Collaboration, Measurement of the top quark mass using events with a single reconstructed top quark in pp collisions at \(\sqrt{s}=13\) TeV. JHEP 12(161) (2021). https://doi.org/10.1007/JHEP12(2021)161. arXiv:2108.10407
Acknowledgements
We congratulate our colleagues in the CERN accelerator departments for the excellent performance of the LHC and thank the technical and administrative staffs at CERN and at other CMS institutes for their contributions to the success of the CMS effort. In addition, we gratefully acknowledge the computing centres and personnel of the Worldwide LHC Computing Grid and other centres for delivering so effectively the computing infrastructure essential to our analyses. Finally, we acknowledge the enduring support for the construction and operation of the LHC, the CMS detector, and the supporting computing infrastructure provided by the following funding agencies: BMBWF and FWF (Austria); FNRS and FWO (Belgium); CNPq, CAPES, FAPERJ, FAPERGS, and FAPESP (Brazil); MES and BNSF (Bulgaria); CERN; CAS, MoST, and NSFC (China); MINCIENCIAS (Colombia); MSES and CSF (Croatia); RIF (Cyprus); SENESCYT (Ecuador); MoER, ERC PUT and ERDF (Estonia); Academy of Finland, MEC, and HIP (Finland); CEA and CNRS/IN2P3 (France); BMBF, DFG, and HGF (Germany); GSRI (Greece); NKFIH (Hungary); DAE and DST (India); IPM (Iran); SFI (Ireland); INFN (Italy); MSIP and NRF (Republic of Korea); MES (Latvia); LAS (Lithuania); MOE and UM (Malaysia); BUAP, CINVESTAV, CONACYT, LNS, SEP, and UASLP-FAI (Mexico); MOS (Montenegro); MBIE (New Zealand); PAEC (Pakistan); MES and NSC (Poland); FCT (Portugal); MESTD (Serbia); MCIN/AEI and PCTI (Spain); MOSTR (Sri Lanka); Swiss Funding Agencies (Switzerland); MST (Taipei); MHESI and NSTDA (Thailand); TUBITAK and TENMAK (Turkey); NASU (Ukraine); STFC (United Kingdom); DOE and NSF (USA). Individuals have received support from the Marie-Curie programme and the European Research Council and Horizon 2020 Grant, contract Nos. 675440, 724704, 752730, 758316, 765710, 824093, 884104, and COST Action CA16108 (European Union); the Leventis Foundation; the Alfred P. Sloan Foundation; the Alexander von Humboldt Foundation; the Belgian Federal Science Policy Office; the Fonds pour la Formation à la Recherche dans l’Industrie et dans l’Agriculture (FRIA-Belgium); the Agentschap voor Innovatie door Wetenschap en Technologie (IWT-Belgium); the F.R.S.-FNRS and FWO (Belgium) under the “Excellence of Science – EOS” – be.h project n. 30820817; the Beijing Municipal Science & Technology Commission, No. Z191100007219010; the Ministry of Education, Youth and Sports (MEYS) of the Czech Republic; the Hellenic Foundation for Research and Innovation (HFRI), Project Number 2288 (Greece); the Deutsche Forschungsgemeinschaft (DFG), under Germany’s Excellence Strategy – EXC 2121 “Quantum Universe” – 390833306, and under project number 400140256 – GRK2497; the Hungarian Academy of Sciences, the New National Excellence Program – ÚNKP, the NKFIH research grants K 124845, K 124850, K 128713, K 128786, K 129058, K 131991, K 133046, K 138136, K 143460, K 143477, 2020-2.2.1-ED-2021-00181, and TKP2021-NKTA-64 (Hungary); the Council of Science and Industrial Research, India; the Latvian Council of Science; the Ministry of Education and Science, project no. 2022/WK/14, and the National Science Center, contracts Opus 2021/41/B/ST2/01369 and 2021/43/B/ST2/01552 (Poland); the Fundação para a Ciência e a Tecnologia, grant CEECIND/01334/2018 (Portugal); the National Priorities Research Program by Qatar National Research Fund; MCIN/AEI/10.13039/501100011033, ERDF “a way of making Europe”, and the Programa Estatal de Fomento de la Investigación Científica y Técnica de Excelencia María de Maeztu, grant MDM-2017-0765 and Programa Severo Ochoa del Principado de Asturias (Spain); the Chulalongkorn Academic into Its 2nd Century Project Advancement Project, and the National Science, Research and Innovation Fund via the Program Management Unit for Human Resources & Institutional Development, Research and Innovation, grant B05F650021 (Thailand); the Kavli Foundation; the Nvidia Corporation; the SuperMicro Corporation; the Welch Foundation, contract C-1845; and the Weston Havens Foundation (USA).
Author information
Authors and Affiliations
Consortia
Ethics declarations
Conflict of interest
The authors declare that they have no conflict of interest.
Appendices
Colour reconnection tunes with a leading-order PDF set
The list of input Rivet routines used as inputs for the fits, as well as the centre-of-mass energy values, the \(\eta \) ranges, the names of the distributions, the x-axis ranges, and the R values of the distributions are displayed in Table 6 for the tunes CP1-CR1 and CP1-CR2, and in Table 7 for the tunes CP2-CR1 and CP2-CR2. The baseline tunes CP1 and CP2 use the NNPDF31_lo_as_0130 [26] PDF set, with an \(\alpha _\textrm{S} (m_\text {Z})\) value of 0.130 for ISR, FSR, and MPI, and the MPI-based CR model. The parameters of the tunes are documented in Ref. [13] and displayed in Tables 8 and 9. The parameters obtained from the CP1-CR1, CP1-CR2, CP2-CR1, and CP2-CR2 fits, as well as the value of the goodness of the fits are displayed in Tables 8 and 9. The predictions of these new CR tunes for particle multiplicities are shown in Figs. 22, and 23. The CR tunes based on CP1 and CP2 describe MB and UE data as well as the CR tunes based on CP5, except for the different trend observed with CP1-CR1 in the particle multiplicity distributions.
Parameter ranges and uncertainties in the tunes
The parameter ranges are chosen such that the sampled MC space does not destroy the definition of a particular observable in the fits. In Fig. 24, some sample histograms showing the range of variation available on the observable histograms are given for CP5-CR1. The results are similar for other observables used in the fits as well as for CP5-CR2.
The CP5-CR1 and CP5-CR2 tunes were developed to evaluate the uncertainty in CP5 that results from different color reconnection models. The uncertainties in the parameters for these tunes were estimated using eigentunes provided by Professor. Eigentunes represent variations of the tuned parameters in the parameter space along the maximally independent directions. The magnitude of the variation corresponds to a change in the \(\chi ^{*2}\) (\(\varDelta \chi ^{*2}\)) equal to the \(\chi ^{*2}\) of the fit. The choice of \(\varDelta \chi ^{*2}\), which is recommended by the Professor Collaboration, is based on empirical grounds since modifying it in equation Eq. (4) does not yield a statistically meaningful variation. Such a change, \(\varDelta \chi ^{*2} = \chi ^{*2}\), is considered reasonable for reflecting the combined statistical and systematic uncertainty in the model parameters, and results in variations similar in magnitude to the uncertainties in the fitted data points. However, this approach may result in uncertainties that do not fully encompass the data in every bin. If the uncertainties in the fitted data points are uncorrelated, their magnitudes will depend on the bin widths. For the data used in the fit, the uncertainties are mostly correlated between bins. However, for UE observables with high \(p_{\textrm{T}} ^{\text {max}}\) (\(p_{\textrm{T}} ^{\text {max}} \gtrsim 10\,\text {Ge\hspace{-.08em}V} \)) statistical uncertainties, which are uncorrelated between bins, dominate. This creates some dependence of the eigentunes on the bin widths of the data used in the fit, and leads to uncertainties in the tunes that are much larger and more asymmetric than those on the data points when all eigentunes are added in quadrature.
The number of eigentunes is equal to twice the number of free parameters used in the fit. For the QCD-inspired and gluon-move models, there are 14 and 12 eigentunes, respectively. However, using all 12 or 14 eigentunes to calculate the tune uncertainty for a given observable is computationally inefficient. Therefore, the “up” and “down” tune settings are calculated by comparing the positive and negative differences between each eigentune and the central prediction of the nominal tune for each bin of the observable. The upper and lower bounds of the uncertainty in each bin are defined by adding the positive differences in quadrature and taking the square root, and similarly for the negative differences. These “up” and “down” variations are then fit using the same procedure as in Sect. 3 to obtain new parameter sets that can be used to estimate the uncertainties in the nominal tune.
The predictions of the CP5-CR1 and CP5-CR2 tunes are compared with observables at 13\(\,\text {Te\hspace{-.08em}V}\) in Figs. 25, 26, 27, 28. The shaded bands in these figures correspond to the envelope of the predictions of the eigentunes of each tune. The parameters of the “up” and “down” tunes for CP5-CR1 and CP5-CR2 are given in Tables 10 and 11, respectively.
The ColourReconnection:junctionCorrection parameter
The QCD-inspired model implemented in pythia 8 allows for the creation of string junctions when three color lines meet at a single point. The presence of these junctions can affect the number of particles produced in a collision and result in the production of additional gluons and quark–antiquark pairs. The junctionCorrection parameter in pythia 8 controls the strength of this effect and can impact various observables, including the charged particle pseudorapidity distribution.
The model predictions, with their default parameter settings in pythia 8.226 and CP5, are given in Fig. 29 for the \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) distribution measured by the CMS experiment at 13\(\,\text {Te\hspace{-.08em}V}\) [22], and in Fig. 30 for \(N_{\textrm{ch}}\) and \(p_{\textrm{T}} ^{\text {sum}}\) densities measured by CMS at 13\(\,\text {Te\hspace{-.08em}V}\) [17] in the transMIN and transMAX regions. The predictions for CP5-“QCD-inspired” were obtained by replacing the MPI-based CR model in CP5 with the QCD-inspired model, where the default value of the junctionCorrection parameter is 1.2. The other predictions presented in the figures were obtained by setting the junctionCorrection parameter to 4.0 and to 0.05, respectively. These values were chosen arbitrarily to test how the prediction changes when a relatively high or low value is set for the junctionCorrection parameter. These comparisons demonstrate the sensitivity of the junctionCorrection parameter to these observables. According to Ref. [14], the junctionCorrection parameter is most sensitive to the baryon/meson ratio in pp collisions. The sensitivity of the junctionCorrection parameter to the production of \(\varLambda \) baryons and \(\text {K}_{\textrm{S}}^{0}\) mesons, measured by the CMS experiment at \(\sqrt{s}=7\,\text {Te\hspace{-.08em}V} \) [16], is shown in Fig. 31.
We also derived a new version of CP5-CR1 by including the rapidity distributions of \(\varLambda \) baryons and \(\text {K}_{\textrm{S}}^{0}\) mesons, as well as some recent baryon and meson measurements from ALICE and LHCb experiments [37, 38]. The new tune, named CP5-CR2-v2, resulted in a significant improvement in the description of the \(\varLambda \) rapidity distribution and reasonable agreement with the data for \(\varLambda \)/\(\text {K}_{\textrm{S}}^{0}\), but it was not able to reproduce the \(\text{ d }N_{\textrm{ch}}/\text{d }\eta \) at 13 TeV. The values of the parameters obtained in the fits of the CP5-CR1-v2 are presented in Table 12. The remaining parameters of pythia 8 are kept the same as in the CP5 tune.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
Funded by SCOAP3. SCOAP3 supports the goals of the International Year of Basic Sciences for Sustainable Development.
About this article
Cite this article
Tumasyan, A., Adam, W., Andrejkovic, J.W. et al. CMS pythia 8 colour reconnection tunes based on underlying-event data. Eur. Phys. J. C 83, 587 (2023). https://doi.org/10.1140/epjc/s10052-023-11630-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjc/s10052-023-11630-8