1 Introduction

The strong coupling constant, \(\alpha _{\text {s}}\), is one of the least precisely known fundamental parameters in the StandardModel of particle physics. Because of its importance for precision phenomenology at the LHC and elsewhere, large efforts have been undertaken in the past decades to reduce uncertainties in determinations of \(\alpha _{\text {s}}\) [1,2,3,4].

With the advent of modern particle detectors and sophisticated algorithms for their simulation and calibration, jet measurements have become very precise. Many determinations of \(\alpha _{\text {s}}\) in deep-inelastic scattering (DIS) and in hadron-hadron collisions are therefore based on measurements of the inclusive jet cross section, which is directly proportional to \(\alpha _{\text {s}}\) in DIS in the Breit frame and \(\alpha _{\text {s}} ^2\) in hadron-hadron collisions. Using the most precise predictions of perturbative quantum chromodynamics (pQCD) available at the time, all previous \(\alpha _{\text {s}}\) extractions (except for Ref. [5]) were performed at next-to-leading order (NLO) in \(\alpha _{\text {s}}\). Their total uncertainty is dominated by the contribution related to the renormalisation scale dependence of the NLO pQCD results. The recent completion of next-to-next-to-leading order (NNLO) predictions for the inclusive jet cross section [6, 7] promises a considerable reduction of the renormalisation scale dependence and will allow the inclusion of \(\alpha _{\text {s}}\) results from inclusive jet data in future determinations of the world average value of \(\alpha _{\text {s}}\) [1].

A determination of \(\alpha _{\text {s}}\) at NNLO from jet measurements in hadron-hadron collisions is still not readily achievable, because the new NNLO pQCD calculations are computationally very demanding and cannot yet be repeated quickly for different parton distribution functions (PDFs) or values of \(\alpha _{\text {s}} (M_{\text {Z}})\). In preparation of such a determination, it is desirable to study a simultaneous analysis of data sets from different processes and experiments. This study includes an investigation of the consistency of the various data sets and an estimation of the reduction of the experimental contributions to the \(\alpha _{\text {s}}\) uncertainty. The groundwork for these two aspects is presented in this article.

We review inclusive jet cross section data over a wide kinematic range, from different experiments for various initial states and centre-of-mass energies, and study their potential for determinations of \(\alpha _{\text {s}}\). The consistency of the diverse data sets is examined and the benefit of their simultaneous inclusion is demonstrated. Different methods for the statistical analysis of the data are compared and one method is proposed for a coherent treatment of all data sets in an extraction of \(\alpha _{\text {s}} (M_{\text {Z}})\).

The article is structured as follows: The experimental data sets and the theoretical predictions are introduced in Sects. 2 and 3, respectively. Methods and results from previous \(\alpha _{\text {s}}\) determinations by different experimental collaborations are discussed and employed in Sect. 4. The strategy for a determination of \(\alpha _{\text {s}}\) from multiple data sets and the final result are presented in Sect. 5.

2 Experimental data

The first measurement of the inclusive jet cross section has been performed in 1982 by the UA2 Collaboration at the \(\hbox {Sp}{\bar{\hbox {p}}}\hbox {S}\) collider at a centre-of-mass energy of 540 GeV [8]. Further measurements have been conducted at centre-of-mass energies of

  • 540 GeV, 546 GeV, and 630 GeV at the \(\text {p}\bar{\text {p}}\) collider \(\hbox {Sp}{\bar{\hbox {p}}}\hbox {S}\) by the UA1 [9, 10] and UA2 experiments [11],

  • 546 GeV, 630 GeV, 1.8 TeV, and 1.96 TeVat the Tevatron \(\text {p}\bar{\text {p}}\) collider by the CDF [12,13,14,15,16,17] and D0 experiments [18,19,20],

  • 300 GeV and 320 GeV at the \(\text {ep}\) collider HERA by the H1 [21,22,23,24,25,26,27,28] and ZEUS experiments [29,30,31,32,33,34,35],

  • 200 GeV in \(\text {pp}\) collisions at RHIC by the STAR experiment [36],

  • and of 2.76 TeV, 7 TeV, 8 TeV, and 13 TeVin \(\text {pp}\) collisions at the LHC by the ALICE [37], ATLAS [38,39,40,41,42,43], and CMS experiments [44,45,46,47,48,49,50].

While earlier measurements established the inclusive jet cross section as a useful quantity to study QCD, large experimental uncertainties limited their use for QCD phenomenology. When the NLO pQCD corrections were computed [51,52,53], studies revealed collinear- or infrared-safety issues in the jet definitions used in the experimental measurements [54]. These issues were subsequently addressed and improved jet definitions were developed [55, 56] and applied in recent measurements.

Previously, \(\alpha _{\text {s}} (M_{\text {Z}})\) determinations were based on inclusive jet cross section data from individual experiments, as summarised in Table 1. An extraction of \(\alpha _{\text {s}} (M_{\text {Z}})\) from multiple inclusive jet cross section data sets has not been performed so far, except in the context of global PDF analyses, in which PDFs and \(\alpha _{\text {s}} (M_{\text {Z}})\) are determined simultaneously. These analyses, however, require data for a variety of measured quantities [21, 57,58,59,60]. In this article, \(\alpha _{\text {s}} (M_{\text {Z}})\) is determined in a fit to multiple inclusive jet cross section measurements from experiments at HERA, RHIC, the Tevatron, and the LHC. The analysis is based on one selected measurement from each, the H1, ZEUS, STAR, CDF, D0, ATLAS, and CMS collaborations, as listed in Table 2.

Table 1 Summary of previous determinations of \(\alpha _{\text {s}} (M_{\text {Z}})\) from inclusive jet cross sections. Rows 1–4 list the recent \(\alpha _{\text {s}} (M_{\text {Z}})\) extractions from double-differential inclusive jet cross sections by experimental collaborations that are studied in more detail in this work. Rows 5–12 and 13–16 summarise further determinations of \(\alpha _{\text {s}} (M_{\text {Z}})\) by experimental collaborations and by independent authors, respectively. The results in Refs. [61, 67] are reported for approximate NNLO (aNNLO) and NLO used for the pQCD predictions. In Ref. [61] only 22 out of the 110 D0 data points were used in the \(\alpha _{\text {s}} (M_{\text {Z}})\) extraction; the decomposition of the uncertainties is only provided for the aNNLO result. In case of Ref. [66], we only consider scale, PDF, and NP related uncertainties as theoretical uncertainty for reasons of comparability to the other listed results
Table 2 Overview of the inclusive jet data sets used in the \(\alpha _{\text {s}}\) determinations. For each data set the process (proc), the centre-of-mass energy \(\sqrt{s}\), the integrated luminosity \({\mathcal {L}}\), the number of data points, and the jet algorithm are listed. In case of \(\text {ep}\) collider data, the kinematic range may be defined by the four-momentum transfer squared \(Q^2\), the inelasticity \(y_{\mathrm{DIS}}\), or the angle of the hadronic final state \(|\cos \gamma _h|\) of the NC DIS process. In all cases, jets are required to be within a given range of pseudorapidity \(\eta \) or rapidity y in the laboratory frame

Whenever experiments provide multiple measurements, we include those measured with a collinear- and infrared-safe jet algorithm (\(k_{\text {T}}\) [68] or anti-\(k_{\text {T}}\) [69]) and with a larger jet size parameter R, which improves the stability of fixed-order pQCD calculations. The STAR experiment published inclusive jet data collected by two different triggers with partially overlapping jet \(p_{\text {T}}\) ranges. We choose the data set collected with the trigger covering the higher jet \(p_{\text {T}}\) range from 7.6 GeV up to 50 GeV. The measurements from the STAR and D0 experiments are using the midpoint cone jet algorithms (MP) [55]. The infrared-unsafety of this jet algorithm [54] prohibits NNLO pQCD predictions for these data sets, but it does not affect calculations at NLO. Four new measurements [28, 42, 43, 50] could not be included in this study; they are left for a future extension.

3 Theoretical predictions and tools

Predictions for the inclusive jet cross section in processes with initial-state hadrons are calculated as the convolution of the partonic cross section \({\hat{\sigma }}\) (computed in pQCD) and the PDFs of the hadron(s). The inclusive jet cross section in hadron-hadron collisions can be written as [1, 70]

$$\begin{aligned}&\sigma _{\text {pQCD,hh}}(\mu _{\text {r}},\mu _{\text {f}}) = \sum _{i,j} \int dx_1 \int dx_2 \nonumber \\&\quad f_{i/h_1}(x_1,\mu _{\text {f}})\,f_{j/h_2}(x_2,\mu _{\text {f}}) \, {\hat{\sigma }}_{ij\rightarrow \text {jet}+X}(\mu _{\text {r}},\mu _{\text {f}}) \, , \end{aligned}$$
(1)

where the sum is over all combinations of parton flavors i and j (quarks, anti-quarks, and the gluon). The \(f_{i,j/h_{1,2}}\) denote the PDFs for the parton flavours i or j in the initial-state hadrons \(h_1\) and \(h_2\), and \(x_{1}\) and \(x_{2}\) correspond to the fractional hadron momenta carried by the partons i and j, respectively. The partonic cross section \({\hat{\sigma }}_{ij\rightarrow \text {jet}+X}\) is computed as a perturbative expansion in \(\alpha _{\text {s}}\) as

$$\begin{aligned} {\hat{\sigma }}_{ij\rightarrow \text {jet}+X}(\mu _{\text {r}},\mu _{\text {f}}) \, = \, \sum _{n} \alpha ^n_s(\mu _{\text {r}}) \, c^{(n)}_{ij\rightarrow \text {jet}+X}(\mu _{\text {r}},\mu _{\text {f}}) \, , \end{aligned}$$
(2)

where the \(c^{(n)}_{ij\rightarrow \text {jet}+X}\) are computed from the pQCD matrix elements and the sum is over all orders of \(\alpha _{\text {s}}\) taken into account in the perturbative calculation. The renormalisation and factorisation scales are labelled \(\mu _{\text {r}} \) and \(\mu _{\text {f}} \), respectively. For inclusive jet production in hadron-hadron collisions, the first non-vanishing order (i.e. the leading order, LO) is given by \(n=2\), while \(n=3\) corresponds to the NLO corrections. For inclusive jet production in DIS in the Breit frame the partonic cross sections are convoluted with a single PDF and the LO (NLO) contribution is given by \(n=1\) (\(n=2\)). Hence, inclusive jet production in \(\text {pp}\), \(\text {p}\bar{\text {p}}\), and \(\text {ep}\) collisions is sensitive to \(\alpha _{\text {s}}\) already at LO .

For transverse jet momenta at the  TeVscale accessible at the LHC, electroweak (EW) tree-level effects of \(\mathcal {O}\left( \alpha \alpha _{\text {s}},\alpha ^2\right) \) and loop effects of \(\mathcal {O}\left( \alpha \alpha _{\text {s}} ^2\right) \) become sizeable [71]. A recent study of the complete set of QCD and EW NLO corrections has been presented in Ref. [72].

Non-perturbative (NP) corrections to the cross section due to multiparton interactions and hadronisation can be estimated by using Monte Carlo (MC) event generators. An overview of MC event generators for the LHC is presented in Ref. [73]. The size of this correction depends on the jet size R, shrinks with increasing jet \(p_{\text {T}}\), and becomes negligible at the  TeVscale. The total theory prediction for the inclusive jet cross section is given by

$$\begin{aligned} \sigma _{\text {theory}} = \sigma _{\text {pQCD}}\cdot \,c_{\text {EW}}\cdot \,c_{\text {NP}}, \end{aligned}$$
(3)

where \(c_{\text {EW}}\) and \(c_{\text {NP}}\) are the correction factors for electroweak and non-perturbative corrections, respectively.

The partonic cross section is computed at NLO accuracy for five massless quark flavours using the NLOJet++ program version 4.1.3 [74, 75] within the fastNLO framework at version 2 [76, 77] to allow us fast recalculations for varying PDFs, scales \(\mu _{\text {r}}\) and \(\mu _{\text {f}}\), and assumptions on \(\alpha _{\text {s}} (M_{\text {Z}})\). Jet algorithms are taken either from the FastJet software library [78] or, for jet cross sections in DIS, from NLOJet++. The PDFs are evaluated via the LHAPDF interface [79, 80] at version 6. The running of \(\alpha _{\text {s}} (\mu _{\text {r}})\) is performed at 2-loop order using the package CRunDec with five massless quark flavours [81, 82]. The minimal subtraction (\(\overline{\text {MS}}\)) scheme [83,84,85] has been adopted for the renormalisation procedure in these calculations.

For the computation of the inclusive jet cross section in hadron-hadron collisions, the renormalisation and factorisation scales, \(\mu _{\text {r}}\) and \(\mu _{\text {f}}\), are identified with each jet’s \(p_{\text {T}}\), i.e. \(\mu _{\text {r}} = \mu _{\text {f}} = p_{\text {T}}^\mathrm{jet} \). In neutral current (NC) DIS, the scales are chosen to be \({\mu _{\text {r}}}^2 =\frac{1}{2}\left( Q^2 + ({p_{\text {T}}^\mathrm{jet}})^2\right) \) and \({\mu _{\text {f}}}^2 = Q^2\) as used by the H1 Collaboration [27]. Alternative scale choices have been discussed with respect to NNLO predictions [5, 7, 86, 87], but are beyond the scope of this article.

The EW corrections, \(c_{\text {EW}}\), relevant for the LHC data are provided by the experimental collaborations together with the data, based on Ref. [71]. These are considered to have negligible uncertainties. Due to restrictions of the scale choices in this calculation, the leading jet’s transverse momentum, \(p_{\text {T}}^\text {max}\), is used to define the scales \(\mu _{\text {r}}\) and \(\mu _{\text {f}}\). The NP correction factors \(c_{\text {NP}}\), except for the STAR data [88], are also provided by the experimental collaborations, together with an estimate of the corresponding uncertainty [16, 20, 27, 32, 33, 41, 46, 61, 62].

4 Comparison of three extraction methods for \({\varvec{\alpha _{\text {s}} (M_{\text {Z}})}}\)

Commonly, the value of \(\alpha _{\text {s}} (M_{\text {Z}})\) is determined from inclusive jet cross sections in a comparison of pQCD predictions to the measurements. These \(\alpha _{\text {s}} (M_{\text {Z}})\) results therefore depend on details of the extraction method such as the treatment of uncertainties in the characterisation of differences between theory and data, or the evaluation and propagation of theoretical uncertainties to the final result. An overview of previous determinations of \(\alpha _{\text {s}} (M_{\text {Z}})\) from fits to inclusive jet cross section data is provided in Table 1. We choose the three \(\alpha _{\text {s}} (M_{\text {Z}})\) determinations performed by the CMS [62], D0 [61], and H1 [27] collaborations listed in the upper part of Table 1 for further study.

The three extraction methods differ in the following aspects:

  • the definition of the \(\chi ^2\) function to quantify the agreement between theory and data,

  • the uncertainties considered in the \(\chi ^2\) function,

  • the strategy to determine the central result for \(\alpha _{\text {s}} (M_{\text {Z}})\),

  • the propagation of the uncertainties to the value of \(\alpha _{\text {s}} (M_{\text {Z}})\),

  • the choice of PDF sets,

  • the consideration of the \(\alpha _{\text {s}} (M_{\text {Z}})\) dependence of the PDFs, and

  • the treatment of further theoretical uncertainties.

To study the impact of these differences, we have implemented the three methods in our computational framework and will refer to them as “CMS-type”, “D0-type”, and “H1-type”, respectively. As an integral part of these methods and to reproduce as precisely as possible the respective published results, we restrict ourselves in this section to the use of the original somewhat older PDF sets. The CMS result was obtained with the CT10 PDF set [89], and the D0 and H1 results with MSTW2008 PDFs [90]. The CMS-type and D0-type methods use the entire \(\alpha _{\text {s}}^\text {PDF}(M_{\text {Z}})\) series available for the PDF set, whereas the H1-type method uses a PDF determined with a value of \(\alpha _{\text {s}}^\text {PDF}(M_{\text {Z}}) =0.1180\).

Each method is then employed to extract \(\alpha _{\text {s}} (M_{\text {Z}})\) from each of the individual data sets selected in Sect. 2, cf. also Table 2. The experimental uncertainties and their correlations are treated according to the respective prescriptions by the experiments. The resulting \(\alpha _{\text {s}} (M_{\text {Z}})\) values are listed in Table 3.

Table 3 Values of \(\alpha _{\text {s}} (M_{\text {Z}})\) with experimental uncertainties obtained using the three extraction methods CMS-type, D0-type, and H1-type together with the CT10 or MSTW2008 PDF set at NLO . The underlined values can be directly compared with the results published in Refs. [27, 61, 62]. Some fits to the STAR data do not exhibit a local minimum, in which case no value is listed

In a first step, these results are compared to the ones obtained by the CMS [62], D0 [61], and H1 [27] collaborations as listed in Table 1. All three central results are reproduced, the H1 result exactly, and the CMS and D0 results within \(+0.0003\) and \(+0.0001\). Such small differences can easily be caused already by using different versions of LHAPDF (e.g. changes from version 5 to version 6). The experimental uncertainties of the CMS and H1 analyses are exactly reproduced.Footnote 1

Fig. 1
figure 1

Values of \(\alpha _{\text {s}} (M_{\text {Z}})\) with experimental uncertainties obtained using the three extraction methods CMS-type, D0-type, and H1-type. The upper row compares the results for each data set employing the PDF set CT10 (left) or MSTW2008 (right). In addition, the world average value [1] is shown together with a band representing its uncertainty. The bottom row displays the values of \(\chi ^2/n_{\text {dof}} \) for each fit using the CT10 (left) or MSTW2008 PDF set (right). The colours illustrate the values of \(\chi ^2/n_{\text {dof}} \). For the STAR data and the MSTW2008 PDF set no local minimum was found in case of the CMS-type and D0-type fits (blank areas)

In a second step, the \(\alpha _{\text {s}} (M_{\text {Z}})\) results and their experimental uncertainties are compared to each other and their dependencies on the extraction method and PDFs are studied. The \(\alpha _{\text {s}} (M_{\text {Z}})\) results determined for each data set are displayed in Fig. 1 (top row) for the three different extraction methods using CT10 PDFs (left) and MSTW2008 PDFs (right). For the STAR data, \(\alpha _{\text {s}} (M_{\text {Z}})\) results cannot be determined in case of the CMS-type and D0-type methods with MSTW2008 PDFs, since no local \(\chi ^2 \) minima are found. In all other cases the \(\alpha _{\text {s}} (M_{\text {Z}})\) results obtained with MSTW2008 PDFs are rather independent of the extraction method for all data sets. This is different when using CT10 PDFs: While in this case the extraction method has little impact on the \(\alpha _{\text {s}} (M_{\text {Z}})\) results from HERA data (H1 and ZEUS), it notably affects the results for the LHC data (ATLAS and CMS), and has large effects for the Tevatron data (CDF and D0). In the latter cases, the D0-type method produces significantly lower \(\alpha _{\text {s}} (M_{\text {Z}})\) results as compared to the other two methods.

The \(\chi ^2/n_{\text {dof}} \) values for the \(\alpha _{\text {s}} (M_{\text {Z}})\) extractions are displayed in Fig. 1 (bottom row) for the three extraction methods using CT10 PDFs (left) and MSTW2008 PDFs (right). Overall, the fits exhibit reasonable values of\(\chi ^2/n_{\text {dof}} \), thus indicating agreement between theory and data. Exceptions are observed for the ZEUS data with rather low values of \(\chi ^2/n_{\text {dof}} \), and for the ATLAS data, where the values of \(\chi ^2/n_{\text {dof}} \) are large as also observed elsewhere [42, 91, 92].

The PDF dependence is displayed in Fig. 2, where the \(\alpha _{\text {s}} (M_{\text {Z}})\) results for CT10 and MSTW2008 PDFs are compared to each other, both obtained using the H1-type method. While the PDF choice has no significant effect for the results from the H1, ZEUS, and D0 data, smaller variations are seen for the CDF data, and a large dependence for the ATLAS and CMS data. Re-investigating this PDF dependence in the context of a common determination of \(\alpha _{\text {s}} (M_{\text {Z}})\) as described in the next section, we observe that differences with respect to the updated PDF sets, CT14 [93] and MMHT2014 [60], are reduced.

5 Determination of \({\varvec{\alpha _{\text {s}} (M_{\text {Z}})}}\) from multiple inclusive jet data sets

The analysis of multiple data sets requires their correlations to be taken into account. For the present study, measurements from different colliders are considered to be uncorrelated because of the largely complementary kinematic ranges of the data sets and different detector calibration techniques. Furthermore, investigations with respect to H1 and ZEUS data [58], CDF and D0 data [94], or ATLAS and CMS data [95, 96], did not identify a relevant source of experimental correlation. This only leaves theoretical uncertainties as a source of potential correlations in this study. For the determination of NP effects and their uncertainties various methods and MC event generators have been employed [16, 20, 27, 32, 36, 41, 61, 62, 88]. While a consistent derivation of these corrections with corresponding correlations is desirable, this is beyond the scope of this analysis. Hence, the NP correction factors and their uncertainties are considered to be uncorrelated between the different data sets. In contrast, the PDF uncertainties and the uncertainties due to the renormalisation and factorisation scale variations are treated as fully correlated; the relative variations with respect to the nominal scales are performed simultaneously for all data sets.

The method employed for the simultaneous \(\alpha _{\text {s}}\) extraction combines components of the individual methods outlined in the previous section and is referred to as “CMS-type method”. The central \(\alpha _{\text {s}} (M_{\text {Z}})\) result is found in an iterative \(\chi ^2\) minimisation procedure adopted from the H1-type method, where a normal distribution is assumed for the relative uncertainties. The exact \(\chi ^2\) formula is given by Eq. (4) of Appendix A. Whereas in the H1-type \(\chi ^2\) expression, only experimental uncertainties are taken into account, the common-type method also accounts for the NP and PDF uncertainties in the \(\chi ^2\) expression, as in the CMS-type and D0-type methods. This \(\chi ^2\) definition treats variances as relative values and thus has advantages, e.g. when numerically inverting the covariance matrix. Moreover, uncertainties of experimental and theoretical origin are put on an equal footing. As in the H1-type method, and in contrast to the CMS-type and D0-type ones, only PDF sets obtained with a fixed value of \(\alpha _{\text {s}} (M_{\text {Z}}) =0.1180\) are employed in the determination of the central \(\alpha _{\text {s}} (M_{\text {Z}})\) result, leaving the \(\alpha _{\text {s}}\) dependence of the PDFs to be treated as a separate uncertainty.

In contrast to Sect. 4, where the three investigated methods were introduced, we do not need to restrict ourselves to PDF sets available at the time of their publication. Instead we take advantage from significant progress in the PDF determination with respect to theoretical ingredients, computational techniques, and data input and consider the following newer PDF sets ABMP16 [97, 98], CT14 [93], HERAPDF2.0 [58], MMHT2014 [60], and NNPDF3.0 [99]. Since in case of CT14, MMHT2014, and NNPDF3.0 also jet data we analyse here have been used, the effect of potential correlations due to the dual usage of the data has been studied in Appendix B, where additional five variants of NNPDF3.0 [99] with different input data selections are employed. The effect of such correlations is found to be small and is covered by an additional uncertainty as defined below.

Fig. 2
figure 2

Comparison of the \(\alpha _{\text {s}} (M_{\text {Z}})\) results with their experimental uncertainties obtained using the H1-type extraction methods for CT10 and MSTW2008 PDFs. The world average value [1] is shown together with a band representing its uncertainty

Fig. 3
figure 3

Left: Illustration of the \(\chi ^2/n_{\text {dof}} \) values for fits to each data set individually. Right: Illustration of the \(\chi ^2/n_{\text {dof}} \) values for simultaneous fits omitting a single data set at a time. The included or respectively excluded data set is indicated on the y axis and the PDF set on the x axis. The fits are performed for each PDF set in the envelope definition of the \(\text{ PDFset }\) uncertainty

In summary, the individual contributions to the total uncertainty of the \(\alpha _{\text {s}} (M_{\text {Z}})\) result are evaluated as follows: The experimental uncertainty (exp) is obtained from the Hesse algorithm [100] when performing the \(\alpha _{\text {s}} (M_{\text {Z}})\) extraction with only the uncertainties of the measurements included. The NP and PDF uncertainties are derived by repeating the \(\alpha _{\text {s}} (M_{\text {Z}})\) extraction while successively including the corresponding uncertainty contributions and calculating the quadratic differences. Further sources of systematic effects are considered as follows:

  • The “PDF\(\alpha _{\text {s}}\) ” uncertainty accounts for the initial assumption of \(\alpha _{\text {s}}^\text {PDF}(M_{\text {Z}}) =0.1180\) made in the PDF extraction, which is not necessarily consistent with the value of \(\alpha _{\text {s}} (M_{\text {Z}})\) used in the pQCD calculation. It is calculated as the maximal difference between any of the results obtained with PDF sets determined for \(\alpha _{\text {s}}^\text {PDF}(M_{\text {Z}}) = 0.1170\), 0.1180, and 0.1190, and therefore covers a difference of \(\varDelta \alpha _{\text {s}}^\text {PDF}(M_{\text {Z}}) =0.0020\), which is somewhat more conservative than the recommendation in Ref. [101].

  • The “\(\text{ PDFset }\) ” uncertainty covers differences due to the considered PDF set. These are caused by the data selection, assumptions made on parameterisation, parameter values, theory input, or the analysis method for the PDF determination. It also comprises the potential effect of the dual use of jet cross sections, first in the PDF determination and then in the fit of \(\alpha _{\text {s}} (M_{\text {Z}})\). Therefore, we define it as half of the width of the envelope constructed from the five values for the ABMP16, CT14, MMHT2014, and NNPDF3.0_nojet PDFs together with the NNPDF3.0 result. These global PDF sets either comprise a maximum of data other than jet cross sections or no jet cross sections at all.

  • The uncertainty due to variations of the renormalisation and factorisation scales customarily is taken as an estimate for the error of a fixed-order calculation caused by the truncation of the perturbative series. It is obtained using six additional \(\alpha _{\text {s}} (M_{\text {Z}})\) determinations, in which the nominal scales \((\mu _{\text {r}},\mu _{\text {f}})\) are varied by the conventional factors of (1 / 2, 1 / 2), (1 / 2, 1), (1, 1 / 2), (1, 2), (2, 1), and (2, 2). The scale factor combinations of (1 / 2, 2) and (2, 1 / 2) are customarily omitted [102,103,104].

The NP, PDF, PDF\(\alpha _{\text {s}}\), \(\text{ PDFset }\), and scale uncertainties are added in quadrature to give the theoretical uncertainty (theo). The total uncertainty (tot) further includes the experimental uncertainty.

Table 4 Values of \(\alpha _{\text {s}} (M_{\text {Z}})\) for the simultaneous fit to the H1, ZEUS, STAR, CDF, D0, and CMS data using the common-type method for various PDF sets. The experimental, NP, \(\text{ PDFset }\), and scale uncertainties remain mostly unchanged under a change of the PDF set and are quoted only once for NNPDF3.0

In the previous section, cf. Fig. 1, it was found that the \(\chi ^2/n_{\text {dof}} \) values differ significantly from unity for some of the data sets. This necessitates to investigate in further detail the consistency of the data within an individual data set as well as among the different data sets. Therefore, the common-type method is employed to extract \(\alpha _{\text {s}} (M_{\text {Z}})\) from each individual data set and for each of the PDFs ABMP16, CT14, HERAPDF2.0, MMHT2014, and NNPDF3.0. The resulting \(\chi ^2/n_{\text {dof}} \) values are displayed in Fig. 3 left. Detailed listings of the \(\alpha _{\text {s}} (M_{\text {Z}})\) results and their uncertainties are given in Appendix B.

For a given data set, \(\chi ^2/n_{\text {dof}} \) is rather independent of the PDF set used for the predictions and varies between 0.8 and 1.2. These values indicate reasonable agreement of the predictions with the data. Exceptions are rather low values of \(\chi ^2/n_{\text {dof}} \) around 0.54 found for all PDF sets with the ZEUS data and large \(\chi ^2/n_{\text {dof}} \) values between 1.9 and 3.5 exhibited by the ATLAS data, also for all PDFs. Exceptionally large \(\chi ^2/n_{\text {dof}} \) values appear for the Tevatron or LHC data together with theory predictions using the HERAPDF2.0 set, and for the STAR data in conjunction with the ABMP16 PDF set.

To further investigate the consistency among the data sets, a series of \(\alpha _{\text {s}} (M_{\text {Z}})\) extractions is performed, in which \(\alpha _{\text {s}} (M_{\text {Z}})\) is determined simultaneously from all data sets but one. This is repeated for each PDF set. The resulting \(\chi ^2/n_{\text {dof}} \) values are displayed in Fig. 3 right. Apparently, the exclusion of the ATLAS data leads to significantly smaller \(\chi ^2/n_{\text {dof}} \) values independent of the PDF set used. This hints at a compatibility issue when using all data sets together, which is not present when the ATLAS data set is ignored. Therefore, we choose to exclude the ATLAS data for our main result, which is thus obtained from the CDF, CMS, D0, H1, STAR, and ZEUS inclusive jet data. The choice of the NNPDF3.0 set for the central result yields

$$\begin{aligned} \alpha _{\text {s}} (M_{\text {Z}})= & {} 0.1192\,(12)_{\text {exp}}\,(5)_{\text {NP}}\\&(7)_{\mathrm{PDF}}\,(5)_{{\mathrm{PDF}\, \alpha _{\mathrm{s}}}}\,(7)_{\mathrm{PDFset}}\, (^{+59}_{-38})_{\text {scale}}, \end{aligned}$$

with \(\chi ^2 = 328\) for 381 data points. This result is consistent with the world average value of \(0.1181\,(11)\) [1], which has been derived exclusively using QCD theory at NNLO or higher. The experimental uncertainty for the extraction from multiple data sets is significantly smaller than each of the experimental uncertainties reported previously for the separate \(\alpha _{\text {s}} (M_{\text {Z}})\) determinations. Results obtained with further PDF sets constitute the \(\text{ PDFset }\) uncertainty as defined previously. They are listed in Tables 4 and 7 together with the PDFFootnote 2 and PDF\(\alpha _{\text {s}}\) uncertainties as appropriate for the respective PDF set. The values of \(\chi ^2/n_{\text {dof}} \) for the main PDF sets can be read off from row six of Fig. 3 right.

Fig. 4
figure 4

The \(\alpha _{\text {s}} (M_{\text {Z}})\) values from fits to individual data sets are compared to our simultaneous fit to H1, ZEUS, STAR, CDF, D0, and CMS data, and to the world average value [1]. The inner error bars represent the experimental uncertainty and the outer ones the total uncertainty. For reasons explained in the text, the ATLAS data are excluded from the common fit and only the result of a separate fit is indicated by the open circle

Fig. 5
figure 5

Ratio of data over theory for the selected inclusive jet cross sections listed in Table 2 as a function of jet \(p_{\text {T}}\). The NLO predictions are computed with the NNPDF3.0 PDF set for the fitted \(\alpha _{\text {s}} (M_{\text {Z}})\) value of 0.1192, which is determined considering all presented data except for the ATLAS data set. They are complemented with non-perturbative corrections and, where appropriate, with electroweak corrections

The \(\alpha _{\text {s}} (M_{\text {Z}})\) values from fits using the various PDF sets given in Table 4 are found to be consistent within the experimental uncertainty. The NP, PDF, and PDF\(\alpha _{\text {s}}\) uncertainties are smaller than the experimental uncertainty, while the \(\text{ PDFset }\) uncertainty is of a similar size as the experimental one. The scale uncertainty is the largest individual uncertainty and is more than three times larger than any other uncertainty. Results of the \(\alpha _{\text {s}} (M_{\text {Z}})\) extractions from single data sets, cf. Appendix B, from the simultaneous \(\alpha _{\text {s}} (M_{\text {Z}})\) extraction from all data sets, and the world average [1] are compared in Fig. 4 and are seen to be consistent with each other.

The ratio of data to the predictions as a function of jet \(p_{\text {T}}\) for all selected data sets is presented in Fig. 5. The predictions are computed for \(\alpha _{\text {s}} (M_{\text {Z}}) =0.1192\) as obtained in this analysis. Visually, all data sets are well described by the theory predictions.

6 Summary and outlook

Inclusive jet cross section data from different experiments at various particle colliders with jet transverse momenta ranging from 7 GeV up to 2 TeVare explored for determinations of \(\alpha _{\text {s}} (M_{\text {Z}})\) using next-to-leading order predictions.

Previous \(\alpha _{\text {s}} (M_{\text {Z}})\) determinations reported by the CMS, D0, and H1 collaborations [27, 61, 62] are taken as a baseline, and these \(\alpha _{\text {s}} (M_{\text {Z}})\) extraction methods, which differ in various aspects, are applied to inclusive jet cross section data measured by the ATLAS, CDF, CMS, D0, H1, STAR, and ZEUS experiments [16, 20, 27, 32, 36, 41, 62]. Differencesamong the \(\alpha _{\text {s}} (M_{\text {Z}})\) results due to the extraction technique are found to be negligible in most cases. A new extraction method is proposed, which combines aspects of the baseline approaches above.

In a statistical analysis, data measured by the CDF, CMS, D0, H1, STAR, and ZEUS experiments are found to be well described by pQCD predictions at next-to-leading order, and hence are considered to be mutually consistent. Moreover, the values of \(\alpha _{\text {s}} (M_{\text {Z}})\) determined from each individual data set are found to be consistent among each other. By determining \(\alpha _{\text {s}} (M_{\text {Z}})\) simultaneously from these data, the experimental uncertainty of \(\alpha _{\text {s}} (M_{\text {Z}})\) is reduced to 1.0%, as compared to 1.9% when only the single most precise data set of that selection is considered.

The largest contribution to the uncertainty of \(\alpha _{\text {s}} (M_{\text {Z}})\) originates from the renormalisation scale dependence of the next-to-leading order pQCD calculation. This uncertainty is expected to be reduced once the next-to-next-to-leading order predictions become available for such studies. Furthermore, a reevaluation of the non-perturbative corrections and their uncertainties for all data sets in a consistent manner is recommended for a determination of \(\alpha _{\text {s}} (M_{\text {Z}})\) at high precision. The presented study and the developed analysis framework provide a solid basis for future determinations of \(\alpha _{\text {s}} (M_{\text {Z}})\) and facilitate the inclusion of additional data sets, further observables, and improved theory predictions.