Properties of $g\rightarrow b\bar{b}$ at small opening angles in $pp$ collisions with the ATLAS detector at $\sqrt{s}=13$ TeV

The fragmentation of high-energy gluons at small opening angles is largely unconstrained by present measurements. Gluon splitting to $b$-quark pairs is a unique probe into the properties of gluon fragmentation because identified $b$-tagged jets provide a proxy for the quark daughters of the initial gluon. In this study, key differential distributions related to the $g\rightarrow b\bar{b}$ process are measured using 33 fb$^{-1}$ of $\sqrt{s}=13$ TeV $pp$ collision data recorded by the ATLAS experiment at the LHC in 2016. Jets constructed from charged-particle tracks, clustered with the anti-$k_t$ jet algorithm with radius parameter $R = 0.2$, are used to probe angular scales below the $R=0.4$ jet radius. The observables are unfolded to particle level in order to facilitate direct comparisons with predictions from present and future simulations. Multiple significant differences are observed between the data and parton shower Monte Carlo predictions, providing input to improve these predictions of the main source of background events in analyses involving boosted Higgs bosons decaying into $b$-quarks.


Introduction
The search for highly Lorentz-boosted Higgs bosons produced by Standard Model (SM) processes [1] or by beyond-the-SM (BSM) processes [2][3][4][5][6][7] is of crucial importance at the Large Hadron Collider (LHC). As the branching ratio for the Higgs boson to decay into bottom quark pairs dominates the total decay rate, the boosted H → bb channel can be the most sensitive to BSM effects entering at high Higgs boost. Algorithms for identifying jets resulting from bottom quark fragmentation are very powerful, so the main background for searches with boosted Higgs bosons contains b-quarks. The main contribution to this background is gluon splitting to bb pairs at small opening angles since the angle between the b-quarks in H → bb scales with the Higgs boson mass (m H ) and momentum (p H ) as m H /p H . The g → bb process also contributes to many other important SM measurements and searches by providing a source of additional real b-quark jets that can fake a signal for b-quarks originating from other processes (see e.g. Refs. [8][9][10][11][12]).
The high transverse momentum and low angular separation regime for g → bb can be probed at the LHC using b-tagged small-radius jets within large-radius jets. This topology is used to calibrate b-tagging in dense environments [50][51][52] and is studied phenomenologically [53,54]. The measurement shown in this paper builds on these studies by using data collected by the ATLAS detector from √ s = 13 TeV pp collisions in order to perform a differential cross-section measurement of g → bb inside jets at high transverse momentum -see Figure 1 for a representative Feynman diagram. Small-radius jets built from charged-particle tracks are used as proxies for b-quarks and can be used as precision probes of the small opening-angle regime.
This paper is organized as follows. After a brief introduction to the ATLAS detector in Section 2, the data and simulations used for the measurement are documented in Section 3. Section 4 describes the event selection and Section 5 lists and motivates the observables to be measured. The key challenge in the measurement is the estimation of background processes, which is performed using a data-driven approach illustrated in Section 6. The data are unfolded to correct for detector effects to allow direct comparisons to particle-level predictions. This procedure is explained in Section 7 and the associated systematic uncertainties are detailed in Section 8. The results are presented in Section 9 and the paper concludes with Section 10.  Figure 1: A representative diagram for the high-p T g → bb process studied in this paper.

ATLAS detector
The ATLAS detector [55] is a multipurpose particle detector with a forward/backward-symmetric cylindrical geometry. The detector has a nearly 4π coverage in solid angle1 and consists of an inner tracking detector, electromagnetic and hadronic calorimeters, and a muon spectrometer. The inner detector (ID) is surrounded by a superconducting solenoid providing a 2 T magnetic field and covers a pseudorapidity range of |η| < 2.5. The ID is composed of silicon pixel and microstrip detectors as well as a transition radiation tracker. For the LHC √ s = 13 TeV run, the silicon pixel detector has been upgraded to include an additional layer close to the beam interaction point [56]. The lead/liquid-argon electromagnetic sampling calorimeters measure electromagnetic energies with high granularity for the pseudorapidity region of |η| < 3.2. Hadron energies are measured by a hadronic (steel/scintillator tile) calorimeter with |η| < 1.7. The forward and endcap regions between 1.5 < |η| < 4.9 are instrumented with liquid-argon calorimeters for both the electromagnetic and hadronic measurements. Surrounding the calorimeters, the muon spectrometer includes three large superconducting toroidal magnets with eight coils each. The muon spectrometer has a system of precision tracking chambers covering |η| < 2.7 and fast trigger chambers covering |η| < 2.4. A two-level trigger system is used to select events for further analysis [57]. The first-level trigger is implemented in hardware and utilizes partial detector information to reduce the accepted event rate to 100 kHz. The high-level trigger is based on software and accepts events at a rate of 1 kHz.

Datasets
This measurement uses the dataset of pp collisions recorded by the ATLAS detector in 2016, corresponding to an integrated luminosity of 33 fb −1 at a center-of-mass-energy of √ s = 13 TeV. Events are considered only if they were collected during stable beam conditions and satisfy data quality requirements. Due to the high instantaneous luminosity and the large total inelastic proton-proton scattering cross section, on average there are about 25 simultaneous (pileup) collisions in each bunch crossing.
The measurement presented in this paper uses a variety of Monte Carlo (MC) simulated event samples to correct for detector effects as well as for direct comparisons with the unfolded data. Inclusive jet events were generated at leading order in the strong coupling constant, α S , with P 8.1 [58] using a 2 → 2 matrix element, the NNPDF2.3LO PDF set [59], and a set of generator parameter values called the A14 tune [60]. The E G [61] program was used to model the heavy-flavor decays to agree with experimental data. Additional inclusive jet events were simulated using a different generator in order to study the impact of modeling uncertainties related to both the perturbative and non-perturbative aspects of fragmentation. considered before the addition of pileup and therefore there is no ambiguity in selecting the collision vertex.
Calorimeter jets are built from calibrated topological calorimeter-cell clusters [67] using the anti-k t [68] algorithm with radius parameter R = 1.0 as implemented in FastJet [69]. Jets are groomed using a trimming procedure [70]. This procedure reclusters the constituents of a jet into subjets with a smaller radius of size R sub = 0.2 and removes those subjets with a low fraction of the full jet momentum f cut = 0.05. Following jet grooming, the mass and momentum of the resulting jets are corrected so that the detector-level values match the particle-level values on average [71]. These large-radius jets are proxies for the gluons.
Smaller-radius jets are used as proxies for the b-quarks originating from the gluons. For this purpose, jets are clustered using tracks as inputs (track-jets). Track-jets clustered from tracks with p T > 500 MeV that have been well-matched to the primary vertex2 and are composed of at least one pixel detector hit and at least six hits in the silicon tracker [72] are constructed using the anti-k t algorithm with R = 0.2. Track-jets are required to have at least two tracks. Four-vectors are calculated for each track, assuming the mass of the charged pion. Small-radius jets are matched to large-radius jets via ghost association [73]. This matching procedure creates ghost versions of the small-radius jets with the same direction but infinitesimal p T . Jet clustering is repeated and small-radius jets are assigned to the large-radius jet that contains their ghosted version. Since the jet finding algorithm is infrared safe, the four-momenta of the jets are unaffected by the addition of ghosts.
Particle-level jets are clustered using the same algorithms as for detector-level jets, except the inputs to jet finding are all stable particles (cτ > 10 mm) excluding all muons and neutrinos. The same trimming algorithm applied to calorimeter jets is also applied to the large-radius particle-level jets. Particle-level track-jets are formed from all stable charged particles that have p T > 500 MeV and |η| < 2.5, excluding muons.
Track-jets that are likely to have originated from the fragmentation of a b-quark (b-jet) are identified using the MV2c10 algorithm [74, 75], which is a combination of three baseline algorithms IP3D, SV, and JetFitter. The IP3D algorithm uses log-likelihood ratios of the three-dimensional signed impact parameter significance of tracks associated with jets. The SV tagger reconstructs the secondary decay vertices of b-hadrons. The JetFitter algorithm reconstructs the topology of detached vertices along the b-hadron decay axis. Finally, the MV2c10 algorithm combines the outputs of the baseline algorithms with a boosted decision tree and assigns a probability of a jet being a b-jet, c-jet, or light-flavor jet. The selected working point corresponds to a b-jet efficiency, a c-jet efficiency, and a light-flavor-jet rejection of b = 60%, c = 15%, and 1/ light = 480, respectively, as measured in tt events for jets with p T > 10 GeV and |η| < 2.5. A scale factor is applied to MC events to account for the measured efficiency difference between data and MC events at the chosen working point [76].
Particle-level track-jets are tagged as b-jets if there is a b-hadron from the simulated event record with p T > 5 GeV that is ghost-associated with the jet. If instead a hadron containing a c-quark from the event record with p T > 5 GeV can be matched to the jet by ghost-association, it is declared a c-jet. All other jets are declared to be light-flavor jets.

Event selection
At detector level events are selected using single-jet triggers. In the first-level trigger, a sliding-window algorithm based on low-granularity calorimeter towers records events with transverse energy greater than 100 GeV. In the high-level trigger, R = 1.0 anti-k t jets are formed from calibrated calorimeter-cell clusters and the p T threshold is 420 GeV. This trigger scheme is fully efficient for calibrated offline jets with p T > 450 GeV and |η| < 2 and therefore these kinematic requirements are used to select jets for the measurement. The offline analysis requires the highest-p T calorimeter jet to have at least two associated track-jets with p T > 10 GeV and |η| < 2.5. If there are more than two track-jets, only the leading two are used for subsequent analysis. In order to enhance the g → bb purity, the leading track-jet associated with the selected calorimeter jet must be b-tagged by the MV2c10 algorithm at the 60% efficiency working point. Requiring both track-jets to be b-tagged increases the purity but degrades the precision of the background fit described in Section 6 and so only one is required.
At particle level events are required to have at least one large-radius jet with p T > 450 GeV. The leading jet needs to have at least two associated particle-level track-jets with p T > 10 GeV. Both of the associated small-radius jets must be tagged as b-jets.
This inclusive event selection produces a sample where QCD scattering processes dominate.

Observables
The kinematic properties of the g → bb process are characterized by three quantities: the opening angle between the b-quarks, the momentum sharing between the b-quarks, and the orientation of the gluon splitting relative to the gluon production plane. The first of these quantities is probed by measuring the ∆R(b, b) = ∆φ(b, b) 2 + ∆η(b, b) 2 between track-jets. Momentum sharing is explored using the quantity z(p T ) = p T,2 /(p T,1 + p T,2 ), where p T,1 and p T,2 are the transverse momenta of the leading and subleading track-jets, respectively. A quantity sensitive to the relative orientation of the gluon splitting is ∆θ ppg,gbb , which is the angle between the plane spanned by the beam-line and the vector sum of the two track-jets and the plane spanned by the two track-jets. The angular quantities ∆R(b, b) and ∆θ ppg,gbb are depicted in Figure 2.
In addition to these quantities, the dimensionless mass log(m bb /p T ) is also measured, where the mass and p T in the logarithm are computed from the four-vector sum of the two track-jets. The bb mass is an important observable for measurements and searches with Higgs and Z bosons. Track-jets, in contrast to the calorimeter-based subjets from trimming, are used due to their excellent angular resolution; in simulation, there is little difference between using the directions from the track-jets, the full jets (including neutrals), or the b-hadrons from the b-quark fragmentation. There is some discrepancy between these different objects for the energy-dependent observables, but the track-jets are still useful due to their excellent momentum resolution in the gluon p T range probed in this measurement. The excellent angular and momentum resolutions are presented in Figure 3, which shows the detector response for all four observables targeted with this measurement.

Side view
Top view Figure 2: Schematic diagrams illustrating the ∆R(b, b) and ∆θ ppg,gbb observables. In this example, the gluon is emitted at η = 0.

Background estimation
After the event selection, the contribution from large-radius jets that do not have two associated track-jets containing B-hadrons is subtracted from data, as described below, before correcting for detector effects.
The fraction of background events may not be well-modeled by the simulation, so correction factors are determined from data template fits to the impact parameter distribution and applied for each bin of the four target observables of the analysis prior to subtraction. In each bin of the target observable distributions, the distribution of the signed impact parameter significance s d 0 is fitted to data using templates from simulation while letting the fraction of each flavor component float in the fit [50,51]. For a given track, , where d 0 is the transverse impact parameter relative to the beam-line and σ(d 0 ) is the uncertainty in d 0 from the track fit and the variable s j is the sign of d 0 with respect to the jet axis: where ì r IP,xy and ì r PV,xy are the locations of the track impact parameter and primary vertex, respectively, in the transverse plane.
Due to the long lifetime of b-hadrons, the values of s d 0 for tracks in b-jets tend to be larger than those for tracks in c-jets and light-flavor jets. Therefore, the distribution of s d 0 can be used to extract the fractions of b-jets, c-jets, and light-flavor jets using templates from simulation. For each track-jet j i , the s d 0 from the track with the second largest |s d 0 |, called s sub d 0 ( j i ), is used for the extraction. The leading and third-leading s d 0 values (ordered by |s d 0 |) are used as a validation of the flavor-fraction fitting procedure and produce consistent results. The leading s d 0 is not as well-modeled as s sub d 0 and therefore the χ 2 resulting from the fit procedure described below is slightly worse. The value of s sub d 0 does not have a strong dependence on jet p T , so the fit is performed inclusively. This choice was validated by using p T -binned fits, which produce

ATLAS Simulation
Figure 3: The detector response is represented as the conditional probability of the detector-level quantity given the particle-level quantity, written as Pr(detector-level|particle-level), in simulation for , and log(m bb /p T ) (bottom right). The small anti-diagonal component for ∆θ ppg,gbb is due to cases where the leading and subleading track-jets are swapped between detector level and particle level so ∆θ ppg,gbb → π − ∆θ ppg,gbb .
results consistent with the inclusive approach.
A binned maximum-likelihood fit to s sub d 0 ( j 1 ) and s sub d 0 ( j 2 ) is performed to extract the flavor fractions. Given the flavors of track-jets j 1 and j 2 (p T,1 ≥ p T,2 ), s sub d 0 ( j 1 ) and s sub d 0 ( j 2 ) are well-approximated as being statistically independent (linear correlation is less than 5%). Therefore, the probability distribution p(s sub d 0 ( j 1 ), s sub d 0 ( j 2 )) can be approximated by the product of marginals p(s sub ). This approximation reduces a two-dimensional fit to a simultaneous fit of two one-dimensional distributions. In order to increase the robustness of the fit, flavor combinations with similar templates are merged. The three templates used for each bin of the target observable are BB (signal), B, and L+C. The BB template only includes events where both jets are labeled as b-jets using MC particle-level flavor labeling. The B template is an aggregation of bl and bc events in which the leading track-jets are labeled as b and the subleading track-jets are labeled as light-flavor or c in particle-level flavor labeling. This template also includes cases in which a g → bb splitting was fully contained inside one small-radius track-jet and the second track-jet is due to a light quark or gluon. The rest of the events are merged into the L+C template. Uncertainties in the templates, in particular resulting from the template merging scheme, are described in Section 8.1.
Examples of the flavor-fraction determination fits are shown in Figure 4 for one bin of ∆R(b, b). Since the leading track-jet is required to be b-tagged, the distribution of s sub d 0 ( j 1 ) is broader and shifted toward more positive values than the distribution of s sub d 0 ( j 2 ). In contrast, the subleading jet is most often a light-flavor jet for both the B and L+C categories and therefore s sub is nearly symmetric about zero. The BB template is similar in shape for the leading and subleading track-jets. For this particular fit, the χ 2 per degree of freedom improves from 72/22 (pre-fit) to 13.5/22 (post-fit). As a result of the fit, the background fraction changes from 79.6% to 82.8%. A comparison of the pre-fit and post-fit flavor fractions for all bins of ∆R as well as the other observables is presented in Figure 5. Except for ∆θ ppg,gbb and the highest bin of z(p T ) the P simulation prediction for the shape of the flavor-fraction distribution is accurate for the BB fraction. In all cases the BB yield is slightly overestimated. The flavor fractions from B and L+C are inverted between P and data.

Unfolding
After subtracting the background from the detector-level distributions, as described in Section 6, the data are corrected for resolution and acceptance effects. The fiducial volume of the measurement is described by the particle-level object and event selection in Section 4. First, the data are corrected for events that pass the detector-level selection but not the particle-level selection using the simulations introduced in Section 3. Then, the iterative Bayes (IB) unfolding technique [77] is used to correct for the detector resolution in events that pass both the detector-level and particle-level selections. The IB method is applied with four iterations implemented in the RooUnfold framework [78]. After the application of the response matrix, a final correction is applied to account for events that pass the particle-level but not detector-level selection. Uncertainties in the unfolding procedure are described in Section 8.

Uncertainties
Systematic and statistical uncertainties were assessed for each aspect of the analysis, including the background subtraction, acceptance and efficiency correction factors, response matrix, and unfolding method. For each uncertainty, a component of the analysis chain is varied and then the entire procedure including the background subtraction is repeated. Table 1 provides a summary of the systematic uncertainties for each observable. The jet energy scale, the unfolding, and the theoretical modeling uncertainties dominate. Further details about each category are provided in Section 8.1 for the uncertainties associated with each analysis object, in Section 8.2 for the background fit procedure uncertainty, and in Section 8.3 for the unfolding method and theoretical modeling uncertainties.

Object reconstruction
Each object used in the analysis has an associated uncertainty. These uncertainties affect the acceptance factors and the response matrix, as well as the background fit templates.   Table 1.

Calorimeter jets:
The energies of large-radius jets are shifted and smeared to account for uncertainties in both the bias and variance of the reconstructed energy. Jet energy scale uncertainties are determined by comparing calorimeter-based and tracker-based jet energy measurements in inclusive dijet events, and range from 2% to 6% [79].
Flavor tagging: Data/MC corrections ('scale-factors') and uncertainties in the b-tagging efficiencies and c-jet misidentification rates are determined from tt events [80]. Light-flavor misidentification rates are studied using dijet events. The b-jet uncertainties are 5-10%, while the c-jet uncertainties are 20% and light-flavor jet uncertainties are about 50%. Uncertainties for track-jets with p T > 300 GeV are extrapolated from low-p T jets as there are too few events with high-p T jets for a proper calibration. Extrapolation uncertainties are evaluated by varying quantities such as impact parameter resolutions and descriptions of the detector material. The extrapolated uncertainties range from 20% to 100% depending on the p T and the flavor of the jet. As the flavor fractions are constrained in situ, there is a significant reduction from the prior flavor-tagging uncertainty values described above. In particular, there is little sensitivity to the background scale factors and there is no sensitivity to inclusive flavor-tagging scale factors for b-jets. Residual ∆R(b, b)-dependent scale factor uncertainties that account for differences in the modeling of isolated versus non-isolated b-jets are derived from dedicated performance studies [50,51].
Tracking: Systematic uncertainties are estimated for the track reconstruction efficiency, fake rate, and track parameter scales and resolutions. The main source of inclusive tracking inefficiency is multiple scattering in the ID, so the uncertainty is set by the accuracy with which the ID material is simulated [81]. This leads to a 0.5% uncertainty for |η| < 0.1, which grows to 2.7% at the end of the ID acceptance. An additional source of inefficiency arises inside the high-multiplicity environment in the cores of jets due to silicon pixel and microstrip cluster merging. The uncertainty in the modeling of this density-induced inefficiency is about 0.8% [82,83]. Fake tracks are due to combinations from multiple charged particles. The track selection described in Section 4 reduces the contribution of fake tracks to much less than 1% with a relative uncertainty that is about 30% [84]. The track parameters that are most relevant for this analysis are p T and d 0 . Weak modes in the ID alignment cause a bias in the track sagitta that is corrected for using a dedicated calibration and the calibration uncertainty is propagated through to the measurement [72]. The modeling of the d 0 scale and resolution was studied in Z → µ + µ − events at low p T and in dijet events at high p T [72] and is used to assign an uncertainty to the modeling of these important track properties.

Background fit
In addition to the fit validations described in Section 6, several aspects of the fit are varied in order to assess the uncertainty in the extracted flavor fractions.

Unfolding method and theoretical modeling
An uncertainty resulting from the unfolding method described in Section 7 is determined by unfolding the prediction from a different simulation with the nominal procedure. The alternative simulation is constructed by reweighting the nominal particle-level spectrum so that the simulated detector-level spectrum, obtained by propagating the reweighted particle-level spectrum through the response matrix, agrees well with the data. The modified detector-level distribution is unfolded with the nominal response matrix and the difference between this and the reweighted particle-level spectrum is an indication of the bias due to the unfolding method (in particular, the choice of prior) [85].
The unfolded result can depend on the modeling of jet fragmentation through the background fit, the prior, the response matrix, and the correction factors. The s d 0 distribution does not strongly depend on the properties of the jet radiation pattern, but an uncertainty is determined by taking the fitted background using templates from S instead of P . Variations in the prior are already accounted for in the data-driven non-closure uncertainty described above. The rest of the contributions are evaluated by comparing the result using P with the result using the alternative S sample described in Section 6. This comparison is decomposed into components corresponding to varying only the response matrix or only the initial/final correction factors. Varying only one component at a time is possible by reweighting a component of the P simulation to match the S simulation and then evaluating the relative difference in the unfolded result. All of the components are added in quadrature to determine the total uncertainty due to fragmentation modeling. Each component is treated as uncorrelated because the uncertainty is based on only two fragmentation models and therefore a potential reduced uncertainty from exploiting potentially unphysical correlations between kinematic properties impacting acceptance and substructure attributes impacting the response is avoided.

Results
The unfolded results along with multiple parton shower MC predictions are presented in Figure 6. By construction, 0.2 ≤ ∆R ≤ 1.0 and the peak around 0.3 is due to the radius of the track-jets. Furthermore, 0 ≤ z(p T ) ≤ 0.5, with some distortions to the natural distribution at low values due to the p T threshold applied to the small-radius track-jets.

The S
predictions are generally more accurate than those from P , although there are significant differences between both generators and the data at low mass, low z(p T ) and for all ∆θ ppg,gbb . The ∆θ ppg,gbb distribution in data appears to be inverted with respect to the one from P (with a minimum instead of maximum at π/2) while S predicts a relatively uniform distribution. For comparison, the figure contains a version of P 3 with the azimuthal asymmetries induced by gluon polarization turned off.4 This sample appears to be closer to S and also to the data. In general, the properties of gluon polarization inside unpolarized hadrons are largely unconstrained by experimental data (see e.g. Ref. [86] and references therein). This and future measurements of ∆θ ppg,gbb may provide a new way to extract p T -dependent parton distributions in order to better understand proton structure and further improve the precision of various cross-section calculations [87]. 3 Due to the available settings in P , different versions were used for these comparisons. It was checked that there is no significant difference in ∆θ ppg,gbb between P versions 8.186 and 8.230. 4 TimeShower:phiPolAsym = off and TimeShower:phiPolAsymHard = off.
In addition to studying gluon production properties, g → bb provides a handle on gluon fragmentation. Due to the large b-quark mass and in general the large m bb mass that is possible after splitting, there are many formally equivalent model choices in describing gluon fragmentation. For example, the scale at which the strong coupling constant acts (renormalization scale) may be better described as scaling with m 2 bb instead of the P default p 2 T,bb . To illustrate the sensitivity of ∆R(b, b), z(p T ) and log(m/p T ) to fragmentation settings in P , the plots of Figure 6 show the final-state radiation variations of the A14 tune (indicated as an uncertainty band in the plot) as well as a different way to treat the b-quark mass in the QCD splitting kernels indicated by the m 2 bb /4 variation (m 2 bb /4 instead of p 2 T,bb for the renormalization scale).5 No variation describes all of the data, but some variations are worse than others. For example, the Var2+ A14 variation, which increases the final-state shower α S (M Z ) value to 0.139, moves the prediction further from the data in nearly all measurement bins. Related variations (not shown) such as using m 2 bb instead of p 2 T,bb as the renormalization scale, adding additional phase-space factors, or suppressing high-mass bb pairs, are not significantly different from the nominal P setup.6 5 TimeShower:weightGluonToQuark = 5 and TimeShower:scaleGluonToQuark = 0.25. 6 Variations of TimeShower:weightGluonToQuark and TimeShower:scaleGluonToQuark; when TimeShower:weightGluonToQuark = 1, then the g → bb kernel is weighted by an extra β phase-space factor; when TimeShower:weightGluonToQuark = 5, this kernel is reweighted to α S (m 2 bb ) instead of α S (p 2 T ); when TimeShower:weightGluonToQuark = 8, there is an additional factor that suppresses the rate of high-mass bb pairs.

Conclusion
This paper presents a measurement of various properties of g → bb at high p T and low ∆R(b, b) from 33 fb −1 of √ s = 13 TeV pp collisions recorded by the ATLAS detector at the LHC. A flavor-fraction fit is used to remove contributions from processes other than g → bb. The fitted fractions significantly disagree with the pre-fit P predictions and suggest that further studies could improve the modeling of analyses sensitive to these fractions. The measured properties are unfolded to correct for the detector acceptance and resolution for direct comparison with particle-level models. Comparisons are made at particle level between the distributions and various models of jet formation. Simulations from the S event generator generally provide a better model than P , especially for the ∆θ ppg,gbb observable, which is sensitive to the modeling of the gluon polarization. The particle-level spectra are publicly available [88] for further interpretation and can be used to validate QCD MC predictions and tune their models' free parameters.                 [43] D0 Collaboration, Measurement of the W + b-jet and W + c-jet differential production cross sections in pp collisions at