Measurement of the top quark mass using proton-proton data at sqrt(s) = 7 and 8 TeV

A new set of measurements of the top quark mass are presented, based on the proton-proton data recorded by the CMS experiment at the LHC at sqrt(s) = 8 TeV corresponding to a luminosity of 19.7 inverse femtobarns. The top quark mass is measured using the lepton + jets, all-jets and dilepton decay channels, giving values of 172.35 +/- 0.16 (stat) +/- 0.48 (syst) GeV, 172.32 +/- 0.25 (stat) +/- 0.59 (syst) GeV, and 172.82 +/- 0.19 (stat) +/- 1.22 (syst) GeV, respectively. When combined with the published CMS results at sqrt(s) = 7 TeV, they provide a top quark mass measurement of 172.44 +/- 0.13 (stat) +/- 0.47 (syst) GeV. The top quark mass is also studied as a function of the event kinematical properties in the lepton + jets decay channel. No indications of a kinematic bias are observed and the collision data are consistent with a range of predictions from current theoretical models of t t-bar production.


Introduction
The mass of the top quark (m t ) is one of the fundamental parameters of the standard model (SM). A precise measurement of its value provides a key input to global electroweak fits and to tests of the internal consistency of the SM [1,2]. Its value leads to constraints on the stability of the electroweak vacuum [3,4] and affects models with broader cosmological implications [5,6].
The most precise measurements of m t have been derived from combinations of the results from the CDF and D0 experiments at the Tevatron, and ATLAS and CMS at the CERN LHC. The current combination from the four experiments gives a top quark mass of 173.34 ± 0.76 GeV [7], while the latest combination from the Tevatron experiments gives a mass of 174.34 ± 0.64 GeV [8]. The Tevatron combination is currently the most precise measurement and it includes all of the current Tevatron measurements. In contrast, the current four experiment combination has not been updated since 2013 and does not include the latest Tevatron and LHC measurements, in particular the measurement from ATLAS using a combination of the lepton+jets and dilepton channels [9].
Beyond the leading order (LO) in quantum chromodynamics (QCD), the numerical value of m t depends on the renormalization scheme [10,11]. The available Monte Carlo (MC) generators contain matrix elements at LO or next-to-leading order (NLO), while higher orders are approximated by applying parton showering. Each of the measurements used in the combinations has been calibrated against the mass implemented in a MC program. Given the precision of the experimental results, a detailed understanding of the relationship between the measurements and the value of m t in different theoretical schemes is needed. Current indications are that the present measurements based on the kinematic reconstruction of the top quark mass correspond approximately to the pole ("on-shell") mass to within a precision of about 1 GeV [12].
At the LHC, top quarks are predominantly produced in quark-antiquark pairs (tt) and top quark events are characterized by the decays of the daughter W bosons. This leads to experimental signatures with two jets associated with the hadronization of the bottom quarks and either a single lepton (e, µ), one undetected neutrino and two light quark jets (lepton+jets channel), or four light quark jets (all-jets channel), or two leptons (ee, eµ, µµ) and two undetected neutrinos (dilepton channel). While the events which contain leptonic τ decays are included in the analysis samples, they contribute very little to the mass measurements as their yields are negligible. The results presented in this paper focus on the analysis of data in these three channels recorded by the CMS experiment in the 2012 part of what is commonly referred to as Run 1 of the LHC.
The paper is organized as follows. The main features of the detector and the data are discussed in Sections 2 and 3. Section 4 is a discussion of the analysis techniques, which lead to the measurements of Section 5. The categorization of the systematic uncertainties is presented in Section 6, followed by the full results for the three decay channels in Section 7. Section 8 presents a study of m t as a function of the kinematical properties of the tt system in the lep-ton+jets channel. This is followed in Sections 9 and 10, which discuss the combination of the measurements and the final result for m t , respectively.

The CMS detector
The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL), and a brass and scin-tillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. The tracker has a track-finding efficiency of more than 99% for muons with transverse momentum p T > 1 GeV and pseudorapidity |η| < 2.5. The ECAL is a fine-grained hermetic calorimeter with quasi-projective geometry, and is distributed in the barrel region of |η| < 1.48 and in two endcaps that extend up to |η| < 3.0. The HCAL barrel and endcaps similarly cover the region |η| < 3.0. In addition to the barrel and endcap detectors, CMS has extensive forward calorimetry. Muons are measured in gas-ionization detectors, which are embedded in the steel flux-return yoke outside of the solenoid. A more detailed description of the CMS detector, together with a definition of the coordinate system used, and the relevant kinematic variables can be found in Ref. [13].

Data sets
The measurements presented in this paper are based on the data recorded at a center-of-mass energy of 8 TeV during 2012, and correspond to an integrated luminosity of 19.7 fb −1 .

Event simulation and reconstruction
Simulated tt signal events are generated with the MADGRAPH 5.1.5.11 LO matrix element generator with up to three additional partons [14]. MADSPIN [15] is used for the decay of heavy resonances, PYTHIA 6.426 for parton showering [16] using the Z2* tune, and TAUOLA [17] for decays of τ leptons. The most recent PYTHIA Z2* tune is derived from the Z1 tune [18], which uses the CTEQ5L parton distribution function (PDF) set, whereas Z2* uses CTEQ6L [19]. A full simulation of the CMS detector based on GEANT4 [20] is used. The tt signal events are generated for seven different values of m t ranging from 166.5 to 178.5 GeV. The W/Z+jets background events are generated with MADGRAPH 5.1.3.30. The diboson background (WW, WZ, ZZ) is simulated using PYTHIA 6.426 using the Z2* tune. The single top quark background is simulated using POWHEG 1.380 [21][22][23][24][25], assuming an m t of 172.5 GeV. The tt, W/Z+jets, and single top quark samples are normalized to the theoretical predictions described in Refs. [26][27][28][29][30]. The simulation includes the effects of additional proton-proton collisions (pileup) by overlapping minimum bias events with the same multiplicity distribution and location as in data.
Events are reconstructed using a particle-flow (PF) algorithm [31,32]. This proceeds by reconstructing and identifying each final-state particle using an optimized combination of all of the subdetector information. Each event is required to have at least one reconstructed collision vertex. The primary vertex is chosen as the vertex with the largest value of ∑ p 2 T of the tracks associated with that vertex. Additional criteria are applied to each event to reject events with features consistent with detector noise and beam-gas interactions.
The energy of electrons is determined from a combination of the track momentum at the primary vertex, the corresponding ECAL energy cluster, and the sum of the reconstructed bremsstrahlung photons associated with the track [33]. The momentum of muons is obtained from the track momentum determined in a combined fit to information from the silicon trackers and the muon detectors [34]. The energy of charged hadrons is determined from a combination of the track momentum and the corresponding ECAL and HCAL energies, corrected for the suppression of small signals and calibrated for the nonlinear response of the calorimeters. Finally, the energy of the neutral hadrons is obtained from remaining calibrated HCAL and ECAL energies. As the charged leptons originating from top quark decays are typically isolated from other particles, a relative isolation variable (I rel ) is constructed to select lepton candidates. This is defined as the scalar sum of the p T values of the additional particles reconstructed within an angle ∆R = √ (∆η) 2 + (∆φ) 2 of the lepton direction, divided by the p T of the lepton. Here ∆η and ∆φ are the differences in the pseudorapidity and azimuthal angles between the lepton direction and other tracks and energy depositions. A muon candidate is rejected if I rel ≥ 0.12 for ∆R = 0.4, and an electron candidate is rejected if I rel ≥ 0.10 for ∆R = 0.3.
Jets are clustered from the reconstructed PF candidates using the anti-k T algorithm [35] with a distance parameter of 0.5, as implemented in the FASTJET package [36]. The jet momentum is determined from the vector sum of the momenta of the particles in each jet, and is found in simulation to be within 5 to 10% of the jet momentum at hadron level for the full p T range [37]. Corrections to the jet energy scale (JES) and the jet energy resolution (JER) are obtained from the simulation and through in-situ measurements of the energy balance of exclusive dijet, photon+jet, and Z+jet events. Muons, electrons, and charged hadrons originating from pileup interactions are not included in the jet reconstruction. Missing transverse energy (E miss T ) is defined as the magnitude of the negative vector p T sum of all selected PF candidates in the event. Charged hadrons originating from pileup interactions are not included in the calculation of E miss T . Jets are classified as b jets through their probability of originating from the hadronization of bottom quarks, using the combined secondary vertex (CSV) b tagging algorithm, which combines information from the significance of the track impact parameters, the kinematical properties of the jets, and the presence of tracks that form vertices within the jet. Three different minimum thresholds are used for the CSV discriminator to define the loose (CSVL), medium (CSVM), and tight (CSVT) working points. These have b tagging efficiencies of approximately 85%, 67%, and 50%, and misidentification probabilities for light-parton jets of 10%, 1%, and 0.1%, respectively [38].

Event selection
For the lepton+jets channel we use the data collected using a single-muon or single-electron trigger with a minimum trigger p T threshold for an isolated muon (electron) of 24 GeV (27 GeV), corresponding to an integrated luminosity of 19.7 fb −1 . We then select events that have exactly one isolated muon or electron, with p T > 33 GeV and |η| < 2.1. In addition, at least four jets with p T > 30 GeV and |η| < 2.4 are required. Jets originating from b quarks (denoted as b jets) are identified using the CSV algorithm at the medium working point [38]. With the requirement of exactly two b-tagged jets among the four jets with the highest p T , 104 746 tt candidate events are selected in data. From simulation, the sample composition is expected to be 93% tt, 4% single top quark, 2% W+jets, and 1% other processes. Figure 1 shows the comparison of the data and simulation for the selected events in some representative distributions. The simulation shown is not corrected for the uncertainty in the shape of the top quark p T distribution [39], which accounts for almost all of the slope visible in the data/MC ratio plots. However, even without making a correction, the data and simulation are consistent within the quoted uncertainties.
For the all-jets channel we use the data collected using a multijet trigger, corresponding to an integrated luminosity of 18.2 fb −1 . The trigger requires the presence of at least four jets, reconstructed from the energies deposited in the calorimeters, with transverse momenta p T > 50 GeV. Since fully hadronically decaying top quark pairs lead to six partons in the final state, events are required to have at least four jets with p T > 60 GeV and a fifth and sixth jet with p T > 30 GeV. Jets originating from b quarks are identified using the CSV b algorithm at the tight working point [38]. With the requirement of exactly two b-tagged jets among the six leading ones, 356 231 candidate events are selected. From simulation, the sample is expected to be dominated by the QCD multijet background and to have a signal fraction of about 13%. The QCD multijet background cannot be reliably simulated and we determine its kinematic  Figure 1: Distributions for the lepton+jets channel of (upper left) lepton p T , (upper right) missing transverse energy, (lower left) leading jet p T , (lower right) second-leading jet p T for data and simulation, summed over all channels and normalized by luminosity. The vertical bars show the statistical uncertainty and the hatched bands show the statistical and systematic uncertainties added in quadrature. The lower portion of each panel shows the ratio of the yields between the collision data and the simulation.

CMS
dependence from a control sample in the data. The background normalization is determined as a part of the fit process, which is discussed in Section 5.2.
For the dilepton channel, events are required to pass the triggers appropriate for each of the three channels. The eµ channel uses a logical OR of two triggers that require a muon of p T > 17 (or 8) GeV and an isolated electron of 8 (or 17) GeV. Dimuon events must pass a trigger which requires a p T > 17 GeV for the muon with the highest ("leading") p T and 8 GeV for the second-leading muon. Similarly, dielectron events must satisfy a trigger with a threshold of p T > 17 GeV for the leading electron and 8 GeV for the second-leading electron. In this case both electrons are required to be isolated. In all three cases the amount data corresponds to a luminosity of 19.7 fb −1 . We select events for analysis if they have two isolated opposite-sign leptons with p T > 20 GeV and |η| < 2.4 (2.5) for muons (electrons). Jets originating from b quarks are identified with the CSV algorithm at the loose working point [38]. Events are retained if they have at least two b-tagged jets. Background contamination from low-mass resonances is reduced by demanding a dilepton pair invariant mass, m , of at least 10 GeV. To suppress the background from Z boson decays, events with ee and µµ signatures are required to have E miss T > 40 GeV, and to fall outside of the dilepton invariant mass window 76 < m < 106 GeV. The remaining Drell-Yan background is estimated from the data using the ratio of the event yield inside vs. outside the invariant mass window [38]. After all of the requirements, we find 41 125 candidate events in data for which the sample compositions is expected to be 95% tt, 3% single top quark, 2% Drell-Yan, and <0.3% other processes. Figure 2 shows a comparison of the data and simulation for events with at least one b jet for some representative distributions. As with the lepton+jets plot ( Fig. 1) the simulation is not corrected for the discrepancy in the top quark p T distribution, leading to the slopes visible in the data/MC plots.

Analysis techniques
The measurements discussed in the following sections use analysis techniques in which either m t alone is determined or m t and the overall jet energy scale factor (JSF) are determined simultaneously. For the lepton+jets and the all-jets channels we use analyses based on the ideogram technique (Section 4.1). While the ideogram technique provides the most precise measurements, it is not suitable for dilepton events where the presence of more than one neutrino introduces uncertainties in the use of the measured E miss T . Instead, for the dilepton channel, we use the Analytical Matrix Weighting Technique (AMWT) method (Section 4.2).

One-and two-dimensional ideogram analyses
The ideogram method is a joint maximum likelihood fit that determines m t and, optionally, the JSF from a sample of selected tt candidate events in the lepton+jets or all-jets channels. The observable used for measuring m t is the mass m fit t estimated by a kinematic fit [40]. The kinematic fit constrains the candidates for the tt decay products to the hypothesis of the production of two heavy particles of equal masses, each one decaying to a W boson and a bottom quark, where the W boson invariant mass is constrained to 80.4 GeV [41]. The JSF is defined as a multiplicative factor to be applied in addition to the standard jet energy corrections (JEC) [37] to the four-momenta of the jets. The JSF is determined from the invariant masses of the jet pairs, m reco W , associated with the W bosons before the jet momenta are constrained by the kinematic fit. For the case of a simultaneous fit to both m t and the JSF (2D approach), no prior knowledge of the JSF is assumed. If only m t is fitted (1D approach), the jet energy scale determined from the JEC is taken as the JSF prior, fixing it to unity. A third category of fits (hybrid approach) incorporates the prior knowledge about the jet energy scale by using a Gaussian constraint, P(JSF), centered at 1 with a variance depending on the total JEC uncertainty. For the hybrid analysis in the lepton+jets channel, the JSF determined from the Wboson decays and the jet energy scale from the JEC are given equal weight in the fit. In contrast, for the hybrid fit in the all-jets channel, the jet energy scale from the JEC contributes 80% of the information, because of the larger uncertainty on the JSF from the 2D fit.
The distributions of m fit t and m reco W are obtained from simulation for three to seven different m t and three to five different JSF values for the tt signal, and from simulated background events (lepton+jets) or the control sample for the multijet background (all-jets). From these distributions, probability density functions are derived separately for different cases of jet-parton assignments for the signal, and for the background contribution. The signal functions depend on m t and JSF, and are labeled P(m fit t |m t , JSF) and P(m reco W |m t , JSF), respectively, for an event in the final likelihood.
The likelihood for measuring m t and the JSF in an observed data sample can be expressed as: where the event weight w event = c ∑ n i=1 P gof (i) is used in the lepton+jets analysis to reduce the impact of events for which the chosen permutation of the jets is incorrect. Here, c is a normalization constant and the remaining quantities are defined as in Eq. (2). For the all-jets channel, w event = 1 is used. The event likelihoods (or ideograms) are given by where the index i runs over the n selected permutations of an event that each have a goodnessof-fit probability P gof assigned from the kinematic fit. The signal fraction f sig is assumed to be 1 for the lepton+jets channel and is left as a free parameter of the fit for the all-jets channel. The background term P bkg is independent of both m t and the JSF for backgrounds determined from the collision data.
As the W boson mass is fixed to 80.4 GeV in the fit, the observables m fit t and m reco W have a low correlation coefficient (less than 5%) and the probability density P can be factorized into onedimensional expressions, where the index j denotes the different jet-parton permutation classes defined for the measurement. Their relative fraction f j is either determined from the simulated sample with m t,gen = 172.5 GeV or by the fit.
The most likely m t and JSF values are obtained by minimizing −2 ln L (sample|m t , JSF) for the 2D and hybrid analyses. For the 1D analyses only m t is determined and the JSF is set to unity during the minimization.

Analytical matrix weighting technique (AMWT)
The measurement of m t for the dileptonic tt decays is performed using the analytical matrix weighting technique (AMWT). This is based on a matrix weighting technique used by the D0 collaboration [42], combined with an analytical algorithm to find solutions of the kinematic equations [43]. The method allows the determination of m t with the assumption of JSF = 1, and in this sense, the results are comparable to the 1D fits performed in either the lepton+jets or all-jets channels (see Section 4.1).
In dileptonic tt decays, the final state consists of two charged leptons, two neutrinos, and two b quarks, resulting in 18 unknowns: three momentum components for each of the six final state particles. Of these, we observe the momenta of the two charged leptons, the momenta of the two jets, and the momenta of all of the other charged particles and jets. If there are more than two jets in an event we have to select the jets to assign to the b quarks from the decay of the top quark pair. We preferentially assign b-tagged jets to these. Hence, after physics object reconstruction, we measure the following observables for each event: • the momenta p + and p − of the charged leptons from the W + and W − decays, • the momenta p b and p b of the b and b quarks produced by the t and t quark decays, • the total transverse momentum p tt T of the tt pair. This leaves four unknowns that must be solved analytically. Conservation of four-momentum provides the following four constraints on the kinematics, if a hypothetical value for the topquark mass is assumed: • the masses m + ν and m −ν of the lepton-neutrino pairs from the W + and W − decays are constrained to be 80.4 GeV [41], • the masses of the systems of particles from the t and t decays must equal the hypothesized mass of the top quark.
Hence, the system of equations is appropriately constrained. However, there is not a unique solution, because the equations are nonlinear. For a given assignment of reconstructed momenta to final-state particles there can be up to four solutions for the neutrino momenta such that the event satisfies all of the constraints. There is a twofold ambiguity of assigning jet momenta to the b and b jets, which doubles this to eight possible solutions. We follow the algorithm given in Refs. [44,45] to find these solutions. In rare cases, a latent singularity in the equations used to find these solutions can prohibit the calculation of the longitudinal momenta. In such events, a numerical method is employed to find the incalculable variables [44].
For each event, we find all solutions of neutrino momenta for hypothesized top quark masses between 100 and 600 GeV, in 1 GeV increments. In general, we expect solutions to be found for a large range of mass hypotheses. To each solution we assign a weight w given by [46]: where X represents the momentum vectors of the final state particles as obtained from the solutions of the kinematic equations. We sum the parton distribution functions F(x), evaluated at Q 2 = m 2 t , over the possible LO initial parton states (uu, uu, dd, dd, and gg); x 1 and x 2 are the Bjorken x values for the initial-state partons which can be computed from the momenta of the final-state particles. The function p(E|m t ) is the probability density of observing a charged lepton of energy E in the rest frame of a top quark of mass m t , given by [46]: where the b quark mass, m b , is set to 4.8 GeV, and the W boson mass, M W , to 80.4 GeV. For each m t hypothesis, we find an overall weight by summing the weights of all solutions found.
To compensate for mismeasurements of the momenta due to finite detector resolution or the loss of correlation between the jet and quark momentum because of hard-gluon radiation, we account for the jet energy resolution during reconstruction. Every event in both the collision and simulated data is reconstructed 500 times, each time with jet momenta drawn randomly from a Gaussian distributions of widths given by the detector resolution and with means given by the measured momenta. After this randomization procedure, approximately 96% of all events in both the collision and simulated data have at least one solution, and hence a top quark mass estimator. The final weight curve of each event is given by the average of the weight distributions from each of the 500 randomizations, after excluding the cases for which there is no valid solution. This distribution serves as a measure of the relative probability that the observed event occurs for any given value of m t .
The estimator for m t is then the hypothesized mass with the highest average sum weight for each event, called the AMWT mass, m AMWT t .

The lepton+jets channel
To check the compatibility of an event with the tt hypothesis and improve the resolution of the reconstructed quantities, a kinematic fit [40] is applied to the events. For each event, the inputs to the fitter are the four-momenta of the lepton and the four leading jets, the missing transverse energy, and their respective resolutions. The fit constrains these to the hypothesis of the production of two heavy particles of equal mass, each one decaying to a Wboson with an invariant mass of 80.4 GeV [41] and a bottom quark. It minimizes where x m is the vector of measured observables, x is the vector of fitted observables, and E −1 is the inverse error matrix which is given by the resolutions of the observables. The two b-tagged jets are candidates for the bottom quarks in the tt hypothesis, while the two untagged jets serve as candidates for the light quarks for one of the Wboson decays. This leads to two possible parton-jet assignments per event and two solutions for the z component of the neutrino momentum.
For simulated tt events, the parton-jet assignments are classified as correct permutations, wrong permutations, and unmatched permutations. The correct permutation class includes those events for which all of the quarks from the tt decay (after initial-state parton shower) are correctly matched to the selected jets within a distance ∆R < 0.3. The wrong permutations class covers the events for which the jets from the tt decay are correctly matched to the selected jets, but where two or more of the jets are interchanged. Lastly, the unmatched permutations class includes the events for which at least one quark from the tt decay is not matched unambiguously to any of the four selected jets. To increase the fraction of correct permutations, we require P gof > 0.2 for the kinematic fit with two degrees of freedom. This selects 28 295 events for the mass measurement, with an estimated composition of 96.3% tt signal and 3.7% non-tt background, which is dominated by single top quark events. In the mass extraction, the permutations are weighted by their P gof values, and the effective fraction of correct permutations among the tt signal improves from 13% to 44%, while the fractions of wrong and unmatched permutations change from 16% to 21% and 71% to 35%, respectively, determined in simulation.   The 2D ideogram fit to the combined electron and muon channels yields: As m t and the JSF are measured simultaneously, the statistical uncertainty in m t combines the statistical uncertainty arising from both components of the measurement. The uncertainty of the measurement agrees with the expected precision obtained by performing pseudo-experiments.
The results in the individual muon and electron channels are compatible within their statistical uncertainties: µ+jets: m 2D t = 172.03 ± 0.27 (stat+JSF) GeV, JSF 2D = 1.007 ± 0.003 (stat), e+jets: m 2D t = 172.26 ± 0.28 (stat+JSF) GeV, JSF 2D = 1.003 ± 0.003 (stat). The 1D and hybrid analyses give results of respectively. Both the 2D and hybrid results for the JSF (JSF 2D and JSF hyb ) are within 0.5% of one. The results for m t and the JSF are compared in Fig. 4, which shows the two-dimensional statistical likelihoods obtained from data in the 2D and hybrid cases and m t from the 1D analysis.

The all-jets channel
As in the lepton+jets channel, a kinematic fit [47] is used to improve the resolution of the reconstructed quantities and to check the compatibility of an event with the tt hypothesis. For each event, the inputs to the fit are the four-momenta of the six leading jets. The fit constrains these to the hypothesis of the production of two heavy particles of equal masses, each one decaying to a W boson with its invariant mass constrained to 80.4 GeV [41] and a bottom quark. The two b-tagged jets are candidates for the bottom quarks in the tt hypothesis, while the four untagged jets serve as candidates for the light quarks of the W boson decays. This leads to six possible parton-jet assignments per event and the assignment that fits best to the tt hypothesis based on the χ 2 of the kinematic fit is chosen. As final selection criteria, we require P gof > 0.1 for the kinematic fit with three degrees of freedom, and the two b quark jets be separated in η-φ space by ∆R bb > 2.0. These requirements select 7049 events for the mass measurement in data and the fraction of signal events f sig increases from 14% to 61% based on the simulation.
For simulated tt events, the parton-jet assignments are classified as correct permutations and wrong permutations. The correct permutation class is defined in the same way as for the lep-ton+jets channel (Section 5.1). The wrong permutations class consists of permutations where at least one quark from the tt decay is not unambiguously matched with a distance of ∆R < 0.3 to any of the six selected jets. For correct permutations, which compose 42% of the selected tt events, the kinematic fit improves the resolution of the fitted values of m t from 13.8 to 7.5 GeV.
The multijet background from QCD is modeled using a control sample that is obtained from data with the same event selection but without the b tagging requirement. While this sample has a small contamination of a few percent coming from signal events, these have no influence on the results for the background model. For each event, the kinematic selection is applied to all possible assignments of the six jets to the six quarks from the tt hypothesis. The best fitting assignment is chosen and the event is used to model the background if it fulfills the P gof and ∆R bb criteria. The modeled background is compared to the background predicted by an event mixing technique [48] . Both predictions are found to agree within their uncertainties that are derived from the validation of the methods on simulated multijet events.  results: As m t and the JSF are measured simultaneously, the statistical uncertainty in m t combines the statistical uncertainty arising from both components of the measurement. The two additional free parameters in the fit, the signal fraction f sig and the fraction of correct permutations f cp , are in agreement with the expectation from simulation.
Using the JEC determined from γ/Z+jet events in combination with the JSF prior from the 2D fit yields the results in the 1D and hybrid approaches of For the all-jets channel, the fitted results for the JSF (JSF 2D and JSF hyb ) are both within 1.1% of one. While the JSF results from the 2D analyses in the lepton+jets and all-jets channels differ by 0.6%, the results from the hybrid analyses agree to within 0.2%. The all-jets results for m t and the JSF are compared in Fig. 6 which shows the two-dimensional statistical likelihoods obtained from data in the 2D and hybrid cases and m t from the 1D analysis. AMWT masses are computed for all events in both the data and the simulations. The m AMWT t distributions computed for each of the seven simulated tt mass samples are added to the distributions from the background samples, and these are treated as templates in a binned likelihood fit. To minimize the effects of any bias from the poorly populated tails of the distribution, we only examine events with m AMWT t between 100 and 400 GeV. For each of the seven mass templates, a maximum likelihood fit is performed to the data distribution. A parabola is fit to the negative logarithms of the maximum likelihoods returned by the fits, and the minimum of the parabola is taken as the measured mass value.

The dilepton channel
The fit is calibrated to correct for any biases induced by the reconstruction using pseudo-data. The calibration is performed by means of a test using the simulated templates for the top quark masses between 169.5 and 175.5 GeV. We randomly draw 1000 samples of events, each selected such that the total number of events is the same as in the full data sample. For each template, the 1000 measured masses are averaged together and subtracted from the input mass to obtain a numerical value for the bias induced by the fit. The bias is then parametrized as a linear dependence on the generated value of m t , and the resulting calibration curve is used to correct for biases in the final result.
The likelihoods obtained from a fit of each of the seven simulated templates to data, as well as a second-order polynomial fit to these values, are shown in Fig. 8. This yields an uncalibrated measured mass of m t = 172.77 ± 0.19 (stat) GeV. After correcting for the fit bias, the result for the top quark mass is found to be m t = 172.82 ± 0.19 (stat) GeV.
The analysis was optimized with the value of m t blinded. The optimization of the event se- lection was done by minimizing the total expected (statistical+systematic) uncertainty. This resulted in the restriction of the analysis to events containing only two b jets, rather than the requirement of at least one b jet which was used initially.

Systematic uncertainties
The systematic uncertainties affecting each of the measurements can be grouped into four distinct categories: one experimental category and three theoretical categories that describe the modeling uncertainties. The experimental classification covers the uncertainties that arise from the precision of the calibration and resolution of the CMS detector and the effects coming from the backgrounds and pileup. The other three categories cover the modeling of the hard scattering process and the associated radiation; non-perturbative QCD effects, such as the simulation of the underlying event and color reconnection; and the modeling of the light-and b-quark hadronization. Each of these is broken down into sub-categories leading to a total of 24 distinct systematic uncertainties. In each case the uncertainty is evaluated in terms of the largest shift that is observed in the value of m t that occurs when the parameter is varied by ±1σ, where σ is the uncertainty assigned to that quantity. The only exception to this is if the statistical uncertainty in the observed shift is larger than the value of the calculated shift. In this case the statistical uncertainty is taken as the best estimate of the uncertainty in the parameter.

Experimental effects
• Intercalibration jet energy correction: This is the part of the JES uncertainty originating from modeling of the radiation in the relative (p T -and η-dependent) intercalibration procedure.
• In situ jet energy calibration: This is the part of the JES uncertainty coming from the uncertainties affecting the absolute JES determination using γ/Z+jets events.
• Uncorrelated jet energy correction: This is the uncertainty source coming from the statistical uncertainty in the in situ jet energy calibration, the contributions stemming from the jet energy correction due to pileup effects, the uncertainties due to the vari-ations in the calorimeter response versus time, and some detector specific effects. To give a clear indication of the contribution to the JES uncertainty coming from pileup, we have sub-divided this uncertainty into non-pileup and pileup contributions. • Jet energy resolution: The systematic uncertainty associated with the JER in the simulation is determined by increasing or decreasing the JER by 1σ.
• b tagging: The uncertainty in the b tagging efficiency and misidentification probability of non-b jets may lead to varying background and signal levels. This uncertainty is estimated by varying the b tagging discriminator requirements in simulations. In the lepton+jets analysis, for example, the changes in the CSVM discriminator leads to an uncertainty in the b tagging efficiency of 1.2% and the false tagging rate of 15%, both of which correspond to a 1σ variation in the value of the b tagging scale factor. The all-jets and dilepton uncertainties are computed in a similar manner for the CSVT and CSVL discriminators, respectively. Propagating the tagging efficiency uncertainty to the values of m t leads to the systematic uncertainty.
• Trigger: This systematic uncertainty captures the uncertainties related to the modeling of the trigger efficiency, and is only significant for the all-jets measurement.
• Pileup: During the data taking period, the instantaneous luminosity increased dramatically during the year, leading to an increased number of simultaneous protonproton interactions overlapping with the primary hard scattering (in-time pileup) and possible effects due to the detector response to previous collision events (outof-time pileup). These effects are evaluated by using pseudo-experiments in which the average number of pileup events was varied by ±5%. • Backgrounds: The background contamination expected from simulation is <5% in the lepton+jets and dilepton channels. The effect of the background modeling on m t is estimated by varying the shape and normalization for each background within their uncertainty. Uncertainties from simulated backgrounds are taken to be correlated across all the measurements. The only channel for which there is a significant non-tt background in the final fit sample is the all-jets channel. For this, the shape of the QCD multijet background is estimated from a control sample in the data. The method is validated using simulated QCD multijet events and with an alternative approach using event mixing in the data. The predicted background shapes are varied to cover the residual differences found in the validation. The uncertainties from the background estimation from control samples in the data are assumed to be uncorrelated.
• Fit calibration: For the calibration of the fits, the simulated samples are statistically limited. The uncertainty quoted is the statistical uncertainty in the residual bias in the fit calibration.

Theoretical and modeling uncertainties 6.2.1 Hard scattering and radiation
• Parton distribution functions: PDFs are used in modeling the hard scattering in proton-proton collisions in the simulations. The uncertainties in the PDFs and their effect on the measured value of m t are studied by reweighting a tt sample with different PDF eigenvectors using the PDF4LHC prescription [49]. The reweighted events are used to generate pseudo-experiments and the variation in the fitted mass is quoted as the uncertainty [37].
• Renormalization and factorization scales: This uncertainty is estimated using the simultaneous variation of the renormalization and factorization scales by factors of 2 and 0.5 in the matrix element calculation and the initial-state parton shower of the signal and the W+jets background.
• ME-PS matching threshold: In the tt simulation, the matching thresholds used for interfacing the matrix elements (ME) generated with MADGRAPH to the PYTHIA parton showers (PS) are varied from the default of 40 GeV down to 30 GeV and up to 60 GeV and the uncertainty is taken as the maximal difference in m t induced by this variation.
• ME generator: The sensitivity to the parton-level modeling is estimated by comparing the reference samples (MADGRAPH and PYTHIA) to samples produced using POWHEG and PYTHIA. The difference between the values of m t obtained with the two samples is taken as the uncertainty.
• Top quark p T uncertainty: This term represents the uncertainty coming from the modeling of the top quark p T distribution in the ME generator. The uncertainty is estimated by taking the difference in shape between the parton level p T spectrum from the ME generator and the unfolded p T spectrum determined from the data [39]. The uncertainty is considered fully correlated across the measurements.

Non-perturbative QCD
• Underlying event: This represents the uncertainty in modeling the soft underlying hadronic activity in the event, which affects the simulation of both signal and background. The uncertainty is estimated by comparing PYTHIA tunes with increased and decreased underlying event activity relative to a central tune. For this we compare the results for the Perugia 2011 tune to the results obtained using the Perugia 2011 mpiHi and the Perugia 2011 Tevatron tunes [50].
• Color reconnection: The effects of possible mismodeling of color reconnection are estimated by comparing the mass calculated using underlying event tunes with and without the inclusion of these effects. For these simulations the Perugia 2011 and Perugia 2011 no CR tunes are used [50]. The uncertainty is taken as the difference between the two computed values of m t .

Hadronization
• Flavor-dependent hadronization uncertainty: This is the part of the JES uncertainty that comes from differences in the energy response for different jet flavors and flavor mixtures with respect to those used in the calibration procedures. Four uncertainties are quoted that correspond to the uncertainties for light quarks (u, d, s), charm quarks, bottom quarks and gluons. These are evaluated by comparing Lund string fragmentation (PYTHIA 6 [16]) and cluster fragmentation (HERWIG++ [51]) for each category of jets. The models in PYTHIA and HERWIG allow for the differences between the jet types, and the uncertainty is determined by varying the jet energies within their respective flavor-dependent uncertainties. The full flavor-dependent uncertainty is obtained by taking a signed linear sum of these four contributions. For this we perform ±1σ shifts for each of the contributions and compute the total uncertainty from the sum of the +1σ and −1σ shifts separately. As these are symmetric, we quote the +1σ shifts for the values of the uncertainties in Tables 1-3. • b quark fragmentation and b hadron branching fraction uncertainties: This term provides a description of the residual uncertainties not covered by the flavor-dependent hadronization term. It has two components: the uncertainty in the modeling of the b quark fragmentation function and the uncertainty from the measured b hadron semileptonic branching fractions. The b quark fragmentation function in PYTHIA is modeled using a Bowler-Lund model for the fragmentation into b hadrons. The fragmentation uncertainty is determined from the difference between a version tuned to ALEPH [52] and DELPHI [53] data and the PYTHIA Z2 * tune. Lastly, the uncertainty from the semileptonic b hadron branching fraction is obtained by varying by −0.45% and +0.77%, which is the range of the measurements from B 0 /B + decays and their uncertainties [41].
The overall uncertainty in m t is 0.62 GeV and the measured JSF is compatible with the one obtained from events with Z bosons and photons [37] within the systematic uncertainties.
The measurements from the 1D and hybrid analyses are Thus the hybrid approach delivers the most precise measurement of the methods studied for the lepton+jets channel with a total uncertainty of 0.51 GeV.
The breakdown of the systematic uncertainties for the three fits is shown in Table 1. In the lepton+jets and all-jets measurements several uncertainty sources yield opposite signs in the 1D and 2D approaches. This arises because the untagged jets used for m reco W have a softer p T spectrum and larger gluon contamination compared to the b jets. As a consequence, the measurement of the JSF in the 2D measurement is more sensitive to low-p T effects and radiation uncertainties than the 1D measurement where the light-jet energies are bound to fulfill the W mass constraint. The net effect, when using a flat JSF, is that the uncertainties can be overcorrected in the 2D fit and thus their signs reverse. The hybrid fit makes optimal use of the available information and leads to partial cancelation of these uncertainties, resulting in the observed improvement of the precision of the mass measurement. Table 1: Category breakdown of the systematic uncertainties for the 2D, 1D, and hybrid measurements in the lepton+jets channel. Each term has been estimated using the procedures described in Section 6. The uncertainties are expressed in GeV and the signs are taken from the +1σ shift in the value of the quantity. Thus a positive sign indicates an increase in the value of m t or the JSF and a negative sign indicates a decrease. For uncertainties determined on independent simulated samples the statistical precision of the shift is displayed. With the exception of the flavor-dependent JEC terms (see Section 6), the total systematic uncertainty is obtained from the sum in quadrature of the individual systematic uncertainties.
The measurements from the 1D and hybrid analyses are with overall uncertainties of 0.66 and 0.64 GeV for the 1D and hybrid fits, respectively.
The breakdown of the systematic uncertainties for the three fits is shown in Table 2.

The dilepton channel
For the dilepton channel the systematic uncertainties are defined as the difference between measurements of m t from pseudo-data events, selected at random from the MC events in the m t = 172.5 GeV template. For each category of systematic uncertainty, modified templates were produced with a given systematic variable shifted, generically by ±1σ. The fit is repeated using the modified pseudo-data and the respective mean is subtracted from the mean of the default tt MC simulation to calculate the final systematic uncertainty for each category. This yields a final mass measurement of m t = 172.82 ± 0.19 (stat) ± 1.22 (syst) GeV.
The breakdown of the systematic uncertainty for the dilepton mass measurement is shown in Table 3. In comparison with the lepton+jets (Table 1) and the all-jets (Table 2) channels, the systematic uncertainties are similar in size with the exception of the factorization and renormalization, and b fragmentation terms, both of which are significantly larger. Studies of these indicate that this is probably the result of an increased boost of the visible decay products, coupled to the weak constraint of the E miss T on the energies of the two neutrinos.

The 2010 and 2011 measurements
The published CMS measurements are based on √ s = 7 TeV data recorded during 2010 and 2011. Although much less precise than the new measurements, they come from independent data sets and have different sensitivities to the various systematic uncertainties. These are included in the combined mass analysis, which is discussed in Section 9. For completeness we summarize these measurements in Table 4 below. The analysis techniques used for each of these are very similar to those used for the 2012 analyses. The dilepton results both use the AMWT method, which is described in Section 4.2, and the lepton+jets (all-jets) result comes from the 2D (1D) ideogram technique, which is described in Section 4.1.

Measured top quark mass as a function of kinematic observables
To search for possible biases in our measurements and the potential limitations of current event generators, a series of differential measurements of m t as a function of the kinematic properties Table 2: Category breakdown of the systematic uncertainties for the 2D, 1D and hybrid measurements in the all-jets channel. Each term has been estimated using the procedures described in Section 6. The uncertainties are expressed in GeV and the signs are taken from the +1σ shift in the value of the quantity. Thus a positive sign indicates an increase in the value of m t or the JSF and a negative sign indicates a decrease. For uncertainties determined on independent simulated samples the statistical precision of the shift is displayed. With the exception of the flavor-dependent JEC terms (see Section 6), the total systematic uncertainty is obtained from the sum in quadrature of the individual systematic uncertainties. To maximize the accuracy of the results, the study is performed in the lepton+jets channel using the hybrid fit technique. The variables are chosen to probe potential effects from color reconnection, initial-and final-state radiation, and the kinematics of the jets coming from the top quark decays. Table 3: Category breakdown of the systematic uncertainties for the AMWT measurement in the dilepton channel. Each term has been estimated using the procedures described in Section 6. The uncertainties are expressed in GeV and the signs are taken from the +1σ shift in the value of the quantity. Thus a positive sign indicates an increase in the value of m t and a negative sign indicates a decrease. For uncertainties determined on independent simulated samples the statistical precision of the shift is displayed. With the exception of the flavor-dependent JEC terms (see Section 6), the total systematic uncertainty is obtained from the sum in quadrature of the individual systematic uncertainties. For each measurement, the hybrid analysis method is applied to subsets of events defined according to the value of a given kinematic event observable after the kinematic fit. The contribution of the external JSF constraint is fixed to 50% to ensure consistency of all bins with the inclusive result. Constant shifts in the measured m t values may arise due to the systematic uncertainties of the inclusive measurement or from the use of different m t values in data and simulations. To search for kinematics-dependent biases the value of the mean measured top quark mass is subtracted and the results are expressed in the form m t − m t , where the mean comes from the inclusive measurement on the specific sample. In each case, the event sample is divided into 3 to 5 bins as a function of the value of the kinematic observable and we populate each bin using all permutations which lie within the bin boundaries. As some observables depend on the jet-quark assignment that cannot be resolved unambiguously, such as the p T of a reconstructed top quark, a single event is allowed to contribute to multiple bins.
To aid in the interpretation of a difference between the value of m t − m t and the prediction from a simulation in the same bin, a bin-by-bin calibration of the results is performed using the MADGRAPH + PYTHIA simulation. This is performed using the same technique as for the inclusive measurement [55] except that it is performed on each bin separately. Thus, after calibration the value in each bin can be interpreted in terms of its agreement with respect to the inclusive measurement.
For eight kinematic variables the results for the calibrated mass difference, m t − m t , are shown as a function of the chosen variable, and we compare the results to the predictions of seven different simulations. For each plotted point the statistical uncertainty and the dominant systematic uncertainties are combined in quadrature, where the latter include the JES (p T -, η-and flavor-dependent), JER, pileup, b fragmentation, renormalization and factorization scales, and ME-PS matching threshold. The systematic uncertainties are assumed to be correlated among all bins, so that any constant shift is removed by subtracting m t . We note that this approximation may underestimate the uncertainties from the p T /η-dependent JES.
For each plot we compare the data to simulations based on LO (MADGRAPH and SHERPA) and NLO (POWHEG and MC@NLO) matrix element calculations with both string (PYTHIA) and cluster (HERWIG) models for fragmentation. We also vary the choice of underlying event tune from Z2* to Perugia 2011 both with and without color reconnections, and the AUET2 tune. With the exception of the MC@NLO and SHERPA simulations, which are only used for this study, these are the same simulation as those discussed in Section 3.  The variables were chosen for their potential sensitivity to modeling the kinematics of top quark production ( Fig. 9) and decay (Fig. 10). No significant deviation in the value of the measured m t is observed, indicating that within the current precision, there is no evidence for a bias in the measurements. The agreement between the data and each of the simulations is quantified in Table 5. Here we show the cumulative χ 2 for the 27 degrees of freedom represented by the eight distributions studied (Figs. 9 and 10) and the corresponding number of standard deviations between the data and the simulation, where we have assumed two-sided Gaussian confidence intervals for each simulation. In all cases, with the possible exception of POWHEG + HERWIG 6 simulation, the data is well described by the models.

Combining the mass measurements
In this section, results for the combined top quark mass measurement are presented. As inputs we use the new results presented in this paper and the published CMS measurements from the 2010 [54] and 2011 [43,48,55] analyses. To combine the results, the best linear unbiased estimate method (BLUE) [62] is used. This determines a linear combination of the input measurements which takes into account statistical and systematic uncertainties by minimizing the total uncertainty of the combined result. The procedure takes account of the correlations that exist between the different uncertainty sources through the use of correlation coefficients. These are chosen to reflect the current knowledge of the uncertainties for both the correlations between measurements in a given decay channel from different years (ρ chan ) and between the measurements in different decay channels from the same year (ρ year ). The nominal values are set to either zero for uncorrelated or unity for fully correlated (see Table 6). Because the measurements from the 2012 analyses are significantly more precise, both statistically and systematically, than those from the 2010 and 2011 analyses, the use of unity coefficients for ρ chan and ρ year is problematic. To mitigate this, we have chosen to perform combinations in which the correlation coefficients are limited to value of less than unity. This has been done by setting the correlation coefficients for each pair of measurements in the fully correlated cases to ρ = σ i /σ j , where σ i and σ j are the uncorrelated components of the uncertainties in measurements i and j, respectively, and σ i < σ j . For all of the measurements, the statistical uncertainties are assumed to be uncorrelated.

Measurement permutations
The precision of any combination of the measurements will be dominated by and all-jets decay channels are varied. For simplicity of discussion, these are classified according to the type of fit used for each channel. They are labeled as follows: 2 for a 2D fit, 1 for a 1D or AMWT fit, and h for a hybrid fit. Thus a lepton+jets:all-jets:dilepton fit is denoted 211 in the case of a 2D fit for the lepton+jets channel a 1D fit for the all-jets channel and the AMWT fit in The most precise set of non-hybrid measurements corresponds to the set 211, which gives a To verify that this gives the most precise combination, combinations are performed using the other permutations of the 2012 measurements. The results, listed in Table 7, are in good agreement with the 211 result but have less precision, as expected.
For the hybrid results, the effect of constraining the JSF factor in the mass fits can be examined. There are three significant new permutations to consider, the h11, hh1, and 2h1 combinations. The results, shown in Table 7, are in good agreement with the 211 result, with the h11 and hh1 combinations giving the most precise measurements, as expected. For these the results are

Anticorrelation effects
For the results presented here, the signs of most of the uncertainty contributions are well defined (i.e. for a 1σ shift in a given quantity, the statistical component of the estimated systematic uncertainty is significantly smaller than the value of the uncertainty). This allows a comparison of the signs of the systematic uncertainties for the different channels and for the different fitting techniques. An anticorrelation (i.e. opposite signs) is observed between several of the terms when comparing the results from a 2D and a 1D (or AMWT) fit. However, if the 2D fit is replaced by the corresponding hybrid result, the anticorrelations are removed. This is illustrated in Fig. 11, which shows the uncertainty correlations between the lepton+jets and all-jets channels for the 2D vs. 1D and the hybrid vs. 1D cases. In the 2D vs. 1D plot (Fig. 11 left) we observe a significant number of anticorrelated terms (coming primarily from the JES and pileup terms), whereas in the hybrid vs. 1D plot (Fig. 11 right) we see no significant anti-correlations. Given the uncertainty terms that vary between the 2D and hybrid treatments, it is believed that the observed effect arises from the variation in the JSF factors between the 2D, 1D, and hybrid results (Sections 5.1 and 5.2).
These effects are not considered in the standard 211 combination as the input correlation coefficients are positive for all of the correlated cases (see Table 6). To estimate the effect of including anticorrelations, the correlation coefficients are set to negative values for the cases where an anticorrelation (opposite sign) is observed and positive values where a positive (same sign) or neutral (statistically limited) correlation is observed and the 211 combination analysis is repeated. This gives a result of 172.40 ± 0.13 (stat+JSF) ± 0.47 (syst) GeV. Thus, while the result for the mass is unchanged, the systematic uncertainty is decreased and becomes comparable to that achieved in the hybrid combinations.

Results
Based on the expected uncertainties for each of the individual measurements (Tables 1-3) and the consistency of the hybrid and 1D results for the JSF (Sections 5.1, 5.2), the hh1 combination is chosen as the preferred result. Combining the seven input measurements (four from √ s = 7 TeV and three from this analysis) gives a combined top quark mass measurement of m t = 172.44 ± 0.13 (stat+JSF) ± 0.47 (syst) GeV, for which the combination χ 2 is 2.5 for six degrees of freedom, corresponding to a probability of 87%. This is compared to the full set of Run 1 measurements in Fig. 12 where the current world average [7] and Tevatron [8] combinations are also shown. The result is consistent with all of the published LHC measurements and is the most precise measurement to date with a precision of 0.3%. The correlations between each of the measurements is shown in Table 8. Figure 13 shows the combination coefficients and pulls, where the pull is defined as (m comb top − m meas top ) / √ σ 2 meas − σ 2 comb where m comb top and m meas top are the combined and the individual measurements of m t , respectively, and σ comb and σ meas are the corresponding total uncertainties. The 2010 measurement contributes very little to the overall result. As the treatment of the systematic uncertainty for this analysis is the least sophisticated of the seven measurements, the final combination is repeated to verify that it does not influence the final result. Excluding this measurement produces negligible changes in the values of m t or its total uncertainty, δm t . For the combination of the  remaining six measurements the χ 2 is 2.3 for five degrees of freedom, corresponding to a probability of 80%.
The breakdown of the systematic uncertainties for the combination is shown in Table 9. The dominant uncertainty in the measurement arises from the modeling of the hadronization, with 0.33 GeV coming from the flavor-dependent jet energy corrections and a further 0.14 GeV coming from the b jets. There are a further six terms with uncertainties in the range of 0.11-0.12 GeV. Of these, four are coming from theory and only two, the JEC in situ (0.12 GeV) and the JEC Uncorrelated non-pileup (0.10 GeV) are experimental. The theoretical uncertainties are computed using the same models so they should be fully correlated. For the two experimental terms, the strength of the assumed correlations is varied by 50% of their nominal values to check the sensitivity to the assumed correlation strength. In both cases this produces changes of less than 0.01 GeV in m t and δm t . We therefore conclude that the result is quite stable against reasonable changes in the assumed correlation strength.
Although we do not believe that the use of 100% correlation strengths is appropriate to use for the correlated systematic uncertainties, for completeness we have re-run the final combination without the constraint on the correlation strengths. In this case we observe shifts of −0.28 GeV in m t and −0.03 GeV in δm t . For this combination, four of the seven measurements have negative combination coefficients and the central mass lies outside of the boundaries of

Summary
A new set of measurements of the top quark mass has been presented, based on the data recorded by the CMS experiment at the LHC at √ s = 8 TeV during 2012, and corresponding to a luminosity of 19.7 fb −1 . The top quark mass has been measured in the lepton+jets, all-jets and dilepton decay channels, giving values of 172.35 ± 0.16 (stat) ± 0.48 (syst) GeV, 172.32 ± 0.25 (stat) ± 0.59 (syst) GeV, and 172.82 ± 0.19 (stat) ± 1.22 (syst) GeV, respectively. Individually, these constitute the most precise measurements in each of the decay channels studied. When combined with the published CMS results at √ s = 7 TeV, a top quark mass measurement of 172.44 ± 0.13 (stat) ± 0.47 (syst) GeV is obtained. This is the most precise measurement of m t to date, with a total uncertainty of 0.48 GeV, and it supersedes all of the previous CMS measurements of the top quark mass.
The top quark mass has also been studied as a function of the event kinematical properties in the lepton+jets channel. No indications of a kinematical bias in the measurements is observed and the data are consistent with a range of predictions from current theoretical models of tt production.