Study of the dijet mass spectrum in pp W + jets events at =7 TeV

: We report an investigation of the invariant mass spectrum of the two jets with highest transverse momentum in pp→W+2-jet and W+3-jet events to look for resonant enhancement. The data sample corresponds to an integrated luminosity of 5.0 fb-1 collected with the CMS detector at √s=7 TeV. We find no evidence for the anomalous structure reported by the CDF Collaboration, and establish an upper limit of 5.0 pb at 95% confidence level on the production cross section for a generic Gaussian signal with mass near 150 GeV. Additionally, we exclude two theoretical models that predict a CDF-like dijet resonance near 150 GeV. We report an investigation of the invariant mass spectrum of the two jets with highest transverse momentum in pp ! W þ 2 -jet and W þ 3 -jet events to look for resonant enhancement. The data sample corresponds to an integrated luminosity of 5 : 0 fb (cid:1) 1 collected with the CMS detector at ﬃﬃﬃ s p ¼ 7 TeV . We ﬁnd no evidence for the anomalous structure reported by the CDF Collaboration, and establish an upper limit of 5.0 pb at 95% conﬁdence level on the production cross section for a generic Gaussian signal with mass near 150 GeV. Additionally, we exclude two theoretical models that predict a CDF-like dijet resonance near 150 GeV.

Study of the Dijet Mass Spectrum in pp ! W þ jets Events at ffiffi ffi s p ¼ 7 TeV S. Chatrchyan et al. * (CMS Collaboration) (Received 16 August 2012;published 21 December 2012) We report an investigation of the invariant mass spectrum of the two jets with highest transverse momentum in pp ! W þ 2-jet and W þ 3-jet events to look for resonant enhancement. The data sample corresponds to an integrated luminosity of 5:0 fb À1 collected with the CMS detector at ffiffi ffi s p ¼ 7 TeV. We find no evidence for the anomalous structure reported by the CDF Collaboration, and establish an upper limit of 5.0 pb at 95% confidence level on the production cross section for a generic Gaussian signal with mass near 150 GeV. Additionally, we exclude two theoretical models that predict a CDF-like dijet resonance near 150 GeV. DOI: 10.1103/PhysRevLett.109.251801 PACS numbers: 12.15.Ji, 12.38.Qk, 13.85.Rm, 14.80.Àj The CDF Collaboration reported evidence for an excess in the mass range 120-160 GeV in the invariant mass (m jj ) spectrum of the two leading transverse-momentum (p T ) jets produced in p " p ! W þ 2-jet events with a cross section of 4 pb [1]. The D0 Collaboration carried out a similar analysis but did not confirm the CDF result, instead setting a 95% confidence level (C.L.) upper limit of 1.9 pb on the cross section [2]. This Letter details the search for a bumplike enhancement in the m jj spectrum in events with a W boson using 5:0 fb À1 of data collected from pp collisions at ffiffi ffi s p ¼ 7 TeV with the Compact Muon Solenoid (CMS) detector at the CERN Large Hadron Collider (LHC) during 2010 and 2011.
We search for a resonance with a width consistent with detector resolution as reported by CDF. We further investigate three representative models, a technicolor T from the decay of a technicolor T [3], a leptophobic Z 0 decaying to two jets [4], and the standard model (SM) Higgs boson (m H ¼ 150 GeV) produced in association with a W boson (referred to as WH production) and decaying to a pair of jets. For the unknown state with detector resolution, we follow the convention used at the Tevatron of using the conservative WH simulation for analysis-dependent quantities like efficiencies and acceptances. The WH production cross section at the LHC is negligible compared to contributions from other SM processes, which overwhelm any contribution to this analysis from WH ! 'jj decays for m H % 125 GeV [5,6].
A detailed description of the CMS experiment can be found in Ref. [7]. The central feature of the CMS detector is a superconducting solenoid, of 6 m internal diameter, that produces an axial magnetic field of 3.8 T. Located within the field volume is the silicon pixel and strip tracker extending up to jj ¼ 2:5, as well as a lead tungstate crystal electromagnetic calorimeter (ECAL) and a brass and scintillator hadronic calorimeter (HCAL), both extending up to jj ¼ 3. Outside the field volume in the forward region (3 < jj < 5) is an iron and quartz-fiber hadronic calorimeter. Muons are measured in gas-ionization detectors embedded in the steel return yoke outside the solenoid, in the pseudorapidity range jj < 2:4. The CMS coordinate system has its origin at the center of the detector, with the z axis pointing along the direction of the counterclockwise proton beam. The azimuthal angle is denoted as , the polar angle as , and the pseudorapidity is defined as ¼ À ln½tanð=2Þ. We employ selection criteria similar to those used at the Tevatron [1,2], but modified to adapt to the higher background rates and different experimental conditions at the LHC. We also place more stringent requirements on the jet kinematics, as suggested in Ref. [8], to enhance a signal compared to the irreducible W plus jets background.
Events are selected with one well-identified and isolated lepton (muon or electron), large missing transverse energy 6 E T , and exactly two or exactly three high-p T jets. The data were collected with a suite of single-lepton triggers, mostly with a p T threshold of 24 GeV for muons and 25-32 GeV for electrons. The trigger efficiency for the selected muons (electrons) is about 94% (90%). We reconstruct muon candidates in the region jj < 2:1 by combining information from the silicon tracker and the muon detectors by means of a global fit. We identify electron candidates within jj < 1:44 and 1:57 < jj < 2:5 as clustered energy deposits in the electromagnetic calorimeter that are matched to tracks. Muon and electron candidates need to fulfill quality criteria established for the measurement of the inclusive W and Z cross sections [9]. In addition, all leptons must be well-separated from hadronic activity in the event. Jets within ancone of radius 0.3 around a lepton candidate are removed.
The muon (electron) transverse momentum must exceed 25 (35) GeV, and 6 E T must be greater than 25 (30) GeV in the muon (electron) analysis. The transverse mass M T of each W candidate must be greater than 50 GeV, where and ' and 6 E T are the azimuthal angles of the lepton and 6 E T , respectively. Events with more than one identified lepton are vetoed.
We reconstruct jets and 6 E T [9,10] with the particle-flow algorithm [11], which combines information from several subdetectors. The jet finding uses the anti-k T clustering algorithm [12] with a distance parameter of 0.5. We require j jet j < 2:4 to ensure that they lie within the tracker acceptance, and a minimum jet p T of 30 GeV. Jets must satisfy identification criteria that eliminate jet candidates originating from noisy channels in the hadron calorimeter [13]. Jet-energy corrections are applied to account for the nonlinear response of the calorimeters to the particle energies and other instrumental effects. These corrections are based on in situ measurements using dijet, þ jet, and Z þ jet data samples [14]. Overlapping minimum-bias events from other pp collisions (pileup) and the underlying event can contribute additional energy to the reconstructed jets. The median energy density due to pileup is evaluated in each event and the corresponding energy is subtracted from each jet [15]. In addition, tracks that do not originate from the primary vertex are not considered for jet clustering [16]. We verify that the procedures successfully remove the dependence of jet response on the number of interactions in a single event. The jet p T resolution varies from 15% at p T ¼ 40 GeV to 6% at p T ¼ 400 GeV [14]. We evaluate the mass resolution jj for a selected jet pair using simulation and verify it using hadronic W decays in data. We find jj to be 10% of m jj for masses around 150 GeV.
We require kp T j 1 þp T j 2 k > 45 GeV and jÁðj 1 ; j 2 Þj < 1:2, where the jets are numbered in order of decreasing p T . We retain events with exactly two or exactly three jets satisfying p T > 30 GeV and with the leading jet having p T > 40 GeV and pointing more than 0.4 rad in azimuth from the direction of the 6 E T . The selected jets and the lepton from the W decay must originate from the same primary vertex. Additionally, we impose 0:3 <p T j 2 =m jj < 0:7 to take advantage of the Jacobian nature of resonant dijet production as observed in simulation studies compared with nonresonant W plus jets production.
W production with two or more jets dominates the selected sample. Smaller contributions come from toppair and single-top decays, Drell-Yan events with two or more jets, multijet production, and WW and WZ diboson production where one W decays into leptons and the other W or Z decays into quarks.
The shapes of the m jj distributions for background processes are modeled using samples of simulated events. The MADGRAPH5 1.3.30 [17] event generator produces parton-level events with a W boson and up to four partons on the basis of matrix-element (ME) calculations. (The Tevatron experiments used the ALPGEN generator [18].) The ME-parton shower matching scale is taken to be 20 GeV [19], and the factorization and renormalization scales are set to q 2 ¼ M 2 W þ p 2 T;W . Samples of t " t and Drell-Yan events are also generated with MADGRAPH. Single-top production is modeled with POWHEG 1.0 [20]. Multijet and diboson samples (WW, WZ, ZZ) are generated with PYTHIA 6.422 [21]. PYTHIA provides the parton shower simulation in all cases, with parameters of the underlying event set to the Z2 tune [22]. The set of parton distribution functions used is CTEQ6LL [23]. A GEANT4based simulation [24] of the CMS detector is used in the production of all Monte Carlo (MC) samples. Multiple proton-proton interactions within a bunch crossing are simulated, and the triggers are emulated. All simulated events are reconstructed and analyzed with the same software as data.
We generate signal samples for the WH model using PYTHIA, with parameters corresponding a SM Higgs boson with m H ¼ 150 GeV. We use PYTHIA for technicolor generation as well. We generate leptophobic Z 0 with MADGRAPH. The authors of Refs. [3,4] provided values for masses and other parameters of the technicolor and Z 0 models that would best correspond to the signal observed by CDF.
We determine the contributions of the known SM processes to the observed m jj spectrum by means of an extended unbinned maximum-likelihood fit in the range between 40 GeV and 400 GeV. We fit separately in four event categories, f; eg Â f2-jet; 3-jetg, because the background compositions differ. The m jj signal region, 123 to 186 GeV, corresponding to AE2 jj , is excluded from this fit in order to arrive at an unbiased estimate of a possible resonant enhancement in this region. Table I lists the SM processes included in the fit. The W plus jets normalization is a free fit parameter because it is by far the dominant background. We allow the normalizations of the other background components to vary within Gaussian constraints around the central values also listed in Table I. The central values for all processes except multijet come from next-to-leading-order (NLO), next-to-next-to-leading-log (NNLL), or next-to-NLO (NNLO) calculations, and the constraints reflect the published uncertainties. We derive templates for the m jj distribution for each background from simulation except for the multijet events, which contribute when jets are misidentified as leptons. In a separate fit to events that fail the lepton isolation requirements, we determine the central value of the multijet normalization, the constraint on the normalization and the template for the m jj distribution [9]. The fit to data determines the correlations among the various fit parameters.
The default CMS MADGRAPH sample of the dominant W plus jets background does not describe well the m jj spectrum in the m jj sidebands. Four alternative samples of W events, with the scales and q increased and reduced by a factor two with respect to those of the default, fail to provide significant improvement. Thus, we employ an empirically driven combination of three shapes to describe this component in the fit model, where F Wþjets denotes the m jj shape from simulation. The parameters 0 ( 0 ) and q 0 (q 0 ) correspond to the default (alternative) values of and q, respectively, while fractional contributions and are free to vary between 0 and 1. We take 0 ¼ 2 0 or 0:5 0 (q 0 ¼ 2q 0 or 0:5q 0 ), depending on which alternative sample provides a better fit to data. Furthermore, we verify, via pseudoexperiment simulations generated with an alternate shape, that the function in the above equation has sufficient freedom to describe the W plus jets shape. Figure 1(a) shows the observed m jj distribution for all four event categories combined, together with the fitted projections of the contributions of various SM processes. Figure 1(b) shows the same distribution after subtraction of all SM contributions from data except electroweak diboson WW=WZ events. No peak is visible in the spectrum except that near 80 GeV due to diboson events. Figure 1(c) shows the bin-by-bin pull. Table II presents the yields of the SM components obtained from the fit. The sum of all the contributions is compared to the number of observed events. All numbers except those in the last two rows are for the m jj range of 40 to 400 GeV. The last two rows compare the observed number of events and the number predicted by the fit in the m jj range of 123 to 186 GeV. The data agree with the SM expectations, and we find no significant excess in the signal region. We observe a sizable deficit in the muon 2-jet data with respect to the prediction from our model. We do not observe similar deviations in the other three categories, suggesting it is a fluctuation and not a systematic bias.
We validate the fit procedure by performing pseudoexperiments. In each experiment, we generate the m jj pseudodata of the SM processes, including the correlations taken from the fit to data, and then fit each pseudodata sample. The results indicate that the bias on the total yield is below 0.2% and that the fit underestimates the total yield uncertainty by about 30%. These effects are corrected for in the final result. Uncertainties in the jet energy are estimated using a sample of W bosons decaying hadronically in a pure sample of semileptonic t " t events. The mean and resolution of the reconstructed dijet mass distribution in data agree within 0.6% with the expectation from simulation. A small difference in 6 E T resolution [10] between data and simulation affects the signal acceptance for the new physics models under consideration at the 0.5% level. Further systematic uncertainties are due to the uncertainty of the trigger efficiency estimates (1%) and the estimate of lepton reconstruction and selection efficiency (2%) [9]. The uncertainty on the integrated luminosity is 2.2% [31].
We scrutinize the dijet mass spectrum near 150 GeV, searching for a technicolor, leptophobic Z 0 , or WH resonant enhancement. We also use a generic signal model obtained by convolving a delta function centered at m jj ¼ 150 GeV with a Gaussian function having width equal to jj . Figure 1(b) shows this generic signal shape. The expected number of signal events at the LHC for a given cross section at the Tevatron can be estimated by considering the ratio of the predicted cross sections for our reference process, WH production with M H ¼ 150 GeV. This process is dominated by quark-antiquark (q " q) annihilation. As q " q processes have the smallest increase in parton luminosity from the Tevatron to the LHC, this choice provides a conservative limit. We therefore assume where WH LHC ¼ 300:1 fb [32] and WH Tevatron ¼ 71:8 fb [33]. A generic Gaussian signal normalized to Tevatron ¼ 4 pb corresponds to LHC ¼ 16:7 pb. Table III contains the values of LHC times the branching fraction to jets and of the overall efficiency times acceptance "A for the models considered.
Since we observe no resonant enhancement, we proceed to set exclusion limits using a modified frequentist CL S method [34,35] with profile likelihood as the test statistic. Inputs to the limit-setting procedure are the m jj distribution obtained by combining the SM components from the fit, the observed distribution in data, the expectation from the dijet resonance model under consideration, and the uncertainties associated with these quantities. Figure 2(a) shows the observed and expected CL S values versus cross section for a generic Gaussian signal, after combining the results of all four event categories. We set a 95% C.L. upper limit of 5.0 pb and a 99.9% C.L. upper limit of 8.5 pb on the dijet production cross section for a generic resonance with WH-like "A. Figure 2(b) compares the 95% C.L. upper limits with the expected cross sections for technicolor, leptophobic Z 0 , and WH (M H ¼ 150 GeV) signals. The technicolor and Z 0 models are excluded. Because we have minimal sensitivity to WH, we compare the limit in Fig. 2(b) to 100 times the SM cross section as an illustration.
In summary, we have studied the invariant mass spectrum of the two jets with highest transverse momentum in TABLE II. Event yields determined from maximum-likelihood fits to the data. The total fit yields are corrected for bias. The total fit uncertainties include the correlations among the various yields, as determined by the fit, and the corrections derived from the fit validation described in the text. The 2 probability uses the residuals and the data and MC statistical errors. pp ! W þ 2-jet and W þ 3-jet events, with the W decaying leptonically to a muon or electron. The analyzed data sample corresponds to an integrated luminosity of 5:0 fb À1 at ffiffi ffi s p ¼ 7 TeV. We find no evidence for a resonant enhancement near a dijet mass of 150 GeV, as reported by the CDF Collaboration, and set upper limits on the dijet production cross section of 5.0 pb at 95% C.L. and 8.5 pb at 99.9% C.L. Two theoretical models, leptophobic Z 0 and technicolor, which predict the presence of a resonant enhancement near 150 GeV, are excluded.
We thank Adam Martin and Matthew Buckley for help with simulation of technicolor and Z 0 models, respectively. We congratulate our colleagues in the CERN accelerator departments for the excellent performance of the LHC machine. We thank the technical and administrative staff at CERN and other CMS institutes, and acknowledge support from FMSR (Austria  Observed and expected 95% C.L. upper limits, with one-and two-sigma error bands, on the cross section divided by the expected values for various signal models. The limits are calculated using the CL S method. A value of the excluded cross section over the predicted cross section of less than one indicates that the model is excluded at 95% C.L. Table III lists the cross sections for these models.