Evidence for Associated Production of a Single Top Quark and W Boson in pp Collisions at ffiffi s p 1⁄4 7 TeV

Evidence is presented for the associated production of a single top quark and W boson in pp collisions at ffiffi s p 1⁄4 7 TeV with the CMS experiment at the LHC. The analyzed data correspond to an integrated luminosity of 4:9 fb . The measurement is performed using events with two leptons and a jet originated from a b quark. A multivariate analysis based on kinematic properties is utilized to separate the t t background from the signal. The observed signal has a significance of 4:0 and corresponds to a cross section of 16þ5 4 pb, in agreement with the standard model expectation of 15:6 0:4þ1:0 1:2 pb.

Electroweak production of single top quarks has been first observed by the D0 [1] and CDF [2] experiments at the Tevatron.Single top quark production proceeds via three processes: the t-channel exchange of a virtual W boson, the s-channel production and decay of a virtual W boson, and the associated production of a top quark and a W boson (tW).The latter channel, which has a negligible production cross section at the Tevatron, represents a significant contribution to single top quark production at the Large Hadron Collider (LHC).Associated tW production is a very interesting production mechanism because of its interference with top quark pair production [3][4][5], its sensitivity to new physics [6][7][8], and its role as a background to SUSY and Higgs searches.The ATLAS and Compact Muon Solenoid (CMS) experiments have measured the cross section for t-channel production [9,10] while evidence for tW associated production has been presented by the ATLAS experiment [11].This Letter presents the first study from the CMS experiment of tW production in pp collisions at ffiffi ffi s p ¼ 7 TeV.
The production cross section for tW has been computed at approximate next-to-next-to-leading order (NNLO), the theoretical prediction of the cross section for tW in pp collisions at ffiffi ffi s p ¼ 7 TeV, assuming a top quark mass (m t ) of 172.5 GeV, is 15:6 AE 0:4 þ1:0 À1:2 pb [12], the first uncertainty corresponds to scale variation and the second to parton distribution function (pdf) sets.
The leading order Feynman diagrams for tW production are shown in Fig. 1.The definition of tW production in perturbative QCD mixes with top quark pair production (t t) at next-to-leading order (NLO) [4,5].Two schemes are proposed to describe the tW signal: ''diagram removal'' (DR) [3], where all NLO diagrams that are doubly resonant, such as those in Fig. 2, are excluded from the signal definition; and ''diagram subtraction'' (DS) [3,13], in which the differential cross section is modified with a gauge-invariant subtraction term, which locally cancels the contribution of t t diagrams.The DR scheme is used in this Letter, but it has been verified that the number of predicted events after full selection is consistent between the two approaches within the statistical uncertainties of the simulated samples.The differences are accounted for in the systematic uncertainties.
In the standard model, top quarks decay almost exclusively to a W boson and a b quark.The study presented here has been performed in the channels in which both W bosons decay leptonically into a muon or an electron and a neutrino, with a branching fraction BðW !'Þ ¼ ð10:80 AE 0:09Þ%, where ' ¼ e or [14].The dilepton final states of the tW process are characterized by the presence of two isolated leptons with opposite charge, a jet from the fragmentation of a b quark, and a substantial amount of missing transverse energy (E miss T ) due to the presence of the neutrinos.The primary source of background events arise from t t production, followed by Z= ?þ jets processes.
The analysis uses fits to a discriminant variable built from kinematic quantities combined with a multivariate technique.A second analysis, intended as a cross-check of the robustness of the selection, is performed using event counting.In both cases, a sample collected at ffiffi ffi s p ¼ 7 TeV by CMS, corresponding to an integrated luminosity of 4:9 fb À1 , is used.
The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the field volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter, and a brass or scintillator hadron calorimeter.Muons are measured in gas-ionization detectors embedded in the steel return yoke.Extensive forward calorimetry complements the coverage provided by the barrel and endcap detectors.A more detailed description can be found in Ref. [15].
Single top quark events in all channels have been simulated with the POWHEG event generator version 301 [16], designed to describe the full NLO properties of these processes, while MADGRAPH 5.1.1 [17] is used for t t and for the inclusive single-boson production (V þ X), where V ¼ W, Z, and X can indicate light or heavy partons.The remaining background samples are simulated using PYTHIA version 6.4.24 [18], including diboson production and QCD multijet production enriched in events with electrons or muons produced in the decay of b and c quarks, and muons from the decay of long-lived hadrons.The CTEQ 6.6M pdf sets [19] are used for all simulated samples.All generated events undergo a full simulation of the detector response using GEANT4 [20,21].The value used for the top quark mass is m t ¼ 172:5 GeV.
Approximate NNLO theoretical predictions are used to normalize t t production ( tt ¼ 163 þ11 À10 pb) [22], W þ jets and Z= Ã þ jets processes are normalized to complete NNLO calculations for the inclusive cross sections, and NLO cross sections are used for diboson processes [23].Unless otherwise stated, the theoretical values of the cross section have been used in this Letter to normalize the simulation in figures and tables.
Leptons, jets, and E miss T are reconstructed by the CMS particle flow (PF) algorithm [24], which performs a global event reconstruction and provides the full list of particles identified as electrons, muons, photons, and charged and neutral hadrons.
Events are collected using dilepton triggers with electrons or muons.The lepton transverse energy thresholds are symmetric, the highest used in these triggers is 17 GeV while the lowest is 8 GeV.The two selected leptons must originate from the same primary vertex and have opposite charge.The primary vertex used is defined as the reconstructed vertex with the highest p T of associated tracks and is required to have at least four tracks, with longitudinal (radial) distance of less than 24 (2) cm from the center of the detector.Muon (electron) candidates are required to have a transverse momentum p T > 20 GeV and pseudorapidity jj < 2:4ð2:5Þ; events with additional leptons passing looser quality criteria are vetoed.
To remove low invariant mass Drell-Yan (Z= Ã ) events, the invariant mass of the lepton pair (m '' ) is required to be greater than 20 GeV.In the ee and final states, events are also rejected if m '' is between 81 and 101 GeV, compatible with the Z boson mass; this veto removes background from Z= Ã þ jets, as well as from ZZ and WZ processes.In the ee and decay channels, a requirement is applied on the E miss T as well to further reduce the contribution from events without genuine E miss T (mostly Z= Ã þ jets and QCD multijet production).Since the E miss T resolution is degraded in events with high pileup, an additional quantity is used (tracker-E miss T ), calculated using only the charged particles associated with the primary vertex.Events are selected if both E miss T and tracker-E miss T are larger than 30 GeV.Jets are defined according to the anti-k T algorithm [25] with a distance parameter of 0.5.Jets within jj < 2:4 and with p T > 30 GeV are considered in the analysis.
Exactly one jet is required to be present in the event, and it must be identified as coming from a b quark.The identification of b jets is done according to an algorithm that reconstructs the secondary vertex of the decay of the b quark [26,27], resulting in a discriminating variable sensitive to the lifetime of b hadrons.The selection on this discriminant yields a b-tagging efficiency of 62% with a mistag rate of 1.4% for jets with p T between 50 and 80 GeV.Events with additional b-tagged jets with p T > 20 GeV are removed.After this selection, the sample is dominated by t t events and a tW signal.Additionally, events with exactly two jets, in which either one or both jets have been b tagged, are used in the fit.Three regions are defined per dilepton final state: one region with one jet that is b tagged (1j1t) where the tW signal is substantial, and two regions with two jets, where the t t background is dominant, and exactly one or two b tags are required (2j1t and 2j2t, respectively).
A smaller background comes from Z= Ã events.It is found that in high-pileup scenarios the E miss T distribution for Z= Ã events is not properly modeled by the simulation, leading to disagreement between data and simulation.To solve this problem, the Z= Ã simulation is corrected to match the missing transverse energy distribution observed in the data using events from the Z resonance.
The contributions of other backgrounds, i.e., diboson production (WW, WZ, ZZ), QCD, W þ jets, and other single top quark processes, are small, less than 1% of the selected events, and estimated from simulation.FIG. 2. Feynman diagrams for tW single top quark production at next-to-leading order that are removed from the signal definition in the DR scheme; the charge-conjugate modes are implicitly included.
The number of events in the signal and two control regions is presented for data and simulation in Table I.The approximate composition of the sample at this level is 70% t t events with 20% tW events in the signal region.In the 2j1t region the t t content represents 90% of the events, while tW events are less than 6%.In the 2j2t region, more than 95% of the events are t t events.A multivariate analysis based on boosted decision trees (''BDT'' analysis) [28,29] is used, testing the overall compatibility of the signal event candidates with the event topology of the tW associated production.Four variables are chosen to train the BDT based on their ability to separate the tW signal from the dominant t t background.These variables are H T , defined as the scalar sum of the transverse momenta of the leptons, jet, and E miss T , the p T of the system composed of the leptons, E miss T and jet, the p T of the jet with the highest energy, and the difference in angular separation, , between the direction associated to the E miss T and the closest of the two selected leptons.The distributions of H T and the p T of the system composed of the leptons, E miss T and the jet, are presented, in the signal region (1j1t), in Fig. 3.The presence of the tW signal over the background is visible in all the distributions.The distributions of the other two variables are available in the Supplemental Material [30].
The output of the BDT is a single discriminant value for every event ranging from À1 (backgroundlike) to þ1 (signal-like).The distribution of the BDT discriminant is shown for the 1j1t signal region in Fig. 4.Even if the tW signal does not peak strongly at þ1, its distribution discriminates it with respect to t t and other backgrounds.Maximum signal sensitivity is achieved through a simultaneous fit to 9 categories: the 3 BDT discriminant shapes (1j1t, 2j1t, and 2j2t) in the three final states (ee, e, and ).The two t t enriched regions are included to control the rate of this background in the signal region.
The impact of each individual source of uncertainty on the analysis has been estimated in every region and final state.The dominant systematic uncertainty that affects the rate of the tW signal is associated with the b-tagging efficiency, with values between 3% and 6% for the different final states.The b-tagging efficiency uncertainty is also important for the t t background yield, with values between 1.5% and 4.0%.The main systematic uncertainty for the t t background is due to the factorization/renormalization scale used in the simulation, up to 11%, with values around 2% for the tW signal.Also for t t, the uncertainties due to jet energy scale (7%) and the threshold used to match the matrix element generator to the parton shower model in   simulation (3%) are important.The statistical uncertainty is the largest contribution to the uncertainty of the measured cross section, with a 20% effect.The complete information about the systematic uncertainties is available in tabulated form in the Supplemental Material [30].
A binned likelihood fit is performed on the distributions of the BDT discriminant.Template shapes for the signal and backgrounds are taken from simulation.Distributions are included separately in the fit for each of the three dilepton channels (ee, e, and ) in the signal region (1j1t) and control regions (2j1t and 2j2t).Signal and background rates are allowed to vary in the fit, using the systematic uncertainties on the background rates as constraint terms in the likelihood function.The signal rate and 68% confidence level (C.L.) interval is determined using the profile likelihood method.The sources of theoretical uncertainty that affect the template shape are then considered.For each uncertainty, AE1 systematic shifts are applied to the simulated samples to obtain revised templates.Differences in signal rates found using the revised templates are taken as systematic uncertainties and are added in quadrature to the 1 interval from the fit using the baseline templates.The expected significance is evaluated using the median and central 68% of the values obtained from pseudoexperiments generated using the theoretical prediction of the standard model tW cross section.
An excess of events over the expected background is observed with a significance of 4:0, compatible with the expected significance of the tW signal, 3:6 þ0:8 À0:9 .The measured cross section, including both statistical and systematic uncertainties, is 16 þ5 À4 pb, in agreement with the standard model prediction.
The measurement can be used to determine the absolute value of the Cabibbo-Kobayashi-Maskawa matrix element jV tb j, following the same technique as in [10], assuming that jV td j and jV ts j are much smaller than jV tb j: jV tb j ¼ ffiffiffiffiffiffiffiffi ffi tW th tW s ¼ 1:01 þ0:16 À0:13 ðexp:Þ þ0:03 À0:04 ðth:Þ; where th tW is the standard model prediction computed assuming jV tb j ¼ 1.Using the standard model assumption of 0 jV tb j 2 1, a value of jV tb j ¼ 1:00 is inferred, with a 90% confidence level interval of [0.79,1.00].This is based on profile likelihood intervals, the same method used for the cross section measurement and intervals.Studies with pseudoexperiments were performed, showing the validity of the profile likelihood method in presence of the boundary jV tb j 1:0.
A second analysis (''count-based'' analysis), used as a cross-check, is performed using event counts.After the jet selection step, instead of building the BDT discriminant, events are required in addition to having H T > 60 GeV in the e channel, where no invariant mass and E miss T requirements are applied.The analysis uses a statistical model of Poisson event counts in the three dilepton final states in the signal region (1j1t) and control regions (2j1t and 2j2t).The event yield for each process in every region is affected by different sources of systematic uncertainties, equivalent to the ones calculated for the BDT analysis.These are included in the model as nuisance parameters.The same methods for the cross section measurement and the significance calculation as in the BDT analysis have been used.Figure 5 shows the event yields selected by the count-based analysis for each region, in data and simulation, in which  the simulation yields have been normalized to the outcome of the maximum likelihood fit.The observed significance of the tW signal obtained with the count-based analysis is 3:5, with an expected significance of 3:2 AE 0:9.The count-based analysis measures a cross section of 15 AE 5 pb.These results are consistent with those obtained with the BDT analysis.
In summary, using 4:9 fb À1 of data collected with the CMS experiment at the LHC, evidence has been found for the associated production of a single top quark and W boson in pp collisions at ffiffi ffi s p ¼ 7 TeV with a significance of 4:0 and a measured cross section of 16

FIG. 3 (
FIG. 3 (color online).Distributions of H T and the p T of the system composed of the leptons, E miss T and the jet, in data and simulation after jet selection in the signal region (1j1t).
FIG. 4 (color online).Distribution of the BDT discriminant in the signal region (1j1t) in data and simulation.

FIG. 5 (
FIG. 5 (color online).Event yields in data and simulation in the signal region (1j1t) and the two t t-enriched control regions for the count-based analysis.Simulation yields are scaled to the outcome of the fit.

dd
Also at University of Bucharest, Faculty of Physics, Bucuresti-Magurele, Romania.ee Also at Faculty of Physics of University of Belgrade, Belgrade, Serbia.ff Also at University of California, Los Angeles, Los Angeles, California, USA.gg Also at Scuola Normale e Sezione dell' INFN, Pisa, Italy.hh Also at INFN Sezione di Roma, Universita `di Roma ''La Sapienza,'' Roma, Italy.ii Also at University of Athens, Athens, Greece.jj Also at Rutherford Appleton Laboratory, Didcot, United Kingdom.kk Also at Paul Scherrer Institut, Villigen, Switzerland.ll Also at Institute for Theoretical and Experimental Physics, Moscow, Russia.mm Also at Albert Einstein Center for Fundamental Physics, Bern, Switzerland.nn Also at Gaziosmanpasa University, Tokat, Turkey.oo Also at Adiyaman University, Adiyaman, Turkey.pp Also at Izmir Institute of Technology, Izmir, Turkey.qq Also at The University of Iowa, Iowa City, USA.rr Also at Mersin University, Mersin, Turkey.ss Also at Ozyegin University, Istanbul, Turkey.tt Also at Kafkas University, Kars, Turkey.uu Also at Suleyman Demirel University, Isparta, Turkey.vv Also at Ege University, Izmir, Turkey.ww Also at School of Physics and Astronomy, University of Southampton, Southampton, United Kingdom.xx Also at INFN Sezione di Perugia, Universita `di Perugia, Perugia, Italy.yy Also at University of Sydney, Sydney, Australia.zz Also at Utah Valley University, Orem, USA.aaa Also at Institute for Nuclear Research, Moscow, Russia.bbb Also at University of Belgrade, Faculty of Physics and Vinca Institute of Nuclear Sciences, Belgrade, Serbia.ccc Also at Argonne National Laboratory, Argonne, USA.

TABLE I .
Event yields in the different regions.The simulation is quoted with statistical (first) and systematic uncertainties (second).When only one uncertainty is quoted, it is the total one.
þ5 Also at National Institute of Chemical Physics and Biophysics, Tallinn, Estonia.Also at Universidade Federal do ABC, Santo Andre, Brazil.e Also at California Institute of Technology, Pasadena, USA.f Also at CERN, European Organization for Nuclear Research, Geneva, Switzerland.Also at Laboratoire Leprince-Ringuet, Ecole Polytechnique, IN2P3-CNRS, Palaiseau, France.Also at Zewail City of Science and Technology, Zewail, Egypt.Also at National Centre for Nuclear Research, Swierk, Poland.Also at Universite ´de Haute Alsace, Strasbourg, France.Also at Brandenburg University of Technology, Cottbus, Germany.r Also at The University of Kansas, Lawrence, USA.Also at Institute of Nuclear Research ATOMKI, Debrecen, Hungary.Also at Eo ¨tvo ¨s Lora ´nd University, Budapest, Hungary.Also at Tata Institute of Fundamental Research -HECR, Mumbai, India.Also at University of Visva-Bharati, Santiniketan, India.w Also at Sharif University of Technology, Tehran, Iran.Also at Plasma Physics Research Center, Science and Research Branch, Islamic Azad University, Tehran, Iran.z Also at Facolta `Ingegneria Universita `di Roma, Roma, Italy.aa Also at Universita `della Basilicata, Potenza, Italy.Universita `degli Studi Guglielmo Marconi, Roma, Italy.cc Also at Universita `degli Studi di Siena, Siena, Italy.
a Deceased.b Also at Vienna University of Technology, Vienna, Austria.c d g i n o q s t u v x Also at Isfahan University of Technology, Isfahan, Iran.y bb Also at