Observation of the decay B 0 → D 0 K + K − and evidence for B 0 s → D 0 K + K −

The first observation of the decay B0 → D0K+K− is reported from an analysis of 0.62 fb−1 of pp collision data collected with the LHCb detector. Its branching fraction is measured relative to that of the topologically similar decay B0 → D0π+π− to be B ( B0 → D0K+K− ) B ( B0 → D0π+π− ) = 0.056± 0.011± 0.007 , where the first uncertainty is statistical and the second is systematic. The significance of the signal is 5.8σ. Evidence, with 3.8σ significance, for B0 s → D0K+K− decays is also presented. The relative branching fraction is measured to be B ( B0 s → D0K+K− ) B ( B0 → D0K+K− ) = 0.90± 0.27± 0.20 . These channels are of interest to study the mechanisms behind hadronic B decays, and open new possibilities for CP violation analyses with larger data sets. Submitted to Phys. Rev. Lett. †Authors are listed on the following pages. ar X iv :1 20 7. 59 91 v2 [ he pex ] 3 1 A ug 2 01 2

The precise measurement of the angle γ of the CKM Unitarity Triangle [1,2] is one of the primary objectives of flavour physics experiments. Prior to the start of LHC datataking, the combination of measurements with the decay mode B + → DK + , where D denotes a neutral charmed meson that is an admixture of D 0 and D 0 , gave a constraint on γ with an uncertainty of around 20 • [3]. Recent results from LHCb on B + → DK + [4] have helped to reduce this uncertainty, but the use of additional channels to improve further the precision is of great interest. The as-yet unobserved decay B 0 s → Dφ is one of the modes with potential to make a significant impact on the overall determination of γ [5][6][7]. Moreover, a Dalitz plot analysis of B 0 s → DK + K − can further improve the sensitivity to γ due to heightened sensitivity to interference effects, as well as allowing a determination of φ s , the CP -violating phase in the B 0 s -B 0 s system, with minimal theoretical uncertainties [8].
The first step in the programme towards the measurement of γ using the B 0 s → DK + K − decay is the observation of the channel. In this Letter the results of a search for neutral B meson decays to D 0 K + K − are presented. The quantities measured include small contributions from decays to D 0 K + K − . The inclusion of charge conjugate modes is implied throughout.
The analysis uses 0.62 fb −1 of LHC collision data at a centre-of-mass energy of 7 TeV collected with the LHCb detector during 2011. In high energy pp collisions all b hadron species are produced, so both B 0 and B 0 s decays are searched for simultaneously. The decay B 0 → D 0 K + K − can be mediated by the decay diagrams shown in Fig. 1. These are a W -exchange diagram similar to that for the decay B 0 → D − s K + [9,10] (in this case an excited state that decays to D 0 K − , such as D * − s2 (2573), would be produced), and a colour-suppressed tree diagram producing D 0 h 0 , where h 0 is a light unflavoured meson such as a 0 (980) that subsequently decays to K + K − . Related B decays with ss production, B + → D 0 K + K ( * )0 [11] and B + → D ( * )− s K + π + [12,13], have been measured to have branching fractions of O(10 −4 ). The LHCb detector [14] is a single-arm forward spectrometer covering the pseudorapidity range 2 < η < 5, designed for the study of particles containing b or c quarks. The detector includes a high precision tracking system consisting of a silicon-strip vertex detector surrounding the pp interaction region, a large-area silicon-strip detector located upstream of a dipole magnet with a bending power of about 4 Tm, and three stations of silicon-strip detectors and straw drift tubes placed downstream. The combined tracking system has momentum resolution ∆p/p that varies from 0.4% at 5 GeV/c to 0.6% at 100 GeV/c, and impact parameter (IP) resolution of 20 µm for tracks with high transverse momentum (p T ). Charged hadrons are identified using two ring-imaging Cherenkov (RICH) detectors. Photon, electron and hadron candidates are identified by a calorimeter system consisting of scintillating-pad and pre-shower detectors, an electromagnetic calorimeter and a hadronic calorimeter. Muons are identified by a system composed of alternating layers of iron and multiwire proportional chambers. The trigger consists of a hardware stage, based on information from the calorimeter and muon systems, followed by a software stage which applies a full event reconstruction. In this analysis, signal candidates are accepted if one of the final state particles created a cluster in the calorimeter with sufficient transverse energy to fire the hardware trigger. Events that are triggered at hardware level by the decay products of the other b hadron in the pp → bbX event are also retained.
The software trigger requires characteristic signatures of b hadron decays: at least one track, with high p T and a large IP with respect to any primary interaction vertex (PV) [15], that subsequently forms part of a two-, three-or four-track secondary vertex with a high sum of the p T of the tracks and significant displacement from the PV [16]. In the offline analysis, the software trigger decision is required to be due to the candidate signal decay.
Candidates that are consistent with the decay chain B 0 (s) → D 0 K + K − , D 0 → K + π − are selected. In order to reduce systematic uncertainties in the measurement, the topologically similar decay D 0 π + π − , which has previously been well studied [17,18], is used as a normalisation channel. The D 0 candidate invariant mass is required to satisfy 1844 < m Kπ < 1884 MeV/c 2 . Tracks are required to be consistent with either the kaon or pion hypothesis, as appropriate, based on particle identification (PID) information from the RICH detectors. All other selection criteria were tuned on the D 0 π + π − channel. The large yields available in the normalisation sample allow the selection to be based on data, though the efficiencies are determined using Monte Carlo (MC) simulated events. For the simulation, pp collisions are generated using Pythia 6.4 [19] with a specific LHCb configuration [20]. Decays of hadronic particles are described by EvtGen [21]. The interaction of the generated particles with the detector and its response are implemented using the Geant4 toolkit [22] as described in Ref. [23].
Selection requirements are applied to obtain a clear signal peak in the D 0 π + π − normalisation channel. The selection includes criteria on the track quality of the tracks forming the signal candidate, their p, p T and inconsistency with originating from the PV (χ 2 IP ). Requirements are also placed on the corresponding variables for candidate composite particles (D 0 , B 0 (s) ) together with restrictions on the consistency of the decay fit (χ 2 vertex ), the flight distance significance (χ 2 flight ), and the angle between the momentum vector and the line joining the PV to the B 0 (s) vertex (cos θ dir ) [24]. Further discrimination between signal and background categories is achieved by calculating weights for the remaining D 0 π + π − candidates [25]. The weights are used by the NeuroBayes neural network package [26] to maximise the separation between categories. A total of 15 variables are used in the network. They include the χ 2 IP of the four candidate tracks, the χ 2 IP , χ 2 vertex , χ 2 flight and cos θ dir of the D 0 and B 0 (s) candidates, and the B 0 (s) candidate p T . Variables describing the p T asymmetry and track multiplicity in a 1.5 rad cone [4] around the B 0 (s) candidate flight direction are also used. The input quantities to the neural network only depend weakly on the kinematics of the B 0 (s) decay. A requirement on the network output is imposed that reduces the combinatorial background by an order of magnitude while retaining about 80 % of the signal. No bias is observed by using data driven selection requirements.
To improve the B 0 (s) candidate invariant mass resolution, the four-momenta of the tracks from the D 0 candidate are adjusted so that their combined invariant mass matches the world average value [3]. An additional B 0 mass constraint is applied in the calculation of the Dalitz plot coordinates, which are used in the determination of event-by-event efficiencies. A small fraction (∼ 5 % within the mass range described below) of candidates with invariant masses far from the B 0 (s) peak fail the mass constrained fit, and are removed from the analysis.
To remove a large potential background from B 0 → D * − (2010)π + , candidates in the D 0 π + π − sample are rejected if m Dπ -m D (for either pion charge) lies within ±2.5 MeV/c 2 of the nominal D * − -D 0 mass difference [3]. Candidates in the D 0 K + K − sample are also rejected if the invariant mass difference calculated under the pion mass hypothesis satisfies the same criterion. This removes 3.3 % of D 0 K + K − candidates. Less than 1 % of D 0 K + K − combinations are rejected by requiring that the pion from the D 0 candidate together with the two kaons do not form an invariant mass in the range 1950-1975 MeV/c 2 , which removes potential background from B 0 s → D ∓ s K ± decays. After all selection requirements are applied, less than 1 % of events with at least one candidate also contain a second candidate. Such multiple candidates are retained and treated the same as other candidates; the associated systematic uncertainty is negligible.
In addition to combinatorial background, candidates may be formed from misidentified or partially reconstructed B 0 (s) decays, or from B 0 (s) decays to identical final states but without intermediate charmed mesons (referred to below as charmless peaking background). Contributions from partially reconstructed decays are reduced by requiring the invariant mass of the B 0 (s) candidate to be above 5150 MeV/c 2 . Sources of misidentified backgrounds are investigated using simulation. Most potential sources are found to have a broad invariant mass distribution, and are absorbed in the combinatorial background shape used in the fit described below. Backgrounds from Λ 0 b → D 0 pK + and Λ 0 b → D 0 pπ + [27], B 0 → D 0 K + π − and B 0 s → D 0 K − π + decays may, however, give contributions with distinctive shapes and therefore need to be included in the fit.
The contributions from charmless peaking background are investigated using candidates, reconstructed without the D 0 mass constraint, in sideband regions around the D 0 mass. The distributions are fitted with double Gaussian signal and linear background probability density functions (PDFs). Extrapolating to the D mass signal region, 773 ± 30 (126 ± 18) charmless background decays are expected in the B 0 → D 0 π + π − (B 0 → D 0 K + K − ) distributions. No peaking background is observed in the B 0 s region. The signal yields are obtained from unbinned maximum likelihood fits to the D 0 π + π − and D 0 K + K − invariant mass distributions in the range 5150-5600 MeV/c 2 . There are 14 214 D 0 π + π − and 2990 D 0 K + K − candidates. The D 0 π + π − fit includes a double Gaussian shape for signal, together with an exponential component for partially reconstructed background, and a PDF for Λ 0 b → D 0 X decays modelled using a non-parametric function obtained from simulation. The D 0 K + K − fit includes a second double Gaussian component to account for the possible presence of both B 0 and B 0 s decays, and peaking background PDFs for Λ 0 b → D 0 pK + , B 0 → D 0 K + π − and B 0 s → D 0 K − π + , all modelled using non-parametric functions. The shape of the combinatorial background is essentially linear, but is multiplied by a function that accounts for the fact that candidates with high invariant masses are more likely to fail the B 0 (s) mass constrained fit. The result of the fit to D 0 π + π − candidates is shown in Fig. 2. There are nine free parameters in this fit: the double Gaussian peak position, core width and fraction in the core, the linear slope of the combinatorial background and the exponential shape parameter of the partially reconstructed background, and the yields of the four categories. The relative width of the broader to the core Gaussian component is constrained within uncertainty to the value obtained in simulation. The fit yields 8060 ± 150 B 0 → D 0 π + π − decays, including charmless peaking background. Since the fit to D 0 K + K − candidates, shown in Fig. 2, has more components, additional constraints are imposed in order to improve the stability of the results. The parameters of the double Gaussian shapes are constrained to be identical for B 0 and B 0 s signals, with an offset in their mean values fixed to the known B 0 -B 0 s mass difference [3]. The slope of the combinatorial component is constrained to the value obtained in the fit to D 0 mass sideband events. The exponential shape parameter is constrained to the value obtained in the D 0 π + π − fit. The fit yields 558 ± 49 B 0 → D 0 K + K − decays, including charmless peaking background, and 104 ± 29 B 0 s → D 0 K + K − decays. All background yields are consistent with their expectations within uncertainties.
The ratio of branching fractions is obtained after subtracting the charmless peaking background, and applying event-by-event efficiencies as a function of the Dalitz plot position where N is the yield obtained from the fit, N peak is the charmless peaking background contribution, and the efficiency corrected yield N corr = i W i / tot i . Here the index i runs over all candidates in the fit range, W i is the signal weight for candidate i [25] from the fit shown in Fig. 2 and tot i is the efficiency for candidate i, which depends only on its Dalitz plot position. The statistical uncertainty on the branching fraction ratio incorporates the effects of the shape parameters that are allowed to vary in the fit, the dilution due to event weighting, and the charmless peaking background subtraction. Most potential systematic effects cancel in the ratio.
The PID efficiency is measured using a control sample of D * − → D 0 π − , D 0 → K + π − decays to obtain background-subtracted efficiency tables for kaons and pions as functions of their p and p T [28]. The kinematic properties of the tracks in signal decays are obtained from simulation, allowing the PID efficiency for each event to be obtained from the tables taking into account the correlation between the p and p T values of the two tracks. The other contributions to the efficiency (detector acceptance, selection criteria and trigger effects) are determined from simulation, and validated using data. All are found to be approximately constant across the Dalitz plane, apart from some modulations seen near the kinematic boundaries.
The Dalitz plot distributions obtained from the signal weights are shown in Fig. 3. The B 0 → D 0 π + π − distribution shows contributions from the ρ 0 (770) and f 2 (1270) resonances (upper diagonal edge of the Dalitz plot) and from the D * − 2 (2460) state (horizontal band), as expected from previous studies of this decay [17,18]. The B 0 → D 0 K + K − distribution shows a possible contribution from the D * − s2 (2573) resonance, together with an enhancement of events at low K + K − invariant mass (upper diagonal edge).
The branching fraction of the B 0 s decay to D 0 K + K − is measured relative to that of B 0 to the same final state. Due to the low yield in this decay, an event-by-event efficiency correction is not used. The ratio of branching fractions is instead determined as .
(2) The ratio of fragmentation fractions is f s /f d = 0.267 +0.021 −0.020 [29]. Systematic uncertainties are assigned to both branching fraction ratios due to the following sources. The variation of efficiency across the Dalitz plot may not be correctly modelled in simulation. The difference, 6.7 %, between the nominal result for R(B 0 , B 0 ) and that obtained using Dalitz plot averaged efficiencies is conservatively taken as an estimate of the associated systematic uncertainty. The fit model is varied by scaling the B 0 s /B 0 PDF width ratio to account for their different masses, removing components with small yields, adding components for potential background from B 0 s → D * 0 K * 0 and B 0 s → D * 0 K + K − , and varying the linear parameter of the combinatorial background PDF within uncertainties from the fit to the D 0 sidebands used to estimate the charmless peaking background. Together these contribute 10.7 % (19.9 %) to R(B 0 , B 0 ) (R(B 0 s , B 0 )). An uncertainty of 1.5 % is assigned due to the charmless peaking background subtraction procedure. Possible biases in the determination of the fit parameters are investigated using MC pseudoexperiments, leading to 1.5 % (3.4 %) uncertainty on the R(B 0 , B 0 ) (R(B 0 s , B 0 )). In addition, the possible differences in the data/MC ratios of trigger and PID efficiencies between the two channels (both 2.0 %) and the effect of the D + s veto (1.7 %) affect only R(B 0 , B 0 ). The uncertainty on the quantity f s /f d (7.9 %) affects only R(B 0 s , B 0 ). The total systematic uncertainties are obtained as the quadratic sums of all contributions.
A number of cross-checks are performed to test the stability of the result. The data sample is divided by dipole magnet polarity, data taking period and trigger category. Candidates were divided based upon the hardware trigger decision into three groups; events in which a particle from the signal decay created a large enough cluster in the calorimeter to fire the trigger, events that were triggered independently of the signal decay and those events that were triggered by both the signal decay and the rest of the event. The neural network and PID requirements are tightened and loosened. The PID efficiency is evaluated using the kinematic proprties from D 0 π + π − data instead of from simulation. The charmless peaking background contribution is determined from the upper and lower D 0 mass sidebands separately. All give consistent results.
The significances of the signals are obtained from the changes in likelihood in fits to data with and without signal components, after accounting for systematic uncertainties and for charmless peaking background in B 0 → D 0 K + K − only. They are found to be 5.8 σ and 3.8 σ for B 0 → D 0 K + K − and B 0 s → D 0 K + K − respectively. In summary, the decay B 0 → D 0 K + K − has been observed for the first time, and its branching fraction relative to that of B 0 → D 0 π + π − is measured to be where the first uncertainty is statistical and the second is systematic. Using the known value of B B 0 → D 0 π + π − = (8.4 ± 0.4 ± 0.8) × 10 −4 [17], this gives B B 0 → D 0 K + K − = (4.7 ± 0.9 ± 0.6 ± 0.5) × 10 −5 , where the third uncertainty arises from B B 0 → D 0 π + π − . Evidence for the B 0 s → D 0 K + K − decay has also been found, with relative branching fraction A future study of the Dalitz plot distributions of these decays will provide insight into the dynamics of hadronic B decays. In addition, the B 0 s → D 0 K + K − decay may be used to measure the CP violating phase γ.