Skip to main content

NTuple

An NTuple, in the context of particle physics, is a simplified data structure that allows for straightforward access to the parameters of interest in an event. These files store data as rows of values, similar to an spreadsheet, where each row represents a collision event and each column a variable associated with that event.

Structure and Accessibility

The structure of NTuple files is made of a series of branches, each corresponding to a specific variable from the ATLAS detector. These branches may include information on particle types, momenta, energies, and other variables that help to the understanding the particle interactions observed in the detector. NTuple files are particularly valued in educational settings due to their ease of use. They require minimal preprocessing and can be easily accessed using standard data analysis software commonly employed in educational environments, such as ROOT.

Benefits for Education

  • Simplicity: The simple, tabular format of NTuples makes them accessible for students and educators without the need for extensive background in data analysis or the complexities of particle physics detectors.
  • Flexibility: Educators can easily extract specific datasets from NTuple files to tailor lessons to various educational objectives, from simple demonstrations of particle tracks to more complex analyses like identifying particle decays.
  • Interactivity: NTuple files can be used in interactive environments that facilitate hands-on learning, such as Jupyter notebooks, where students can visualize and manipulate data in real-time.

Using NTuple Files

To work with NTuple files from the ATLAS Open Data

  • Access: Users can download NTuple files directly from the CERN Open Data portal.
  • Software: Open-source tools like ROOT can be employed to analyze the data. Tutorials and examples are provided to help users get started.
  • Analysis: Users can perform a variety of analyses, from plotting simple distributions to conducting multi-variable analyses, using the structured data provided.

Details About the Published Data

8 TeV Data for Education

The resulting format of the datasets is a TTree tuple (or ROOT ntuple) with 45 branches as detailed in the table below:

Open to view full list
Tuple branch nameTypeDescription
runNumberintnumber uniquely identifying ATLAS data-taking run
eventNumberintevent number and run number combined uniquely identifies event
channelNumberintnumber representing leptonic channel
mcWeightfloatweight of a simulated event
pvxp_nintnumber of primary vertices
vxp_zfloatz-position of the primary vertex
trigEboolsignifies whether event passes a standard electron trigger
trigMboolsignifies whether event passes a standard muon trigger
passGRLboolsignifies whether event passes a data quality assessment
hasGoodVertexboolsignifies whether the event has at least one good vertex
lep_nintnumber of preselected leptons
lep_truthMatchedvector<bool>indicates whether the lepton is matched to a truth lepton
lep_trigMatchedvector<bool>indicates whether the lepton is the one triggering the event
lep_ptvector<float>transverse momentum of the lepton
lep_etavector<float>pseudorapidity, η, of the lepton
lep_phivector<float>azimuthal angle, φ, of the lepton
lep_Evector<float>energy of the lepton
lep_z0vector<float>z-coordinate of the track associated to the lepton wrt. the primary vertex
lep_chargevector<float>charge of the lepton
lep_flagvector<int>used to implement object cuts
lep_typevector<int>number signifying the lepton type (e, mu, tau) of the lepton
lep_ptcone30vector<float>used to ensure tracking isolation of the lepton
lep_etcone20vector<float>used to ensure calorimeter isolation of the lepton
lep_trackd0pvunbiasedvector<float>d0 of the track associated to the lepton at the point of closest approach (p.c.a.)
lep_tracksigd0pvunbiasedvector<float>d0 significance of the track associated to the lepton at the p.c.a.
met_etfloatTransverse energy of the missing momentum vector
met_phifloatAzimuthal angle of the missing momentum vector
jet_nintnumber of selected jets
jet_ptvector<float>transverse momentum of the jet
jet_etavector<float>pseudorapidity, η, of the jet
jet_phivector<float>azimuthal angle, φ, of the jet
jet_Evector<float>energy of the jet
jet_mvector<float>invariant mass of the jet
jet_jvfvector<float>fraction of the total momentum of tracks in the jet
jet_trueflavvector<int>true flavor of the jet
jet_truthMatchedvector<int>information whether the jet matches a jet on truth level
jet_SV0vector<float>weight from algorithm that reconstructs secondary vertices associated with a jet
jet_MV1vector<float>weight from algorithm based on multi variate technique
scaleFactor_BTAGfloatscalefactor for btagging
scaleFactor_ELEfloatscalefactor for electron efficiency
scaleFactor_JVFSFfloatscalefactor for jet vertex fraction
scaleFactor_MUONfloatscalefactor for muon efficiency
scaleFactor_PILEUPfloatscalefactor for pileup reweighting
scaleFactor_TRIGGERfloatscalefactor for trigger
scaleFactor_ZVERTEXfloatscalefactor for z-vertex reweighting

13 TeV Data for Education

The full list of ROOT branches and variables contained within the this dataset is presented in the list below:

Open to view full list
Tuple branch nameC++ typeVariable description
runNumberintnumber uniquely identifying ATLAS data-taking run
eventNumberintevent number and run number combined uniquely identifies event
channelNumberintnumber uniquely identifying ATLAS simulated dataset
mcWeightfloatweight of a simulated event
XSectionfloattotal cross-section, including filter efficiency and higher-order correction factor
SumWeightsfloatgenerated sum of weights for MC process
scaleFactor_PILEUPfloatscale-factor for pileup reweighting
scaleFactor_ELEfloatscale-factor for electron efficiency
scaleFactor_MUONfloatscale-factor for muon efficiency
scaleFactor_PHOTONfloatscale-factor for photon efficiency
scaleFactor_TAUfloatscale-factor for tau efficiency
scaleFactor_BTAGfloatscale-factor for b-tagging algorithm at 70% efficiency
scaleFactor_LepTRIGGERfloatscale-factor for lepton triggers
scaleFactor_PhotonTRIGGERfloatscale-factor for photon triggers
trigEboolboolean whether event passes a single-electron trigger
trigMboolboolean whether event passes a single-muon trigger
trigPboolboolean whether event passes a diphoton trigger
lep_nintnumber of pre-selected leptons
lep_truthMatchedvector<bool>boolean indicating whether the lepton is matched to a simulated lepton
lep_trigMatchedvector<bool>boolean indicating whether the lepton is the one triggering the event
lep_ptvector<float>transverse momentum of the lepton
lep_etavector<float>pseudo-rapidity, η\eta, of the lepton
lep_phivector<float>azimuthal angle, ϕ\phi, of the lepton
lep_Evector<float>energy of the lepton
lep_z0vector<float>z-coordinate of the track associated to the lepton wrt. primary vertex
lep_chargevector<int>charge of the lepton
lep_typevector<int>number signifying the lepton type (e or µ)
lep_isTightIDvector<bool>boolean indicating whether lepton satisfies tight ID reconstruction criteria
lep_ptcone30vector<float>scalar sum of track pT in a cone of R=0.3 around lepton, used for tracking isolation
lep_etcone20vector<float>scalar sum of track ET in a cone of R=0.2 around lepton, used for calorimeter isolation
lep_trackd0pvunbiasedvector<float>d0 of track associated to lepton at point of closest approach (p.c.a.)
lep_tracksigd0pvunbiasedvector<float>d0 significance of the track associated to lepton at the p.c.a.
met_etfloattransverse energy of the missing momentum vector
met_phifloatazimuthal angle of the missing momentum vector
jet_nintnumber of pre-selected jets
jet_ptvector<float>transverse momentum of the jet
jet_etavector<float>pseudo-rapidity, η\eta, of the jet
jet_phivector<float>azimuthal angle, ϕ\phi, of the jet
jet_Evector<float>energy of the jet
jet_jvtvector<float>jet vertex tagger discriminant of the jet
jet_trueflavvector<float>flavor of the simulated jet
jet_truthMatchedvector<float>output of the multivariate b-tagging algorithm of the jet
photon_nintnumber of pre-selected photons
photon_truthMatchedvector<bool>boolean indicating whether the photon is matched to a simulated photon
photon_trigMatchedvector<bool>boolean indicating whether the photon is the one triggering the event
photon_ptvector<float>transverse momentum of the photon
photon_etavector<float>pseudo-rapidity of the photon
photon_phivector<float>azimuthal angle of the photon
photon_Evector<float>energy of the photon
photon_isTightIDvector<bool>boolean indicating whether photon satisfies tight identification reconstruction criteria
photon_ptcone30vector<float>scalar sum of track pT in a cone of R=0.3 around photon
photon_etcone20vector<float>scalar sum of track ET in a cone of R=0.2 around photon
photon_convTypevector<int>information whether and where the photon was converted
largeRjet_nintnumber of pre-selected large-R jets
largeRjet_ptvector<float>transverse momentum of the large-R jet
largeRjet_etavector<float>pseudo-rapidity of the large-R jet
largeRjet_phivector<float>azimuthal angle of the large-R jet
largeRjet_Evector<float>energy of the large-R jet
largeRjet_mvector<float>invariant mass of the large-R jet
largeRjet_truthMatchedvector<int>information whether the large-R jet is matched to a simulated large-R jet
largeRjet_D2vector<float>weight from algorithm for W/Z-boson tagging
largeRjet_tau32vector<float>weight from algorithm for top-quark tagging
tau_nintnumber of pre-selected hadronically decaying τ\tau-lepton
tau_ptvector<float>transverse momentum of the hadronically decaying τ\tau-lepton
tau_etavector<float>pseudo-rapidity of the hadronically decaying τ\tau-lepton
tau_phivector<float>azimuthal angle of the hadronically decaying τ\tau-lepton
tau_Evector<float>energy of the hadronically decaying τ\tau-lepton
tau_chargevector<int>charge of the hadronically decaying τ\tau-lepton
tau_isTightIDvector<bool>boolean indicating whether hadronically decaying τ\tau-lepton satisfies tight ID reconstruction criteria
tau_truthMatchedvector<bool>boolean indicating whether the hadronically decaying τ\tau-lepton is matched to a simulated τ\tau-lepton
tau_trigMatchedvector<bool>boolean signifying whether the τ\tau-lepton is the one triggering the event
tau_nTracksvector<int>number of tracks in the hadronically decaying τ\tau-lepton decay
tau_BDTidvector<float>output of the multivariate algorithm discriminating hadronically decaying τ\tau-leptons from jets
ditau_mfloatdi-τ\tau invariant mass using the missing-mass calculator
lep_pt_systvector<float>single component syst. uncert. (lepton momentum scale and resolution) affecting lep_pt
met_et_systfloatsingle component syst. uncert. (ETmissE_T^{miss} scale and resolution) affecting met_pt
jet_pt_systvector<float>single component syst. uncert. (jet energy scale) affecting jet_pt
photon_pt_systvector<float>single component syst. uncert. (photon energy scale and resolution) affecting photon_pt
largeRjet_pt_systvector<float>single component syst. uncert. (large-R jet energy resolution) affecting largeRjet_pt
tau_pt_systvector<float>single component syst. uncert. (τ\tau lepton reconstruction and energy scale) affecting tau_pt

Evolution from the 8 TeV release (2016) to the 13 TeV release (2020)

The evolution of the ATLAS Open Data and the tuple structure from the 8 TeV release in 2016 to the [13 TeV release].(./files.md) in 2020 are depicted below: