Skip to main content

Using the Open Event Generation Data

The event generation open data is available in the HEPMC2 format, which is a text-based event generator record. Each file of around 10,000 events is compressed to save disk space. The notebook provides a brief introduction to reading the events and making use of them in a variety of ways.

What's Inside the Notebook

In this notebook, you will learn:

  • How to search the open event generation samples, identify the samples you are interested in, and get files from them
  • How to inspect files from the samples and understand the event generator records
  • How to make simple histograms from the input files
  • How to run a simple parametric simulation using the files as input and examine the simulated output

With these tools in place, you should be able to follow almost any of the other notebooks to pick samples and recreate your favorite analysis.

Resources

Downloading all the available Open Data requires significant resources. For those who do not have the computing resources to hand, there are a few options.

In Colab, the first cell of the example notebook has to be modified:

# Pip installs first
%pip install atlasopenmagic pyhepmc graphviz hist
# Now conda installs
!pip install -q condacolab
import condacolab
condacolab.install_from_url("https://github.com/conda-forge/miniforge/releases/download/25.3.1-0/Miniforge3-Linux-x86_64.sh")
# See https://github.com/conda-incubator/condacolab/issues/73
!mamba install --channel conda-forge delphes

And when accessing cards for Delphes, ${CONDA_PREFIX} should be replaced with /usr/local.

Note that the drawing via graphviz also does not work without some modifications.