Step 1: Skim the initial datasets

Overview

Teaching: 5 min
Exercises: 10 min

Questions

What happens in the skimming step?

Objectives

Perform this step of the analysis by yourself

In this step, the NanoAOD files containing data and simulated events are pre-processed. This step is called skimming since the event selection reduces the size of the datasets significantly. In addition, we perform a pair selection to find from the muon and tau collections the pair which is most likely to have originated from a Higgs boson.

This step is implemented in the file skim.cxx and is written in C++ for performance reasons. To compile and run the program, use the script skim.sh. Note that you may need to change the compiler in the script based on your system.

Execute the following command to run the skimming:

mkdir -p $HOME/awesome-workshop/skims
bash skim.sh root://eospublic.cern.ch//eos/root-eos/HiggsTauTauReduced/ $HOME/awesome-workshop/skims

In case you want to download the files first, for example if you want to run many times, execute the following two commands. The overall size of the initial samples is 6.5 GB.

mkdir -p $HOME/awesome-workshop/samples $HOME/awesome-workshop/skims
bash download.sh $HOME/awesome-workshop/samples
bash skim.sh $HOME/awesome-workshop/samples $HOME/awesome-workshop/skims

The results of this step are files in $HOME/awesome-workshop/skims with the suffix *Skim.root.

Key Points

We reduce the initial datasets by filtering suitable events and the selection of the interesting observables.

This step includes finding the interesting muon-tau pair in each selected event.

previous episode

Example analysis used for the Awesome Workshop

next episode

Step 1: Skim the initial datasets

Overview

Key Points

previous episode

next episode