Step 1: Skim the initial datasets
Overview
Teaching: 5 min
Exercises: 10 minQuestions
What happens in the skimming step?
Objectives
Perform this step of the analysis by yourself
In this step, the NanoAOD files containing data and simulated events are pre-processed. This step is called skimming since the event selection reduces the size of the datasets significantly. In addition, we perform a pair selection to find from the muon and tau collections the pair which is most likely to have originated from a Higgs boson.
This step is implemented in the file skim.cxx
and is written in C++ for performance reasons. To compile and run the program, use the script skim.sh
. Note that you may need to change the compiler in the script based on your system.
Execute the following command to run the skimming:
mkdir -p $HOME/awesome-workshop/skims
bash skim.sh root://eospublic.cern.ch//eos/root-eos/HiggsTauTauReduced/ $HOME/awesome-workshop/skims
In case you want to download the files first, for example if you want to run many times, execute the following two commands. The overall size of the initial samples is 6.5 GB.
mkdir -p $HOME/awesome-workshop/samples $HOME/awesome-workshop/skims
bash download.sh $HOME/awesome-workshop/samples
bash skim.sh $HOME/awesome-workshop/samples $HOME/awesome-workshop/skims
The results of this step are files in $HOME/awesome-workshop/skims
with the suffix *Skim.root
.
Key Points
We reduce the initial datasets by filtering suitable events and the selection of the interesting observables.
This step includes finding the interesting muon-tau pair in each selected event.