From 4aa14843fc96aab1d9e6a3ed3e49184fc209fcf7 Mon Sep 17 00:00:00 2001
From: Kati Lassila-Perini
Date: Tue, 23 Jul 2024 18:24:59 +0300
Subject: [PATCH] 01: expand and rename intro

---
 config.yaml                    |  2 +-
 episodes/01-introduction.md    | 79 ++++++++++++++++++++++++++++++++++
 episodes/02-nanoaod-miniaod.md |  6 ++-
 episodes/introduction.md       | 48 ---------------------
 4 files changed, 85 insertions(+), 50 deletions(-)
 create mode 100644 episodes/01-introduction.md
 delete mode 100644 episodes/introduction.md

diff --git a/config.yaml b/config.yaml
index bd9e7c6..5ba3617 100644
--- a/config.yaml
+++ b/config.yaml
@@ -63,7 +63,7 @@ contact: 'cms-dpoa-coordinator@cern.ch' # FIXME
 
 # Order of episodes in your lesson
 episodes:
-- introduction.md
+- 01-introduction.md
 - 02-nanoaod-miniaod.md
 - 03-nanoaod-dataset.md
 - 04-nanoaod-exercises.md
diff --git a/episodes/01-introduction.md b/episodes/01-introduction.md
new file mode 100644
index 0000000..c799b3d
--- /dev/null
+++ b/episodes/01-introduction.md
@@ -0,0 +1,79 @@
+---
+title: "Introduction"
+teaching: 10
+exercises: 5
+---
+
+:::::::::::::::::::::::::::::::::::::: questions
+
+- What have we learned in the pre-learning lessons and how can we apply it?
+- Where do we find information about physics objects in the CMS NanoAOD format?
+
+::::::::::::::::::::::::::::::::::::::::::::::::
+
+::::::::::::::::::::::::::::::::::::: objectives
+
+- Apply what we have learned in the pre-learning lessons about CMS physics objects
+- Learn about the documentation of the NanoAOD format
+
+::::::::::::::::::::::::::::::::::::::::::::::::
+
+## Data formats in CMS
+
+Most previous releases of CMS open data have been in the Analysis Object Data (AOD) format.
+This is a complex format, and specific CMS software (CMSSW) is required in order to read and analyze it.
+
+From 2015, data releases have been in a slimmed-down format called MiniAOD, which has the same essential structure and software requirements for analysis as AOD. Essentially there are fewer
+physics object collections, and often the physics objects themselves are different.
+
+For data released in 2016 and beyond, a new format called NanoAOD is used. NanoAOD is not simply a slimmed-down MiniAOD. In contrast to AOD and MiniAOD, which are stored as CMSSW C++ objects, NanoAOD is stored using ROOT TTree objects. You therefore do not need to use the CMS Virtual Machine or Docker container to analyze NanoAOD data. NanoAOD can be analyzed using the ROOT program and/or Python libraries capable of interpreting ROOT's TTree structure.
+
+In this workshop we will focus on working with open data in the NanoAOD format.
+
+## Physics objects in CMS data
+
+The recommended [CMS Physics Objects prelearning lesson](https://cms-opendata-workshop.github.io/workshop2024-lesson-physics-objects/instructor/index.html) guides you through different physics objects and explains what information is available for them in the CMS NanoAOD format.
+
+Let us now make sure that you can find that information.
+
+::::::::::::::::::::: challenge
+
+## Exercise 1: Find the NanoAOD variable description for a physics object
+
+Select a physics object of your choice in the [CMS Physics Objects lesson](https://cms-opendata-workshop.github.io/workshop2024-lesson-physics-objects/instructor/index.html) and find the corresponding variable listing from a CMS dataset record on the [CERN Open Data portal](https://opendata.cern.ch/).
+
+:::::::::::::: solution
+
+Find the NanoAOD variable listing, for example for the [SingleElectron collision dataset from 2016 RunG](https://opendata.cern.ch/record/30529). Scroll down to "Dataset semantics" and open the [variable list](https://opendata.cern.ch/eos/opendata/cms/dataset-semantics/NanoAOD/30529/SingleElectron_doc.html).
+
+Find the links to the physics object collections under "Events Content" and find the object of your choice.
+
+::::::::::::::
+
+::::::::::::::::::::
+
+::::::::::::::::::::: challenge
+
+## Exercise 2: Compare variable lists in different collision datasets.
+
+Find all collision datasets from 2016 in NanoAOD format. Compare their variable lists. Do Muon datasets contain an electron collection? Do Electron datasets contain a muon collection? Why?
+
+:::::::::::::: solution
+
+Use the search facets of the [search page](https://opendata.cern.ch/search?q=&l=list&order=desc&p=1&s=10&sort=mostrecent).
+
+Select **Collision** under Dataset, **CMS** under Experiment, **2016** under Year, and **nanoaod** under File type.
+
+Open two different collision datasets and check their variable lists.
+
+::::::::::::::
+
+::::::::::::::::::::
+
+::::::::::::::::::::::::::::::::::::: keypoints
+
+- A variable list with a brief description of each variable is linked from every CMS NanoAOD dataset record.
+- The CMS Physics Objects pre-learning lesson describes the different physics object variables in more detail.
+
+::::::::::::::::::::::::::::::::::::::::::::::::
+
diff --git a/episodes/02-nanoaod-miniaod.md b/episodes/02-nanoaod-miniaod.md
index fe97ebc..180ceee 100644
--- a/episodes/02-nanoaod-miniaod.md
+++ b/episodes/02-nanoaod-miniaod.md
@@ -19,8 +19,12 @@ exercises: 0
 
 ::::::::::::::::::::::::::::::::::::::::::::::::
 
-## Dataformats in CMS
+TO-DO we can "borrow" information from below:
+
+miniAOD links for use: [Getting started with miniAOD](https://opendata.cern.ch/docs/cms-getting-started-miniaod), [miniAOD in Workbook](https://twiki.cern.ch/twiki/bin/view/CMSPublic/WorkBookMiniAOD2016#High_level_physics_objects)
+
+nanoAOD links for use: [Getting started with nanoAOD](https://opendata.cern.ch/docs/cms-getting-started-nanoaod)
 
 
 
 ::::::::::::::::::::::::::::::::::::: keypoints
diff --git a/episodes/introduction.md b/episodes/introduction.md
deleted file mode 100644
index ac01cc2..0000000
--- a/episodes/introduction.md
+++ /dev/null
@@ -1,48 +0,0 @@
----
-title: "Introduction"
-teaching: 10
-exercises: 0
----
-
-:::::::::::::::::::::::::::::::::::::: questions
-
-- What have we learned in the pre-exercises and how can we apply it?
-- What is the structure and content of the nanoAOD format?
-
-::::::::::::::::::::::::::::::::::::::::::::::::
-
-::::::::::::::::::::::::::::::::::::: objectives
-
-- Apply what we have learned in the pre-exercises
-- Learn about the structure and content of nanoAOD
-
-::::::::::::::::::::::::::::::::::::::::::::::::
-
-## Dataformats in CMS
-
-Most previous releases of CMS open data have been in the Analysis Object Data (AOD) format.
-This is a complex format and specific CMS software (CMSSW) is required in order to read and analyze it.
-
-From 2015 data releases have been a slimmed-down format called MiniAOD, which has the same essential structure and software requirements for analysis as AOD. Essentially there are few
-physics object collections and often the physics objects themselves are different.
-
-For data released in 2016 and beyond a new format called NanoAOD is used. NanoAOD is not just simply slimmed-down MiniAOD. In contrast to AOD and MiniAOD which is stored in CMSSW C++ objects, NanoAOD is stored using ROOT TTree objects. You therefore do not need to use the CMS Virtual Machine or docker container to analyze NanoAOD data. NanoAOD can be analyzed using the ROOT program and/or python libraries capable of interpreting the ROOT's TTree structure.
-
-TO-DO we can "borrow" information from below:
-
-miniAOD links for use: [Getting started with miniAOD](https://opendata.cern.ch/docs/cms-getting-started-miniaod), [miniAOD in Workbook](https://twiki.cern.ch/twiki/bin/view/CMSPublic/WorkBookMiniAOD2016#High_level_physics_objects)
-
-nanoAOD links for use: [Getting started with nanoAOD](https://opendata.cern.ch/docs/cms-getting-started-nanoaod)
-
-
-
-::::::::::::::::::::::::::::::::::::: keypoints
-
-- Use `.md` files for episodes when you want static content
-- Use `.Rmd` files for episodes when you need to generate output
-- Run `sandpaper::check_lesson()` to identify any issues with your lesson
-- Run `sandpaper::build_lesson()` to preview your lesson locally
-
-::::::::::::::::::::::::::::::::::::::::::::::::
-
-[r-markdown]: https://rmarkdown.rstudio.com/