CMS DAS Pre-Exercises

General Information

CMS is a large collaboration of three thousand people, spread over several institutions, across five continents. It is very important to bring the collaborators up to speed, learning about the CMS detector and computing software, in order to contribute to physics data analysis. CMS maintains the WorkBook as much as up-to-date as possible to help guide its users. In addition to it, several tutorials (a.k.a. HATS) and workshops are held regularly to give a hands-on experience to the users. There are tutorials covering all aspects of CMS data analysis from reconstructed physics objects (e.g. muons, electrons, photons, jets etc) to trigger, generators, statistics, machine learning, software development and access to grid and computing resources. where one learns the basic CMS software tools. CMS Data Analysis Schools (DASes) offer a boot camp model, concentrating all of the training in a very short period of time. In the beforetimes, DAS used to be in-person and 5 days long, during Covid-19 times it became fully remote, and it expanded into two weeks, with an asynchronous and a synchronous component. The CMSDAS@CERN 2023 intends to continue the in-person activities that were halted due to the pandemic.

To prepare CMSDAS participants, a series of pre-exercises are required to be completed, with the laptop that will be used during the school. The goal of the pre-exercises is to make sure all basic issues can be resolved before the actual start of the school so that participants can hit the ground running.

This instruction page was created by the FNAL team for the purposes of CMS DAS@FNALLPC schools. It was modified for the purposes of CMS DAS@CERN 2023 by the local organizing team. Links to all the past and upcoming CMS Schools can be found HERE.

What: A series of pre-exercises to exercise all the needed tools with the laptop participants will bring to CMS DAS, so that they can be ready to go from the beginning of the school.

Who: All participants of the CMS DAS schools.

Where: Espl. des Particules 1, 1211 Meyrin, Switzerland. Get directions with OpenStreetMap or Google Maps.

When: June 5-10, 2023.

Requirements: Participants must have access to a computer with internet access for which they have administrative privileges. Acceptable operating systems include Mac OS, Linux, or Windows (preferably not a tablet, Chromebook, etc.). The setup page will have more information about any additional pieces of software that must be installed or any accounts which must be obtained.

Accessibility: We are committed to making this workshop accessible to everybody. Please do not hesitate to contact the organizers (using contact details below) if you need wheelchair / scooter accessible rooms and restrooms during the school week.

Materials will be provided in advance of the workshop and large-print handouts are available if needed by notifying the organizers in advance. If we can help making learning easier for you (e.g. sign-language interpreters, lactation facilities) please get in touch (using contact details below) and we will attempt to provide them.

Instructors: TBA

Helpers: TBA

Mattermost chat: The PreExercises channel will be available once you join the CMSDAS@CERN 2023 team. Direction for how to join this Mattermost chat team can be found on the setup page.

Contact: Please email cmsdas-cern-organizers@cern.ch for more information or assistance.

Prerequisites

Before going any further, please follow the instructions on the setup page.

Schedule

Setup Obtain the necessary accounts (warning: may take some time)
00:00 1. CMS Data Analysis School Pre-Exercises - First Set How do you setup a CMSSW release?
How do you find a dataset using the Data Aggregation Service (DAS)?
What are some EDM standalone utilities and what do they do?
What is MiniAOD and how do you use it?
01:00 2. CMS Data Analysis School Pre-Exercises - Second Set How to slim a MiniAOD file?
How to know the size of a MiniAOD file?
How to use FWLite to analyze data and MC?
01:30 3. CMS Data Analysis School Pre-Exercises - Third Set How do I do an analysis with so much data that I cannot run it interactively on my computer?
What is CRAB? How do I use it to run an analysis on the grid?
How do configuration files look like?
How do I extract the luminosity of the dataset I analyzed?
05:30 4. CMS Data Analysis School Pre-Exercises - Fourth Set How do we analyze an EDM ROOT file using an EDAnalyzer?
How do we analyze an EDM ROOT file using an FWLite executable?
How do we use ROOT/RooFit to fit a function to a histogram?
06:30 5. CMS Data Analysis School Pre-Exercises - Fifth Set How do I setup git on my computer/cluster?
How do I collaborate using GitHub?
07:00 6. CMS Data Analysis School Pre-Exercises - Sixth Set What is Jupyter?
What is pyROOT?
07:30 7. CMS Data Analysis School Pre-Exercises - Seventh Set What is an image? How about a container?
What is Docker/Singularity?
Why is containerization useful?
Ummmm…how is this different from a virtual machine?
08:30 Finish

The actual schedule may vary slightly depending on the topics and exercises chosen by the instructor.