One of the key tasks in particle physics analyses is proper classification of particle collision events based on the parent particles and the process that produced them. To handle this task, we’re developing a flexible machine learning pipeline which can be applied to a broad range of classification tasks. Proposals should build on last year’s work which combined the L-GATr and ParT models and implemented a track-level masked autoencoder pretraining. This project is related to the project “Foundation models for End-to-End event reconstruction” but should focus primarily on improving the existing hybrid model architectures either in speed or numerical accuracy and quantifying results.
Total project length: 175/350 hours.
Please use this link to access the test for this project.
Significant experience in Python and Machine Learning in Pytorch. Preferably some experience with Transformers and multi-GPU parallelization or with the ROOT library developed by CERN.
Advanced
Please DO NOT contact mentors directly by email. Questions should instead be directed to ml4-sci@cern.ch which is forwarded to mentors. To submit your proposal, CV, and test task solutions, please use this Google form.