Learning Representation Through Self-Supervised Learning on Real Gravitational Lensing Images

Description

Strong gravitational lensing is a promising probe of dark matter substructure and, through it, of the underlying nature of dark matter. Deep learning methods have the potential to accurately identify images containing substructure and to differentiate WIMP dark matter from other well-motivated models, including axions, axion-like particles, and warm dark matter.

Supervised classification can be difficult when the number of known objects of a particular class is very small. This is usually the case for strong gravitational lensing images, where one or more classes have far fewer samples than the others. Self-supervised learning (SSL) has been shown to outperform standard supervised machine learning models in some settings, particularly when few labels are available for supervision. Moreover, SSL can build meaningful representations from very large unlabelled datasets that would be difficult or impossible to label manually. To date, only convolutional neural networks (CNNs) have been used with SSL for strong gravitational lensing data. Transformers and hybrid models (Transformers + CNN) promise more robust representation learning but have not yet been explored by the community for this task. This project will focus on developing self-supervised learning techniques with Transformers for real strong gravitational lensing data.
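As a rough illustration of how SSL can learn from unlabelled images, the sketch below computes a contrastive loss of the NT-Xent type (the objective used in SimCLR) over pairs of embeddings. This is only one possible SSL objective; the project description does not prescribe a method. The function names, the temperature value, and the toy embeddings are assumptions made for illustration, and the code uses plain Python lists to keep the arithmetic visible. A real implementation would operate on PyTorch tensors produced by an encoder.

```python
import math


def cosine_similarity(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def nt_xent_loss(views_a, views_b, temperature=0.5):
    """NT-Xent-style loss over a batch of paired embeddings.

    views_a[i] and views_b[i] are assumed to be embeddings of two
    augmented views of the same image i (hypothetical inputs).
    """
    embeddings = list(views_a) + list(views_b)
    n = len(views_a)
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)  # index of the other view of the same image
        # Similarities to every other embedding in the batch (self excluded).
        logits = [
            cosine_similarity(embeddings[i], embeddings[j]) / temperature
            for j in range(2 * n)
            if j != i
        ]
        positive_logit = (
            cosine_similarity(embeddings[i], embeddings[positive]) / temperature
        )
        # Cross-entropy with the positive pair as the "correct class".
        total += math.log(sum(math.exp(x) for x in logits)) - positive_logit
    return total / (2 * n)


# Toy example: two images, 2-dimensional embeddings (values are made up).
views_a = [[1.0, 0.0], [0.0, 1.0]]
matched = [[1.0, 0.1], [0.1, 1.0]]      # second views close to their partners
mismatched = [[0.1, 1.0], [1.0, 0.1]]   # second views close to the wrong image

loss_matched = nt_xent_loss(views_a, matched)
loss_mismatched = nt_xent_loss(views_a, mismatched)
```

The loss is lower when the two views of the same image are embedded close together, which is the signal that drives representation learning without any class labels.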

Duration

Total project length: 175 or 350 hours.

Difficulty level

Advanced

Task ideas

Explore the use of Transformer and hybrid architectures with self-supervised learning for representation learning on real datasets. The trained model can then be fine-tuned for specific tasks such as regression or classification.
Explore the use of Equivariant Transformers with self-supervised learning for representation learning on real datasets. The trained model can then be fine-tuned for specific tasks such as regression or classification.
Expand the DeepLense functionality with self-supervised learning algorithms suitable for computer vision tasks applicable to strong gravitational lensing data.
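
To make the task ideas above more concrete, here is a minimal sketch of two preprocessing steps that a Transformer-based SSL pipeline might use: generating a randomly rotated or flipped view of an image, and splitting an image into flattened patches that serve as input tokens. Everything here is an assumption for illustration and not part of DeepLense: the function names are hypothetical, images are taken to be square single-channel grids stored as nested lists, and the choice of rotations and flips as augmentations reflects the expectation that lensing images have no preferred orientation.

```python
import random


def rotate90(image):
    """Rotate a 2D list-of-lists image by 90 degrees clockwise."""
    return [list(row) for row in zip(*image[::-1])]


def flip_horizontal(image):
    """Mirror a 2D image left to right."""
    return [row[::-1] for row in image]


def random_view(image, rng):
    """Apply a random rotation by a multiple of 90 degrees and an optional flip."""
    view = [list(row) for row in image]
    for _ in range(rng.randrange(4)):
        view = rotate90(view)
    if rng.random() < 0.5:
        view = flip_horizontal(view)
    return view


def patchify(image, patch_size):
    """Split a square image into flattened, non-overlapping patches (tokens)."""
    size = len(image)
    if size % patch_size != 0:
        raise ValueError("image size must be divisible by patch_size")
    tokens = []
    for top in range(0, size, patch_size):
        for left in range(0, size, patch_size):
            tokens.append([
                image[r][c]
                for r in range(top, top + patch_size)
                for c in range(left, left + patch_size)
            ])
    return tokens


# Toy 4x4 "image" with pixel values 0..15 (made-up data).
image = [[r * 4 + c for c in range(4)] for r in range(4)]
rng = random.Random(0)  # fixed seed so the example is reproducible

view_1 = random_view(image, rng)
view_2 = random_view(image, rng)
tokens = patchify(view_1, patch_size=2)  # 4 tokens of 4 pixels each
```

In a full pipeline, the two views would be passed through the same encoder and compared with a self-supervised objective, and the patch tokens would be linearly projected before entering the Transformer.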

Expected results

Develop a self-supervised learning transformer model for DeepLense training and inference.

Requirements

Python, PyTorch and relevant past experience in Machine Learning.

Mentors

Sergei Gleyzer (University of Alabama)
Michael Toomey (Massachusetts Institute of Technology)
Pranath Reddy (BITS Pilani Hyderabad)
Emanuele Usai (University of Alabama)
Saranga Mahanta (Institut Polytechnique de Paris)
Kartik Sachdev (RWTH Aachen)

Please DO NOT contact mentors directly by email. Instead, please email ml4-sci@cern.ch with Project Title and include your CV and test results. The relevant mentors will then get in touch with you.

Corresponding Project

DEEPLENSE