Machine Learning for Segmentation of Lensed Galaxies: Distinguishing Source Galaxies from Gravitational Lenses
Title | Machine Learning for Segmentation of Lensed Galaxies: Distinguishing Source Galaxies from Gravitational Lenses |
---|---|
Summary | Machine Learning for Segmentation of Lensed Galaxies: Distinguishing Source Galaxies from Gravitational Lenses |
Keywords | Semanitc Segmentation, Astronomical data, classification |
TimeFrame | 2023-2024 |
References | https://drive.google.com/drive/folders/1OHbp3pUHygOWheJSdYQFvOUBZ-MPIVdx |
Prerequisites | Good Programing skills, previous experience with CNN and deep learning, interest in astronomy data. |
Author | |
Supervisor | Tiago Cortinhal, Idriss Gouigah, Eren Erdal Aksoy, Margherita Grespan, Hareesh Thuruthipilly |
Level | Master |
Status | Open |
Abstract: Strong gravitational lensing (SGL) is a phenomenon in which a massive foreground objecting (lens) distorting space-time bends the signal coming from background sources. On the sky, this is visible as arc-like structures or double images of the foreground object.
Strong gravitational lenses (SGLs) are a powerful tool for addressing the dark matter problem and constraining cosmological parameters, while also serving as a natural telescope into otherwise too faint to observe celestial objects. The upcoming large-scale astronomical surveys, such as the Rubin Observatory Legacy Survey of Space and Time (LSST) and Euclid, are expected to uncover approximately 10^5 SGLs by analyzing datasets of unprecedented scale. However, detecting and accurately modelling these systems is a formidable task, necessitating the development of automated deep learning algorithms.
This master's thesis focuses on the development and training of a machine learning model for the detection of SGLs and the segmentation of lensed galaxy images, enabling the differentiation between source (background) galaxies and gravitational lenses permitting the modelling of lensed systems. The dataset utilized comprises simulated images from the Gravitational Lens Finding Challenge, simulated data based on the Euclid survey and simulated data from the Legacy Survey of Space and Time (LSST).
Research Objectives:
1. Dataset Compilation: Preprocess a diverse dataset of lensed galaxy images, encompassing data from the Gravitational Lens Finding Challenge, and the data from the Legacy Survey of Space and Time (LSST).
2. Model Development: Investigate and implement state-of-the-art machine learning techniques, with a focus on convolutional neural networks (CNNs) and transformers to design a pipeline capable of classifying and subsequently segmenting the lensed galaxy images accurately to predict the parameters of the lensing system.
3. Strong-Lens classification: Develop a state-of-the-art model for identifying SGLs with a focus on reducing false positives (FP).
4. Source-Lens Separation: Develop a novel model architecture that can effectively differentiate between the source galaxy and the gravitational lens in lensed galaxy images, considering the unique challenges posed by gravitational lensing.
5. Training and Validation: Train the model on the curated dataset, employing data augmentation and regularization techniques. Validate the model's performance using cross-validation and a range of appropriate evaluation metrics, such as accuracy, precision, recall, and F1-score. Compare the performance of the model with the existing models from the literature.
6. Generalization and Application: Assess the model's generalization abilities by testing it on different datasets, including data from various astronomical surveys. Evaluate its applicability in real-world astronomical observations and discuss potential applications.
Methodology:
1. Data Preparation: Collect lensed galaxy images from the Gravitational Lens Finding Challenge and the other two previously mentioned astronomical survey and preprocess the data.
2. Model Design: Explore various deep learning architectures and techniques to develop a robust model tailored to identify and segment lensed galaxy images and distinguish source galaxies from gravitational lenses.
3. Training and Validation: Train the model on the prepared dataset, optimizing hyperparameters and monitoring performance throughout the training process. Employ cross-validation to ensure robustness and apply the model to real data.
4. Generalization Testing: Evaluate the model's ability to generalize to different datasets, including those from the survey, to assess its practicality for broader astronomical research.
5. Comparative Analysis: Compare the proposed model's performance with existing methods, highlighting its strengths, weaknesses, and potential contributions to the field.
Expected Outcomes: This master's thesis aims to advance the field of astrophysics by providing a novel machine learning approach for identifying strong lenses and accurately segmenting the input image into the lens and source galaxies. The expected outcomes include:
1. A trained machine learning model capable of identifying SGLs from the upcoming astronomical surveys such as Euclid and LSST.
2. A trained machine learning model capable of accurately segmenting lensed galaxy images which would be used for modelling the lensing system and estimating the physical parameters.
3. Insights into the effectiveness of different deep learning architectures for this specific task.
4. An evaluation of the model's generalization capabilities to diverse astronomical datasets.
5. Recommendations for the practical application of the model in gravitational lensing studies, enhancing our understanding of the distant universe.
You can contact us at tiago.cortinhal@hh.se or idriss.gouigah@hh.se