The purpose of this project is to use publicly available data from observations intended to identify exoplanets in order to determine whether or not a given stellar system contains one or more exoplanets.
An exoplanet is any planet not in our solar system. Since the early 1990s, thousands of exoplanets have been discovered. Exoplanets are identified through a variety of methods, including, but not limited to, transit light curve analysis, direct imaging, and radial velocity measurements. One of the most successful attempts was the Kepler Mission, which studied light curves of hundreds of thousands of stars and identified over 2,500 planets. Each detection technique is biased towards detecting different types of planets at different distances from stars, so a variety of methods is necessary to fully explore the exoplanetary parameter space.
As is exemplified by the Kepler Mission, identifying exoplanets – using any method – relies on massive datasets and signals that can be subtle and noisy. In fact, the first exoplanet was observed many years before it was identified because the researchers did not originally notice it in the data. An unfortunate side effect of this delay was that the “first” discovered exoplanet was announced in the interim despite being observed years later. Machine learning may be a useful avenue for systematically, accurately, and quickly identifying planets and avoiding such situations.
Total project length: 175/350 hours.
Please DO NOT contact mentors directly by email. Instead, please email ml4-sci@cern.ch with Project Title and include your CV. The mentors will then get in touch with you.