class: center, middle # Exploratory Data Analysis ## Kepler Object of Interest Dataset
Github Repo: [https://github.com/JamesMcGuigan/dataset-kepler/](https://github.com/JamesMcGuigan/dataset-kepler/) --- # KOI - Kepler Objects of Intrest
Number of KOIs
9564
Number of Solar Systems
8214
Number of Named Planets
2305
CONFIRMED KOIs
2303
CANDIDATE KOIs
2420
FALSE POSITIVE KOIs
4841
# Named False Positives [Kepler-469b](https://twitter.com/exohugh/status/1169262460504875008) - "clearly a binary fold in the wrong period" - Hugh Osborn [Kepler-503b](https://arxiv.org/abs/1805.08820) - An Object at the Hydrogen Burning Mass Limit Orbiting a Subgiant Star --- # FastAI - Neural Network Predictions Using a FastAI Neural Network to predict KOI Disposition (CONFIRMED / CANDIDATE / FALSE POSITIVE) from the other fields in the dataset, NN Config: `layers=[200, 200, 100, 100, 50, 50] weight_decay=0.1 dropout=0.5`
Only using
fpflags
and OneHot encoded
koi_comments
83.3% accuracy
20
fit_one_cycle()
epocs
Using entire dataset
89.6% accuracy
Superconvergence
5
fit_one_cycle()
epocs
--- # Exoplanet Habitability A key goal of the Kepler Space Telescope is to determine how many Earth-size and larger planets there are in or near the habitable zone (often called "Goldilocks planets") of a wide variety of spectral types of stars. The Goldilocks Zone: - Liquid Water = surface temperature between 273.2K - 373.2K - Earth Sized Rocky Planet = assumed between 0.8 - 1.7 (Earth Radii) **17 potentially habitable exoplanets found!**
KMeans Clustering for Planet Type
KMeans Clustering for Star Type
--- # Where are my habitable exoplanets? The 17 potentially habitable exoplanets plotted on the night sky!
- The bottom right of the grid is closest to the Galactic Rim, thus has a greater overall star density - Each square represents the stationary field of view of the Kepler Space Telescope - Kepler was repointed 21 times in a grid search pattern during its 9 year mission