9.7 Mini-Project: SQLite DB Creation

Derek Banks (dmb3ey)

Camille Leonard (cvl7qu)

Gary Mitchell (gm3gq)

For this project, we sourced the Kepler Exoplanet Search Results collected by the Kepler Space Observatory from Kaggle. We created three tables: Exoplanet_info, Transit_Properties, and Stellar_Parameters in our database. The ER diagram and corresponding schema can be found below.

ER Diagram

ER Diagram

DB Schema

Exoplanet_info(kepoi_name, koi_score, kepid, kepler_name, koi_disposition)

Transit_Properties(kepoi_name, koi_impact, kepid, koi_insol, koi_duration, koi_period, koi_time0bk, koi_depth, koi_prad, koi_teq)

Stellar_Parameters(kepoi_name, kepid, koi_steff, koi_slogg, koi_srad)

has(Exoplanet.kepoi_name, Transit_properties.kepoi_name, Stellar_Parameters.kepoi_name)

Database Creation

Queries

  1. Find the name and score of all Exoplanets that are confirmed.
  1. Find the name, disposition, and score of all Exoplanets that have a score greater than 0.95.
  1. Find the name and score of all Exoplanets that have a score greater than 0.95 and are confirmed.
  1. Find the number of exoplanets for each disposition.
  1. Of the confirmed planets, what are their photospheric temperatures?
  1. Of the confirmed planets, what are the equilibrium temperatures for the exoplanets measured in degrees Kelvin?
  1. What are the average equilibrium planet temperatures(Kelvin) and orbital periods(days) for each category of disposition?
  1. What planets have a planet temperature and orbital period similar to Earth (250 K and 350 K, 350 and 380 days)?
  1. Of the confirmed planets, what are their photospheric temperatures and the equilibrium temperatures for the exoplanets measured in degrees Kelvin?

Visualizations

The plot above shows the distribution of confidence scores among the exoplanets that are within the confirmed disposition. As you can see, a lare majority of the scores are at or near 1, meaning near absolute confidence.

The plot above shows a similar distribution, but has been expanded to look at all dispositions of the exoplanets. The confirmed category appears to be more heavily distributed towards the upper end of the specturm, seeing as the plot is wider near one. The Candidate category has a somewhat similar distribution, but doesn't appear to have nearly as many observations with a value at or near one.

The plot above shows another distribution amongst the 3 disposition classes, but this time looking at the average orbital period, or it other words, how long 1 year is for each class. On average, the confirmed class has an orbital period of close to 30 days, implying that the planets within the confirmed class are very close to their host star. If this is true, we would expect the planet temperature to be quite high. The candidate class has a much higher average orbital period. Perhaps planets close to the local star are easier to identify as being an exoplanet or not, where as celestial bodies further away present challenges to the detection methods used in identifying exoplanets.

The above plot is looking at a scatter plot distribution of the planets that had both an earth like temperature, and an earth like distance from the sun, as represented by the orbital period of the planet. It's worth noting that of all the points on the plot, only one of them is in the confirmed class, and most of the points are classified as false postives. Finding an earth like planet, even among the Kepler data, truly is like finding a needle in a haystack.

The plot above does a good job visualizing the distribution of star and planet temperatures of the confirmed planet category. As mentioned earlier, a large majority of the confirmed planets had a lower orbital period, and thus a high planet temperature, or so intuition would lead us to believe. The plot above confirms this. A large majority of the planets have temperature well above 300 K which would be roughly 27 degrees celcius. Any more than 300, and we start looking at planets that are nearly uninhabitable for human life. That being said, there do appear to be some planets that may fall within the range required by humans.