The Challenge | Health Makes Wealth

Integrate NASA Earth science data and citizen science data to learn more about the connections between human, animal, and environmental health.


ProApt is a web application which enables college administrators to forecast the current risk of disease outbreak on campus as well as share this information with the student body.


To create ProApt our group used a machine learning algorithms to identify patterns in disease outbreaks in micro-communities using various environmental factors. Our ML algorithms were trained using data across all US states between 2013-2017. This included NASA's GLOBE data regarding humidity, barometric pressure and air temperature, as well as CDC data regarding the intensity of disease outbreaks. After training, we were then able to use our model to predict the risk of an outbreak given local environmental conditions. This model can then be used to notify and warn college administrators when conditions on campus are prime for a disease outbreak. ProApt also serves as a streamlined communication platform, enabling college administrator to share risk assessment information and possible mitigation techniques with the entire student body.

we used random forest regression to inform differential models of disease spread. This was used to accurately predict disease incidence over the next 100 days. Random forest regression was used for the ML algorithm. See to view runner. ML was done in alalysis_mk2.ipyb

All other files were used for data analysis. A k-d tree was used for clustering of location data for the purpose of associating CDC and GLOBE data.

Project GitHub -

Resources Used:

Globe Data -

CDC Data -


SpaceApps is a NASA incubator innovation program.