Project 4: Can you predict that? (4 days)

Background

The clean air act of 1970 was the beginning of the end for the use of asbestos in home building. By 1976, the U.S. Environmental Protection Agency (EPA) was given authority to restrict the use of asbestos in paint. Homes built during and before this period are known to have materials with asbestos https://www.asbestos.com/mesothelioma-lawyer/legislation/ban/.

The state of Colorado has a large portion of their residential dwelling data that is missing the year built and they would like you to build a predictive model that can classify if a house is built pre 1980. They would also like you to build a model that predicts (regression) the actual age of each home.

Colorado gave you home sales data for the city of Denver from 2013 on which to train your model. They said all the column names should be descriptive enough for your modeling and that they would like you to use the latest machine learning methods.

Data:

Download: dwellings_denver.csv, dwellings_ml.csv, dwellings_neighborhoods_ml.csv
Information: Data description

Readings:

Optional References

Grand Questions:

  1. Create 2-3 charts that evaluate potential relationships between the home variables and before1980.
  2. Can you build a classification model (before or after 1980) that has at least 90% accuracy for the state of Colorado to use (explain your model choice and which models you tried)?
  3. Will you justify your classification model by detailing the most important features in your model (a chart and a description are a must)?
  4. Can you describe the quality of your classification model using 2-3 evaluation metrics? You need to provide an interpretation of each evaluation metric when you provide the value.

Deliverables:

Use this template to submit your Client Report. The template has three sections (for additional details please see the instructional template):

  1. An “elevator pitch” that summarizes the entire case study.
  2. Answers to the grand questions that include text, pictures, and tables.
  3. An appendix that provides your commented code and justification for any programming that required you to choose an option.