Skip to content

Latest commit

 

History

History
 
 

house_prices

Project Overview

In this project, we'll predict future house prices. We'll use data from the Federal Reserve, along with house price data from Zillow. We'll merge and combine this data, then use it to train a random forest model. The model will predict if house prices will increase or decrease in the future. We'll measure error using backtesting, then improve our model with new predictors.

This project can be customized to predict house prices in your metro area if you live in the US.

Project Steps

  • Load in data
  • Clean and merge data
  • Create an initial machine learning model and estimate accuracy
  • Improve the accuracy of the model
  • Run diagnostics to figure out how we can improve

Code

You can find the code for this project here

File overview:

  • prices.ipynb - a Jupyter notebook that contains all of the code.

Local Setup

Installation

To follow this project, please install the following locally:

  • JupyerLab
  • Python 3.8+
  • Python packages
    • pandas
    • yfinance
    • scikit-learn

Data

You'll need to download a few csv files to run this project. These files are included in this repo. You can also download them all here.

If you want to get newer versions: