Purdue-Simplilearn-AI-ML

Projects done under Purdue-Simplilearn PGP AI & ML

View the Project on GitHub lookupinthesky/Purdue-Simplilearn-AI-ML

California Housing Prices Prediction

DESCRIPTION

Background of Problem Statement

The US Census Bureau has published California Census Data which has 10 types of metrics such as the population, median income, median housing price, and so on for each block group in California. The dataset also serves as an input for project scoping and tries to specify the functional and nonfunctional requirements for it.

Problem Objective

The project aims at building a model of housing prices to predict median house values in California using the provided dataset. This model should learn from the data and be able to predict the median housing price in any district, given all the other metrics.

Districts or block groups are the smallest geographical units for which the US Census Bureau publishes sample data (a block group typically has a population of 600 to 3,000 people). There are 20,640 districts in the project dataset.

Domain: Finance and Housing

Analysis Tasks to be performed

  1. Build a model of housing prices to predict median house values in California using the provided dataset.

  2. Train the model to learn from the data to predict the median housing price in any district, given all the other metrics.

Solution

View Solution - Jupyter Notebook

View entire Project on Github