
Machine-Learning-project-end-to-end

About this file:

California Housing Prediction

Target variable: Median House Value (continuous)

Model: supervised regression model

  1. longitude: A measure of how far west a house is; a higher value is farther west
  2. latitude: A measure of how far north a house is; a higher value is farther north
  3. housingMedianAge: Median age of a house within a block; a lower number is a newer building
  4. totalRooms: Total number of rooms within a block
  5. totalBedrooms: Total number of bedrooms within a block
  6. population: Total number of people residing within a block
  7. households: Total number of households, a group of people residing within a home unit, for a block
  8. medianIncome: Median income for households within a block of houses (measured in tens of thousands of US Dollars)
  9. medianHouseValue: Median house value for households within a block (measured in US Dollars)
  10. oceanProximity: Location of the house w.r.t. the ocean/sea
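
A minimal sketch of loading and inspecting the dataset with pandas; the file name housing.csv is an assumption (it is what the Kaggle copy of this dataset is called):

import pandas as pd

# housing.csv - assumed local copy of the dataset described above
housing = pd.read_csv("housing.csv")

housing.info()              # column types and missing-value counts
print(housing.describe())   # summary statistics for the numeric columns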

About Dataset:

Context:

This is the dataset used in the second chapter of Aurélien Géron's book 'Hands-On Machine Learning with Scikit-Learn and TensorFlow'. It serves as an excellent introduction to implementing machine learning algorithms because it requires rudimentary data cleaning, has an easily understandable list of variables, and sits at an optimal size between being too toyish and too cumbersome.

The data contains information from the 1990 California census. So although it may not help you with predicting current housing prices like the Zillow Zestimate dataset, it does provide an accessible introductory dataset for teaching people about the basics of machine learning.

Requirements:

  1. GitHub

  2. Heroku or Render

  3. Cloud (AWS, GCP, Azure)

  4. PyCharm or VS Code

Create the environment:

conda create -p venv python=3.7 -y

Activate the environment:

conda activate venv/

Create requirements.txt, then install the packages:

pip install -r requirements.txt
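
A plausible requirements.txt for a project like this; the exact packages and versions are assumptions (Flask is inferred from the app:app target the Dockerfile passes to gunicorn):

# assumed contents - typical packages for this kind of project
pandas
numpy
scikit-learn
flask
gunicorn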

Dockerfile

FROM python:3.7                        # base image (Python 3.7 on a Linux OS)
COPY . /app                            # copy the project into /app
WORKDIR /app                           # set the working directory
RUN pip install -r requirements.txt    # install requirements
EXPOSE $PORT                           # port number sent from the environment
CMD gunicorn --workers=4 --bind 0.0.0.0:$PORT app:app
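
To build and run the image locally (the image name and port value are illustrative):

docker build -t housing-app .
docker run -e PORT=5000 -p 5000:5000 housing-app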

Render app

  • Since Heroku is no longer available (its free tier was discontinued), this project is deployed with Render instead.

Housing folder:

  • Package for the project

  • __init__.py - marks the housing folder as a package

  • It contains the required subpackages:

  • exception - prints detailed error information for failures anywhere in the project (sketched after this list)

  • logger - keeps logs to track the project's execution (sketched after this list)

  • pipeline - the combination of every component

  • config - the input to every pipeline component

  • entity - the output of each pipeline component

  • components - each stage of the pipeline

  • util - helper functions

  • constant - the variables needed across the housing package

  • Every folder should contain an __init__.py file.
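
A minimal sketch of what the exception and logger modules might contain; the module layout and message format are assumptions, not the repository's exact code:

# housing/logger/__init__.py (assumed layout)
import logging
import os
from datetime import datetime

LOG_DIR = "logs"
os.makedirs(LOG_DIR, exist_ok=True)
LOG_FILE = os.path.join(LOG_DIR, f"log_{datetime.now():%Y%m%d_%H%M%S}.log")

logging.basicConfig(
    filename=LOG_FILE,
    level=logging.INFO,
    format="[%(asctime)s] %(levelname)s %(name)s - %(message)s",
)

# housing/exception/__init__.py (assumed layout)
import sys

class HousingException(Exception):
    """Adds the file name and line number of the failure to the message."""

    def __init__(self, message):
        _, _, tb = sys.exc_info()   # traceback of the error being handled
        if tb is not None:
            message = (f"{message} [file: {tb.tb_frame.f_code.co_filename}, "
                       f"line: {tb.tb_lineno}]")
        super().__init__(message)

Raising HousingException inside an except block records exactly where the original error occurred, and the logger writes everything to a timestamped file under logs/.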

Pipeline (a minimal end-to-end sketch follows this list):

  1. Data Ingestion:

    • Bring data into the system from multiple sources

    • Split the data into train and test sets

  2. Data Validation:

    • Data range check

    • Outlier check

    • Duplicate values check

    • Null value check

    • Schema validation

    • Domain values check

    • Data drift check - flags when the statistics of new data differ from the old data

    • Imbalanced dataset check

  3. Data Transformation:

    • Perform EDA to understand the data

    • Perform feature engineering and save the fitted transformer as a pickle file

  4. Model Training:

    • Model selection (save the trained model as a pickle file)

    • Hyperparameter tuning

  5. Model Evaluation:

    • Model comparison - compare the performance of the old model with the new model (accept the new model only if it beats a set threshold)

  6. Model Pushing:

    • Model deployment
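
A minimal end-to-end sketch of the five stages above using scikit-learn; the file names, column names, choice of model, and acceptance threshold are illustrative assumptions:

import pickle

import pandas as pd
from sklearn.ensemble import RandomForestRegressor
from sklearn.impute import SimpleImputer
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# 1. Data ingestion: load the data and split it into train and test sets
housing = pd.read_csv("housing.csv")   # assumed local copy of the dataset
train_set, test_set = train_test_split(housing, test_size=0.2, random_state=42)

# 2. Data validation (one example check): count null values per column
print(train_set.isnull().sum())

# 3. Data transformation: impute and scale the numeric features, then
#    pickle the fitted transformer (column names follow the Kaggle CSV)
target, categorical = "median_house_value", "ocean_proximity"
transformer = Pipeline([("impute", SimpleImputer(strategy="median")),
                        ("scale", StandardScaler())])
X_train = transformer.fit_transform(train_set.drop(columns=[target, categorical]))
y_train = train_set[target]
with open("transformer.pkl", "wb") as f:
    pickle.dump(transformer, f)

# 4. Model training (hyperparameter tuning omitted for brevity)
model = RandomForestRegressor(random_state=42)
model.fit(X_train, y_train)
with open("model.pkl", "wb") as f:
    pickle.dump(model, f)

# 5. Model evaluation: accept the new model only if it beats a threshold
X_test = transformer.transform(test_set.drop(columns=[target, categorical]))
rmse = mean_squared_error(test_set[target], model.predict(X_test)) ** 0.5
print(f"Test RMSE: {rmse:,.0f} USD")
if rmse < 70_000:   # illustrative threshold; a real pipeline compares to the old model
    print("Model accepted - ready for model pushing (deployment)")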