- The dataset's last column must be the target (label) column.
- Declare the column names in tree.py's main function via `header = []`, entering each name in order; the last entry is the target column.
- Every datapoint is converted to a string, because at this stage the actual numeric values are neither relevant nor necessary.
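The string conversion in the last point can be sketched as a small CSV loader. This is a hypothetical helper (the name `load_dataset` is illustrative, not necessarily what tree.py uses); the point is that every value, including range labels like `'2-3'`, becomes a plain string.

```python
import csv

def load_dataset(path):
    """Read a CSV file and cast every value to str.

    Illustrative helper: treating all values as strings lets the tree
    compare range labels such as '2-3' as ordinary categorical values.
    """
    with open(path, newline="") as f:
        return [[str(v) for v in row] for row in csv.reader(f)]
```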
- Clone the repo.
- Install Python.
- In tree.py's main function, specify the training dataset file, the testing dataset file, and the headers.
- Call build_tree to build the tree; print it if needed.
- Call the test function to evaluate on the testing data, or run classify directly on a new row.
This tree will work on pretty much any dataset. Add as many feature columns as needed; Gini impurity and information gain are calculated recursively across all of them.
The tree calculates the Gini impurity for every viable partition, then uses information gain to decide the order in which questions are asked.
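The two quantities above can be computed as follows. This is a minimal sketch of the standard formulas, not necessarily the exact code in tree.py; it assumes the last column of each row is the label.

```python
from collections import Counter

def gini(rows):
    """Gini impurity of a list of rows (last column = label)."""
    counts = Counter(row[-1] for row in rows)
    n = len(rows)
    return 1 - sum((c / n) ** 2 for c in counts.values())

def info_gain(left, right, current_impurity):
    """Impurity reduction from splitting rows into left/right partitions."""
    p = len(left) / (len(left) + len(right))
    return current_impurity - p * gini(left) - (1 - p) * gini(right)
```

For example, a set of two 'Fern' rows and one 'Cactus' row has impurity 1 - ((2/3)^2 + (1/3)^2) = 4/9; a split that separates the two classes perfectly recovers all of that impurity as gain.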
- We may use different data in the future. The Data Generation file lets us create a large dataset just by inputting values as ranges: it builds the dataset by selecting a random value from each range as many times as you like. A decision tree is well suited to data generated this way.
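The generation idea can be sketched like this. The function name and signature are assumptions for illustration, not the actual Data Generation file's API: each feature is described by a (low, high) range, and every generated row draws one random value per range.

```python
import random

def generate_rows(feature_ranges, n_rows, seed=None):
    """Sketch of range-based data generation (hypothetical helper).

    feature_ranges: list of (low, high) inclusive integer ranges,
    one per feature column. Returns n_rows rows, each built by
    drawing a random value from every range.
    """
    rng = random.Random(seed)  # seed allows reproducible datasets
    return [[rng.randint(lo, hi) for lo, hi in feature_ranges]
            for _ in range(n_rows)]
```

For example, `generate_rows([(2, 3), (1, 5)], 100)` yields 100 rows whose first feature is always 2 or 3 and whose second is between 1 and 5.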
We have been given every possibility, and they are represented by the range values. Key points:
From the data we can see that users are placed into a range, for example '2-3'; we never see plants recommended for only 2 or only 3, as those bare values are not unique values in the dataset. The data also never overlaps any of the ranges. Thus we can keep each range intact and treat it as a single value.
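Because the ranges never overlap and never appear as bare numbers, a split can compare them with plain string equality. A minimal sketch (the name `partition` is illustrative, not necessarily tree.py's):

```python
def partition(rows, col, value):
    """Split rows on string equality in one column.

    Range labels like '2-3' are unique, non-overlapping categorical
    values, so exact string comparison is safe and no numeric
    parsing of the range endpoints is needed.
    """
    true_rows = [r for r in rows if r[col] == value]
    false_rows = [r for r in rows if r[col] != value]
    return true_rows, false_rows
```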