Classification of phishing URLs using machine learning in Python.
A feature extractor for URLs to generate a dataset that can be used for machine learning. The purpose of the model is to predict whether a URL is a phishing one or not.
Install all required packages from the requirements.txt file, then download the main.py, model.py, feature_extraction.py, known_shorteners.txt and the datasets to the same directory. Configure the proxy to the one you are using which can be found in feature_extractor.py. The ip address and port that it is running on is what needs configuring (if not using a proxy then remove all references to it and the switch_ip function).
Run main.py.