Phishing URL Classification

Classification of phishing URLs using machine learning in Python.

A feature extractor for URLs to generate a dataset that can be used for machine learning. The purpose of the model is to predict whether a URL is a phishing one or not.

Usage

Install all required packages from the requirements.txt file, then download the main.py, model.py, feature_extraction.py, known_shorteners.txt and the datasets to the same directory. Configure the proxy to the one you are using which can be found in feature_extractor.py. The ip address and port that it is running on is what needs configuring (if not using a proxy then remove all references to it and the switch_ip function).
Run main.py.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
README.md		README.md
dataset.arff		dataset.arff
dataset_validation.arff		dataset_validation.arff
feature_extraction.py		feature_extraction.py
known_shorteners.txt		known_shorteners.txt
main.py		main.py
model.py		model.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

dataset.arff

dataset.arff

dataset_validation.arff

dataset_validation.arff

feature_extraction.py

feature_extraction.py

known_shorteners.txt

known_shorteners.txt

main.py

main.py

model.py

model.py

requirements.txt

requirements.txt

Repository files navigation

Phishing URL Classification

Usage

About

Releases

Packages

Languages

Marcus-Jon/phishing_url_classification

Folders and files

Latest commit

History

Repository files navigation

Phishing URL Classification

Usage

About

Topics

Resources

Stars

Watchers

Forks

Languages