Skip to content

Chaganti-Reddy/AI-Prototype-Customer-Segmentation

Repository files navigation

Cusomer Segmentation Analysis

Artificial intelligence (AI) has the potential to revolutionize pathology. AI refers to the application of modern machine learning techniques to digital tissue images in order to detect, quantify, or characterize specific cell or tissue structures. By automating time‑consuming diagnostic tasks, AI can greatly reduce the workload and help to remedy the serious shortage of pathologists. At the same time, AI can make analyses more sensitive and reproducible and it can capture novel biomarkers from tissue morphology for precision medicine. So, we created a Machine Learning model for Customer Segmentation Analysis of E-Commerce Industry. .

Table of Contents

⚠️ Frameworks and Libraries

  • SKLearn: Simple and efficient tools for predictive data analysis
  • Seaborn: Seaborn is a Python data visualization library based on matplotlib. It provides a high-level interface for drawing attractive and informative statistical graphics.
  • Plotly: The plotly Python library is an interactive, open-source plotting library that supports over 40 unique chart types covering a wide range of statistical, financial, geographic, scientific, and 3-dimensional use-cases.
  • Matplotlib : Matplotlib is a comprehensive library for creating static, animated, and interactive visualizations in Python.
  • Numpy: Caffe-based Single Shot-Multibox Detector (SSD) model used to detect faces
  • Pandas: pandas is a fast, powerful, flexible and easy to use open source data analysis and manipulation tool, built on top of the Python programming language.

📖 Data Preprocessing

Data pre-processing is an important step for the creation of a machine learning model. Initially, data may not be clean or in the required format for the model which can cause misleading outcomes. In pre-processing of data, we transform data into our required format. It is used to deal with noises, duplicates, and missing values of the dataset. Data pre-processing has the activities like importing datasets, splitting datasets, attribute scaling, etc. Preprocessing of data is required for improving the accuracy of the model.

🔗 Download

The dataset is now available here !

🔑 Prerequisites

All the dependencies and required libraries are included in the file requirements.txt See here

🚀  Installation

  1. Clone the repo
$ git clone https://github.com/Chaganti-Reddy/AI-Prototype-Customer-Segmentation.git
  1. Change your directory to the cloned repo
$ AI-Prototype-Customer-Segmentation
  1. Now, run the following command in your Terminal/Command Prompt to install the libraries required
$ pip3 install -r requirements.txt

💡 How to Run

  1. Open terminal. Go into the cloned project directory and type the following command:
$ python3 Customer-Segmentation.py

🔑 Results

Silhouette intra-cluster score:

• Word Cloud Analysis:

• Decomposed Data:

• Report via PCA:

• Customers Morphology

• Confusion Matrix:

Check out the report here

👏 And it's done!

Feel free to mail me for any doubts/query :email: chagantivenkataramireddy1@gmail.com


🙋 Citation

You are allowed to cite any part of the code or our dataset. You can use it in your Research Work or Project. Remember to provide credit to the Maintainer Chaganti Reddy by mentioning a link to this repository and her GitHub Profile.

Follow this format:

  • Author's name - Chaganti Reddy
  • Date of publication or update in parentheses.
  • Title or description of document.
  • URL.

🔰 Future Goals

This study endeavoured to present Customer Segmentation of E-Commerce Industry using an a-priori approach to categorize potential buyers into sub-segments of old and new customers and their purchases.

❤️ Owner

Made with ❤️  by Chaganti Reddy

👀 License

MIT © Chaganti Reddy