Skip to content

Documents classification using KNN Algorithm a graph based approach along with scrapped data

Notifications You must be signed in to change notification settings

MS1034/document-classification-using-KNN

Repository files navigation

Graph-Based Document Classification using KNN's

Description

This project aims to implement a document classification system using graph theory principles. By representing documents as graphs and leveraging graph-based features, the system can categorize documents into predefined topics with improved accuracy compared to traditional vector-based models.

Table of Contents

Installation

To install and set up the project, follow these steps:

  1. Clone the repository to your local machine.

Features

  • Representation of documents as directed graphs.
  • Extraction of graph-based features using common subgraph identification techniques.
  • Classification of documents using the K-Nearest Neighbors (KNN) algorithm based on graph similarity measures.

Contributing

Contributions to the project are welcome! If you'd like to contribute:

  1. Fork the repository.
  2. Create a new branch for your feature or bug fix.
  3. Make your changes and commit them with clear messages.
  4. Push your changes to your fork.
  5. Submit a pull request, clearly describing the changes implemented.

License

This project is licensed under the MIT License.

Credits

Sir Waqas Ali

Contact

For any inquiries or feedback, please contact: