Skip to content

Sarthakjain1206/Intelligent-Document-Finder

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

27 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Intelligent-Document-Finder(old)

A tool which can find your any document using semantic search
New Version - https://github.com/Sarthakjain1206/Intelligent_Document_Finder

What is Intelligent Document Finder ?

How easy do you find it to remember the exact location of a document that you created last year? Not very easy, right? Big Organizations/people deal with hundreds of documents daily and forget about them, most of the time.
But what if we want that old documentation again for some work, but unfortunately you do not remember the name or the actual content of that document to retrieve it from the large storage of your computer.
In such cases, use of a Intelligent document finder can really make a huge difference. As, it can Search for the document(semantically) of your need based on a query input. This will not only help in faster access to the document, but will also help in grouping similar documents together and in analysing them.

Note:

Currently this repositry is using predefined database of news articles gathered by web scraping. Due to the github restrictions on uploading the large files, we cannot upload it here.

Soon, we will add the support of the dynamic databases, so that you can use this tool for your own databases to build your own custom search engine.

Technologies Used

Python3.6 JavaScript HTML/CSS

Database Used:

SQlite

For implementing searching:

Various NLP(Natural Language Processing) techniques is used.

For website(Local Host):

  • Python-based Web framework : Flask
  • JavaScript/JQuery

Program Flow

Trulli

Compatibility

  • Backend (AI part) is compatible on any machine that has python and required dependencies installed.
  • Recommended browsers: Mozilla Firefox and Google Chrome.

How to Install and Use?

> mkdir IntelligentDocumentFinder

> cd IntelligentDocumentFinder

> git clone https://github.com/Sarthakjain1206/Intelligent-Document-Finder

Install Vitual Environment if not installed

  • On Linux/MacOs > python3 -m pip install --user virtualenv
  • On windows > py -m pip install --user virtualenv

Create Virtual Environment

  • On macOS and Linux: > python3 -m venv env
  • On Windows: > py -m venv env

Activate Environment:

  • On macOS and Linux: > source env/bin/activate
  • On Windows: > .\env\Scripts\activate

> pip install -r requirements.txt

Download Glove Word Embeddings from this link, decompress it and copy the glove.6B.100d file in DataBase folder

then, run initial_file.py through this command > python initial_file.py

Now you are good to go.. Just type this command everytime you want to access it, and open the website in chrome/firefox
> python src/app.py