Skip to content

Latest commit

 

History

History
11 lines (10 loc) · 705 Bytes

File metadata and controls

11 lines (10 loc) · 705 Bytes

Information retrieval -Text-mining-

This is a full version on how to creat a search engine using python . Text-minig \ TF IDF \ Textual data manipulation \ Boolean modal , Vector space modal \ Cosine similarity between the text files . You find inside the PDF file all the process that i have followed . \

Version 1 :

In the first version ( codes.py ) i have implimented the IR system from scratch , the matrices , the vecotr model space and the TF IDF , all of the are implimented from scratch .

You need to :

1 - Update the path file to your text files .
2 - Spyder 3 is recommended here .

Version 2 :

In the second version , i have used predifined Python packages and libraries .