stripathi08/pos_cmism

POS Tagging for CMISM, ICON 2016

Please find the details of the shared task here.

Update: The SMPOST Python module is now available.


Running the code

  • Install CRF++ and pycrfsuite (the python-crfsuite package) before running the code.
  • Add your training and test files to Resources/training_data and testing_data, respectively.
  • Training file name pattern: domainName_langPair_FinerOrCoarser.txt, for example FB_HI_EN_FN.txt
  • Test file name pattern: domainName_langPair_Test_Raw.txt, for example FB_HI_EN_Test_Raw.txt
  • If your files use different names, update the file names read in main_train_frame in main.py.
  • Set the language pair, the mode, and the classifier mode in main.py. The classifier modes are crf++ and pycrf.
  • Make the corresponding file name changes in transforms.py.
  • For final testing, we used only the crf++ module, as its cross-validation (CV) results were better than those of pycrfsuite.
  • Run main.py.

Citing the paper

Reporting Doubts and Errors

  • For any queries, please contact me at stripathi1770@gmail.com.
  • Please refer to the publication for detailed results.

About

POS Tagging system for code-mixed Indian Social Media text
