normalesp

Source code for the paper: J. A. Cerón-Guzmán and E. León-Guzmán (2016), Lexical Normalization of Spanish Tweets. NLPIT 2016 (WWW'16 Companion). DOI: 10.1145/2872518.2890558.

normalesp is an open-source spell checker for Spanish text, with a focus on tweets. It has been released into the public domain to enable reproducibility and foster research in the area.

If you use this software in research, please cite:

@inproceedings{Ceron16,
    author = {Cer\'{o}n-Guzm\'{a}n, Jhon Adri\'{a}n and Le\'{o}n-Guzm\'{a}n, Elizabeth},
    title = {Lexical Normalization of Spanish Tweets},
    booktitle = {Proceedings of the 25th International Conference Companion on World Wide Web},
    series = {WWW'16 Companion},
    year = {2016},
    location = {Montr\'{e}al, Qu\'{e}bec, Canada},
    pages = {605--610},
    publisher = {International World Wide Web Conferences Steering Committee},
}

Compatibility

normalesp was developed in Python 2.7 on Debian 7 "wheezy" and has been tested on Debian 7 "wheezy", Debian 8 "jessie", and Debian 9 "stretch". It should therefore run on other Linux systems that provide Python 2.7, although only these Debian releases are listed as tested.
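
As a quick way to compare a machine against the requirements stated above, the following minimal sketch reports whether the running interpreter is Python 2.7 and whether the operating system is Linux. The helper name `compatibility_warnings` is hypothetical and is not part of normalesp; it uses only the standard `sys` and `platform` modules and does not import or run normalesp itself.

```python
from __future__ import print_function

import platform
import sys


def compatibility_warnings(version_info=None, system_name=None):
    """Return a list of warnings about the running environment.

    Hypothetical helper for illustration only; it is not shipped with
    normalesp. Both arguments default to the values of the current
    interpreter and operating system.
    """
    if version_info is None:
        version_info = sys.version_info
    if system_name is None:
        system_name = platform.system()

    warnings = []

    # The README states that normalesp was developed in Python 2.7.
    if tuple(version_info[:2]) != (2, 7):
        warnings.append(
            "Python %d.%d detected; normalesp was developed in Python 2.7."
            % (version_info[0], version_info[1])
        )

    # The README lists only Debian (Linux) releases as tested.
    if system_name != "Linux":
        warnings.append(
            "%s detected; normalesp has only been tested on Debian Linux."
            % system_name
        )

    return warnings


if __name__ == "__main__":
    messages = compatibility_warnings()
    if messages:
        for message in messages:
            print("WARNING:", message)
    else:
        print("Environment matches the documented requirements.")
```

An empty list means the interpreter and operating system match what the README describes. It does not guarantee that the dependencies described in the /docs directory are installed.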

Installation and Usage

Please refer to the /docs directory.