This is the code for the VLSP2019-SHARED Task: Hate Speech Detection on Social Networks submitted by team HH_UIT:
- HUY DUC HUYNH
- HANG THI-THUY DO
- KIET VAN NGUYEN
- NGAN LUU-THUY NGUYEN
- ANH GIA-TUAN NGUYEN
- Python(3.6)
- Keras
- Sklearn
- Download Word embedding at link and save at folder Word embedding.
- Implement file Preprocessing.py to make the data clear.(Implement for both csv file and save at Data folder. We have prepared train and test cleaned datasets in Data folder)
- Run main.py (Model and result file will be generated.)
You should use Google Colab to speed up the execution of the program.
The detailed approach is described in the following paper:
Hang Thi-Thuy Do, Huy Duc Huynh, Duc-Vu Nguyen, Kiet Van Nguyen, Ngan Luu-Thuy Nguyen, Anh Gia-Tuan Nguyen. Hate Speech Detection on Vietnamese Social Media Text using the Bidirectional-LSTM Model. The Sixth International Workshop on Vietnamese Language and Speech Processing VLSP 2019 - in conjunction with the international conference PACLING 2019. Link.