Skip to content

frozsgy/semantle-tr

Repository files navigation

Semantle Türkçe

This repository contains the Turkish version of Semantle.

Initial Setup

  • Create a virtual environment and install required Python modules from requirements.txt.
  • Use the word2vec/train.sh file to train a Turkish Word2Vec model using the Wikipedia corpus.
  • Run python dump-vecs.py to initialize the SQLite database with vectors.
  • Run python dump-hints.py to create hints pickle.
  • Run python store-hints.py to import the hints pickle to the database.

Deployment & Running the App

  • Run docker-compose up -d. The project should be up and running at HTTP port 80.

Notes

This project was forked from the original Semantle repository and some modifications to the dump-hints.py file were cherry picked from the Semantle-es repository.

Known Bugs

  • Since Turkish is an agglutinative language, while looking for similar words you will see lots of words with suffixes. I'm planning to clean up the corpus to work with lemmas or stems and improve the performance.

Want to Contribute?

Go ahead, they're always appreciated!


Made in Ankara with 💙

About

Turkish fork of Semantle

Topics

Resources

License

GPL-3.0, Unknown licenses found

Licenses found

GPL-3.0
LICENSE.md
Unknown
COPYING

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published