Skip to content
This repository has been archived by the owner on May 5, 2021. It is now read-only.

pipeline for no-whitespace language #693

Open
LatteDev opened this issue Jul 31, 2020 · 2 comments
Open

pipeline for no-whitespace language #693

LatteDev opened this issue Jul 31, 2020 · 2 comments

Comments

@LatteDev
Copy link

When create a project , the default tokenize in pipeline is WhitespaceTokenizer,
if my project is base on no-whitespace language(ie:chinese, japanese),
How should I set the pipeline?
Or I didi not need to modify the default pipeline?

@znat
Copy link
Contributor

znat commented Jul 31, 2020

You should be able to use the JiebaTokenizer. However it is not installed by default in Rasa. So you need to add it explicitly to the requirements of the Rasa image built in your Botfront project

@LatteDev
Copy link
Author

@znat
I am confuseing,in the botfront's stories page and nlu page and pipeline page, you can select different language here.
these language option will not help rasa to understand user's input message?
just a tag to mark diffrerent language settings in the same project?

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants