Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Guidance on how to implement subword sampling at train time #103

Open
sooheon opened this issue Jun 14, 2018 · 2 comments
Open

Guidance on how to implement subword sampling at train time #103

sooheon opened this issue Jun 14, 2018 · 2 comments
Labels
sample code Asks toprovide sample code

Comments

@sooheon
Copy link

sooheon commented Jun 14, 2018

I guess I should be re-sampling tokenizations on the train data with SP before each epoch, but it would be nice to see a canonical implementation of this in $FRAMEWORK.

@taku910
Copy link
Collaborator

taku910 commented Jun 16, 2018

will do.

@Diego999
Copy link

Any update on this ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
sample code Asks toprovide sample code
Projects
None yet
Development

No branches or pull requests

3 participants