Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language Modeling #2077

Merged
merged 136 commits into from
Aug 25, 2018
Merged

Language Modeling #2077

merged 136 commits into from
Aug 25, 2018

Conversation

iliakur
Copy link
Contributor

@iliakur iliakur commented Aug 5, 2018

Ok, after getting some feedback on my previous attempt, I re-worked things a bit.
This time there's tests a-plenty and I've tried to add documentation as well.

I have regression tests for:

Since I didn't add the Simple Good Turing estimator yet, can't say anything about the issues related to that. Which brings me to the next point. I spent most of my effort making sure the basic datastructures are useable and robust, so didn't have as much time to add a lot of model classes. I'm hoping that the infrastructure the module provides, however, makes it easy for folks to create models on their own.

Would be nice to merge this into the next release :)

Happy modeling!

@iliakur
Copy link
Contributor Author

iliakur commented Aug 6, 2018

@alvations @stevenbird The tests that fail currently all have to do with python 2 lacking itertools.accumulate. functools.singledispatch is also probably an issue.

I can easily compensate for accumulate but could we add singledispatch as a dependency? It's a small module, easily installed.

@stevenbird
Copy link
Member

@copper-head thanks for all this... fine to add this dependency.

@iliakur
Copy link
Contributor Author

iliakur commented Aug 16, 2018

Ok, it looks like there's more tweaking that I'll have to do to make python 2.7 happy. I'll get to it soon

@iliakur
Copy link
Contributor Author

iliakur commented Aug 19, 2018

Looks like python 2 is now happy :)
What's the drill with adding oneself to the contributors list these days?

@stevenbird
Copy link
Member

Wow, what a big contribution!

@stevenbird
Copy link
Member

[CI: retest]

@stevenbird stevenbird merged commit 974d816 into nltk:develop Aug 25, 2018
@stevenbird
Copy link
Member

Thanks @copper-head!

stevenbird added a commit that referenced this pull request Aug 25, 2018
@alvations
Copy link
Contributor

Thanks @copper-head!! An awesome update to the model package!!

@iliakur
Copy link
Contributor Author

iliakur commented Aug 25, 2018

lol, only took me... 4 years ;)

@stevenbird
Copy link
Member

How about announcing it on nltk-dev?

@iliakur
Copy link
Contributor Author

iliakur commented Aug 26, 2018

Sure!
Which release will this appear in?

@stevenbird
Copy link
Member

The next one... once it's packaged I hope you can announce your work on nltk-dev

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants