Online/streaming LDA? #8

Open
carbocation opened this issue Aug 25, 2018 · 1 comment
Comments

@carbocation

Is it possible to run LDA (or other processing algorithms) in a streaming/online fashion, as gensim does? It seems that the library would not easily support online processing, but I thought I'd bounce the question off you since you know the internals much better.

@james-bowman
Owner

Great question. All of the algorithms can be applied in an online setting, but the majority require batch training in advance. Some, like the LDA and RI algorithms, could be made to support online training with a small amount of effort. The HashingVectoriser doesn't require training, so it is particularly suited to streaming data. I will take a look and see if I can add online training and persistence support. In the meantime, Pull Requests are welcome :-)
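
To illustrate why a hashing vectoriser needs no training pass, here is a minimal sketch of the hashing trick. It is written in Python with only the standard library and is not the library's own HashingVectoriser or its API. The names `hash_vectorise`, `stream_vectors` and `NUM_FEATURES`, the tokenising regex, and the choice of CRC32 are all assumptions made for the example.

```python
import re
import zlib

# Hypothetical feature count, kept small so the output is easy to read.
NUM_FEATURES = 16


def hash_vectorise(doc, num_features=NUM_FEATURES):
    """Map a document to a fixed-length term-count vector via the hashing trick."""
    vec = [0] * num_features
    for token in re.findall(r"[a-z0-9]+", doc.lower()):
        # crc32 is deterministic across runs, unlike Python's salted built-in hash().
        index = zlib.crc32(token.encode("utf-8")) % num_features
        vec[index] += 1
    return vec


def stream_vectors(docs):
    """Vectorise documents one at a time as they arrive; there is no fit step."""
    for doc in docs:
        yield hash_vectorise(doc)


if __name__ == "__main__":
    stream = iter(["the quick brown fox", "the lazy dog", "a fox and a dog"])
    for vec in stream_vectors(stream):
        print(vec)
```

Because the column for each term comes from a hash rather than from a learned vocabulary, every document can be vectorised independently, which is what makes the approach a natural fit for streams. The trade-off is that distinct terms can collide in the same column, and the mapping cannot be inverted to recover the terms.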
