Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integration with ML Flink #61

Open
rcabanasdepaz opened this issue Oct 20, 2016 · 5 comments
Open

Integration with ML Flink #61

rcabanasdepaz opened this issue Oct 20, 2016 · 5 comments

Comments

@thvasilo
Copy link

Hello Rafael, just ran into this issue :)

Could you give a few more details about your plans?

We are going to start development on a online learning library for Flink soon, so we are looking at our options for what to include in the library and we could also be looking to bring in some of the work that has been done as part of the AMIDST project.

@rcabanasdepaz
Copy link
Contributor Author

Hello Theodore,
this issue is in a very initial phase of development. Our idea is to make possible to use any of the latent variable models provided by AMIDST with FlinkML data structures (e.g., DataSet[LabeledVector]). This functionality will be used from scala. To best of our knowledge, FlinkML cannot be used yet from Java, or at least the whole functionality.

Yet, our toolbox is already integrated with (standard) Flink by means of the module flinklink. With that, you are able to learn and to do inference of PGMs in a cluster environment. More details are given in the documentation of the web:

http://www.amidsttoolbox.com/documentation/0-6-0/examples-060/flinklink-060/

All the about this issue will be publish here. Alternatively, you can also be aware of the news about the toolbox by twitter: https://twitter.com/AmidstToolbox

@thvasilo
Copy link

Cool, let me know if you need any help. If you think some of your work would make sense to be ported to FlinkML, we can talk about that as well. We still don't have a Naive Bayes model for example which I see is included here.

You are right that we don't support Java currently in FlinkML, unfortunately there are no plans to add it in the near future AFAIK.

I'll check out the rest of the toolbox, thanks for the info!

@rcabanasdepaz
Copy link
Contributor Author

The idea of porting some of the functionality in AMIDST to FlinkML sounds good. Do you have any documentation about how contributing to FlinkML?
Clearly it would be interesting porting the Naive Bayes, but also some other classifiers much more powerful.

@thvasilo
Copy link

Sure, our contribution guide is here, if somebody from your team is interested in porting AMIDST code to FlinkML, I'll be able to help them personally.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants