python dependencies are not downloaded along with the spark package #218

Open
skeller88 opened this issue Dec 27, 2019 · 2 comments

@skeller88

When I run a Spark job with this library downloaded as a package, I get an error that tensorflow is not found. I expected that downloading the library as a package would also pull in its required Python dependencies. If that's not the case, what's the recommended way to include them?

There is a lot of discussion on approaches to handling pyspark dependencies:

This question is a more general version of my other question about Dataproc.

@Ben-Epstein

Can you post your stack trace? It's possible that the Spark executors, rather than the master, are the ones missing the dependencies. Can you also post your environment setup?
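One way to narrow this down is to check where tensorflow is actually importable. The sketch below is only an illustration, not something from sparkdl: `module_available` and `probe` are hypothetical helper names, and it assumes the module of interest is `tensorflow`. It uses only the standard library, so it runs on the driver without Spark.

```python
import importlib.util
import socket


def module_available(name):
    """Return True if the top-level module `name` can be found by this Python process."""
    return importlib.util.find_spec(name) is not None


def probe(_, module="tensorflow"):
    """Return (hostname, availability of `module`) for whichever process runs this.

    The first argument is ignored so the function can be passed to an RDD map.
    """
    return socket.gethostname(), module_available(module)


# Driver-side check: no Spark needed for this call.
print(probe(None))
```

Assuming an active SparkContext named `sc`, something like `sc.parallelize(range(8), 8).map(probe).distinct().collect()` should run the same probe inside the executors' Python workers. If the driver reports `True` and any executor reports `False`, the package is missing on the workers rather than on the machine that submits the job.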

@spark-water

I understand your question is about dependencies in general. In this particular case, installing tensorflow would make the error go away: sparkdl is unable to find the tensorflow backend, hence the error.
