Cache directory with pre processed models is hard-coded #207

cabral1888 · 2019-09-30T12:51:04Z

Hello. I am using the sparkdl in a Spark cluster with YARN integrated with Docker. I am having problems related to user home directory when the codes fetch the preprocessed models (like InceptronV3, XCeptron, etc) and stores it into my HOME_DIR. For advanced reasons, YARN doesn't create the user HOME_DIR, and when the library tries to write into this directory, it fails. What I need to do is to change the default behavior to store models in any directory as I want.

Would it be possible to change code behavior to define the cache directory at execution time? For instance, when I instantiate the following class:

featurizer = DeepImageFeaturizer(inputCol="image", outputCol="features", modelName="InceptionV3", cacheDir=<SOME_PATH>)

Obs.: The file with the HOME_DIR hard-coded is: src/main/scala/com/databricks/sparkdl/ModelFetcher.scala on line 40

Best regards!

cabral1888 · 2019-10-02T17:32:55Z

Someone?

cabral1888 changed the title ~~Cache directory with pre processed models hard-coded~~ Cache directory with pre processed models is hard-coded Sep 30, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cache directory with pre processed models is hard-coded #207

Cache directory with pre processed models is hard-coded #207

cabral1888 commented Sep 30, 2019 •

edited

cabral1888 commented Oct 2, 2019

Cache directory with pre processed models is hard-coded #207

Cache directory with pre processed models is hard-coded #207

Comments

cabral1888 commented Sep 30, 2019 • edited

cabral1888 commented Oct 2, 2019

cabral1888 commented Sep 30, 2019 •

edited