You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I have searched the existing issues and did not find a match.
Who can help?
No response
What are you working on?
I am using the spark-nlp for NER detection on Azure databricks cluster. The cluster is made of 5 nodes. But when running the job it is not scaling up to use the full cluster and uses only a single node. It seems that the NER pipeline does not parallelize and only runs on a single node.
Current Behavior
The NER pipeline uses only one node of the available 5 nodes.
Expected Behavior
The expected behavior is to fully run on all the worker nodes.
I recommend watching this Webinar, scaling Apache Spark is independent from Spark NLP. You should follow the general "tuning and sizing your cluster" advice in order to utilize all your executors.
I have watched the webinar and came out with great insights. The issue here is not the speed optimization but why sparknlp NER pipeline is not fully utilizing the cluster and using only one worker? Once this issue is solved, I will work on optimizing the spark application.
Is there an existing issue for this?
Who can help?
No response
What are you working on?
I am using the spark-nlp for NER detection on Azure databricks cluster. The cluster is made of 5 nodes. But when running the job it is not scaling up to use the full cluster and uses only a single node. It seems that the NER pipeline does not parallelize and only runs on a single node.
Current Behavior
The NER pipeline uses only one node of the available 5 nodes.
Expected Behavior
The expected behavior is to fully run on all the worker nodes.
Steps To Reproduce
Spark NLP version and Apache Spark
spark-nlp==5.1.4
spark==3.4.1
com.johnsnowlabs.nlp:spark-nlp_2.12:5.1.4
Working on Databricks
Type of Spark Application
Python Application
Java Version
8
Java Home Directory
/usr/lib/jvm/zulu8-ca-amd64/jre/
Setup and installation
Pypi
Operating System and Version
No response
Link to your project (if available)
No response
Additional Information
No response
The text was updated successfully, but these errors were encountered: