You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi! I am trying to set up an sparklyr job in EMR. I am using YARN Docker-mode (YARN_CONTAINER_RUNTIME_TYPE=docker) and deploying the job with spark-submit:
When running the execution is stuck with this messages:
23/04/03 12:22:01 INFO sparklyr: Session (55821) is starting under 127.0.0.1 port 8880
23/04/03 12:22:01 INFO sparklyr: Session (55821) found port 8880 is available
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) is waiting for sparklyr client to connect to port 8880
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) accepted connection
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) is waiting for sparklyr client to connect to port 8880
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) received command 0
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) found requested session matches current session
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) is creating backend and allocating system resources
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) is using port 8881 for backend channel
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) created the backend
23/04/03 12:22:01 INFO sparklyr: Gateway (55821) is waiting for R process to end
23/04/03 12:22:02 INFO HiveConf: Found configuration file null
23/04/03 12:22:02 INFO SparkContext: Running Spark version 3.3.2
23/04/03 12:22:02 INFO ResourceUtils: ==============================================================
23/04/03 12:22:02 INFO ResourceUtils: No custom resources configured for spark.driver.
23/04/03 12:22:02 INFO ResourceUtils: ==============================================================
23/04/03 12:22:02 INFO SparkContext: Submitted application: sparklyr
23/04/03 12:22:02 INFO ResourceProfile: Default ResourceProfile created, executor resources: Map(cores -> name: cores, amount: 1, script: , vendor: , memory -> name: memory, amount: 1024, script: , vendor: , offHeap -> name: offHeap, amount: 0, script: , vendor: ), task resources: Map(cpus -> name: cpus, amount: 1.0)
23/04/03 12:22:02 INFO ResourceProfile: Limiting resource is cpus at 1 tasks per executor
23/04/03 12:22:02 INFO ResourceProfileManager: Added ResourceProfile id: 0
23/04/03 12:22:02 INFO SecurityManager: Changing view acls to: hadoop,root
23/04/03 12:22:02 INFO SecurityManager: Changing modify acls to: hadoop,root
23/04/03 12:22:02 INFO SecurityManager: Changing view acls groups to:
23/04/03 12:22:02 INFO SecurityManager: Changing modify acls groups to:
23/04/03 12:22:02 INFO SecurityManager: SecurityManager: authentication disabled; ui acls disabled; users with view permissions: Set(hadoop, root); groups with view permissions: Set(); users with modify permissions: Set(hadoop, root); groups with modify permissions: Set()
23/04/03 12:22:02 INFO Utils: Successfully started service 'sparkDriver' on port 40419.
23/04/03 12:22:02 INFO SparkEnv: Registering MapOutputTracker
23/04/03 12:22:02 INFO SparkEnv: Registering BlockManagerMaster
23/04/03 12:22:02 INFO BlockManagerMasterEndpoint: Using org.apache.spark.storage.DefaultTopologyMapper for getting topology information
23/04/03 12:22:02 INFO BlockManagerMasterEndpoint: BlockManagerMasterEndpoint up
23/04/03 12:22:03 INFO SparkEnv: Registering BlockManagerMasterHeartbeat
23/04/03 12:22:03 INFO DiskBlockManager: Created local directory at /mnt/yarn/usercache/root/appcache/application_1680508493561_0044/blockmgr-49a3dae7-e16e-4332-8e4f-e92bfbed66dd
23/04/03 12:22:03 INFO DiskBlockManager: Created local directory at /mnt1/yarn/usercache/root/appcache/application_1680508493561_0044/blockmgr-22600d14-035e-4ce8-aaaf-d05919f739ef
23/04/03 12:22:03 INFO MemoryStore: MemoryStore started with capacity 434.4 MiB
23/04/03 12:22:03 INFO SparkEnv: Registering OutputCommitCoordinator
23/04/03 12:22:03 INFO Utils: Successfully started service 'SparkUI' on port 4040.
23/04/03 12:22:03 INFO SparkContext: Added JAR file:/usr/local/lib/R/site-library/sparklyr/java/sparklyr-master-2.12.jar at spark://ip-10-5-78-242.eu-west-1.compute.internal:40419/jars/sparklyr-master-2.12.jar with timestamp 1680524522444
23/04/03 12:22:03 INFO DefaultNoHARMFailoverProxyProvider: Connecting to ResourceManager at /0.0.0.0:8032
23/04/03 12:22:03 INFO DefaultNoHARMFailoverProxyProvider: Connecting to ResourceManager at /0.0.0.0:8032
23/04/03 12:22:04 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:05 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:06 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:07 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:08 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:09 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:10 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:11 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:12 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:13 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:14 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 0 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:15 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 1 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:16 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 2 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:17 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 3 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:18 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 4 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:19 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 5 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:20 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 6 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:21 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 7 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:22 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 8 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:23 INFO Client: Retrying connect to server: 0.0.0.0/0.0.0.0:8032. Already tried 9 time(s); retry policy is RetryUpToMaximumCountWithFixedSleep(maxRetries=10, sleepTime=1000 MILLISECONDS)
23/04/03 12:22:23 INFO RetryInvocationHandler: java.net.ConnectException: Your endpoint configuration is wrong; For more details see: http://wiki.apache.org/hadoop/UnsetHostnameOrPort, while invoking ApplicationClientProtocolPBClientImpl.getNewApplication over null after 1 failover attempts. Trying to failover after sleeping for 22758ms.
The same deployment works using SparkR:
library(SparkR)
sparkR.session(appName="app-name")
sqlContext<- sparkRSQL.init(spark.sparkContext);
df<- sql("SELECT 1 AS A")
print(df)
This are the environment variables in the Docker container deployed by YARN:
Hi! I am trying to set up an
sparklyr
job in EMR. I am using YARN Docker-mode (YARN_CONTAINER_RUNTIME_TYPE=docker
) and deploying the job withspark-submit
:When running the execution is stuck with this messages:
The same deployment works using SparkR:
This are the environment variables in the Docker container deployed by YARN:
And the Spark config loaded by SparkR:
How can I make
sparklyr
run on this set-up?The text was updated successfully, but these errors were encountered: