I have 20 bag files adding up to 1 TB. All my bag files are stored in HDFS already. I have a DC/OS cluster running containers (mesosphere/spark:2.3.0-2.2.1-2-hadoop-2.7). I am trying to run ros_hadoop as a distributed service to extract data from the 20 bag files in an automated way.
Is there a way to generate the chunkIdx during runtime and pass it to newAPIHadoopFile?
If not, can the idx.bin file be stored in HDFS and presented as "hdfs://...idx.bin" to chunkIdx?
Thanks!
msashokkumar changed the title from "Generate chunkIdx at rather than a file input" to "Generate chunkIdx at runtime rather than a file input" on Jun 29, 2018
msashokkumar changed the title from "Generate chunkIdx at runtime rather than a file input" to "Generate chunkIdx at runtime rather than as a file input" on Jun 29, 2018