sbordya/spark-arff-source

spark-arff-source

How to run this project (run all commands from the root of the repository):

  1. Check the default paths in src/main/resources/application.conf and override them if necessary:
input-path = "./input/*.arff"           // input path for arff files
output-path = "./output"                // output path for parquet files
  2. Build a jar with dependencies (you'll need sbt for this):
sbt assembly
  3. Run the ingestion job:
java -cp target/scala-2.12/spark-arff-source-assembly-0.1.jar handson.Ingestion
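
Before building and running, it can help to confirm that the input directory actually contains .arff files. The following is a minimal pre-flight sketch, assuming the default ./input location from application.conf. The function names count_arff and check_input are hypothetical helpers for illustration and are not part of this repository.

```shell
#!/usr/bin/env bash
# Hypothetical pre-flight helpers; not part of the repository.

# Print the number of regular files ending in .arff directly inside a directory.
count_arff() {
  local dir="$1" n=0 f
  for f in "$dir"/*.arff; do
    # An unmatched glob stays literal, so test that the file really exists.
    if [ -f "$f" ]; then
      n=$((n + 1))
    fi
  done
  echo "$n"
}

# Succeed only if the directory (default: ./input) holds at least one .arff file.
check_input() {
  local dir="${1:-./input}" n
  n="$(count_arff "$dir")"
  if [ "$n" -eq 0 ]; then
    echo "no .arff files found in $dir" >&2
    return 1
  fi
  echo "found $n .arff file(s) in $dir"
}

# Possible usage, chaining the check with the build and run steps above:
#   check_input ./input && sbt assembly && \
#     java -cp target/scala-2.12/spark-arff-source-assembly-0.1.jar handson.Ingestion
```

If input-path in application.conf points somewhere else, pass that directory to check_input instead of ./input.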
