Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to get Input data for the Logistic Regression workload ? #183

Open
Aalnafessah opened this issue Nov 19, 2018 · 0 comments
Open

How to get Input data for the Logistic Regression workload ? #183

Aalnafessah opened this issue Nov 19, 2018 · 0 comments

Comments

@Aalnafessah
Copy link

Spark-Bench version (version number, tag, or git commit hash)

spark-bench-launch-2.1.1_0.2.2-RELEASE

Details of your cluster setup (Spark version, Standalone/Yarn/Local/Etc)

Spark.2.2.1

Scala version on your cluster

Scala version 2.11.8
Spark Cluster: Spark Standalone ( 1 master and 2 slaves).

Description of your problem and any other relevant info

I am working on analyzing Spark performance using as many workloads that I can use. Till this moment, I am able to use the KMeans (data generation and workload). I want to use the Logistic Regression workload but I could not because there is No data generation for Logistic Regression.
I hav tried to run the "legacy version" of spark bench, but i got some issues because my SPark version is 2.2.1 .
IS there any way to get or generate the input data to the Logistic Regression workload?

Many thanks in advance for your help.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant