A datastack playground; includes Spark, Kafka, Airbyte, etc.

makism/datastack-playground

datastack-playground

General

A playground for Apache Spark with MinIO support and Delta Lake integration. Additional tools include Airbyte and Lightdash.

Getting Started

Building

Run the following command to build the Docker images:

./build.sh

Running

Now, bring up the cluster with:

docker-compose up
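
Once the cluster is up, a Spark session has to be told where MinIO lives and that Delta Lake is enabled. The sketch below collects the relevant settings in a plain dictionary. It is an illustration, not this repository's actual configuration: the helper names `delta_minio_conf` and `build_session`, the endpoint `http://localhost:9000`, and the `minioadmin` credentials are all assumptions, and it presumes the Delta Lake and `hadoop-aws` jars are already available in the Spark images.

```python
def delta_minio_conf(endpoint, access_key, secret_key):
    """Return Spark settings for Delta Lake tables on an S3-compatible MinIO endpoint."""
    return {
        # Enable Delta Lake SQL support and its catalog implementation.
        "spark.sql.extensions": "io.delta.sql.DeltaSparkSessionExtension",
        "spark.sql.catalog.spark_catalog": "org.apache.spark.sql.delta.catalog.DeltaCatalog",
        # Route s3a:// paths to MinIO through the Hadoop S3A connector.
        "spark.hadoop.fs.s3a.impl": "org.apache.hadoop.fs.s3a.S3AFileSystem",
        "spark.hadoop.fs.s3a.endpoint": endpoint,
        "spark.hadoop.fs.s3a.access.key": access_key,
        "spark.hadoop.fs.s3a.secret.key": secret_key,
        # MinIO is usually addressed by path (host/bucket), not by virtual host.
        "spark.hadoop.fs.s3a.path.style.access": "true",
    }


def build_session(app_name, conf):
    """Create a SparkSession from a settings dictionary (requires pyspark)."""
    from pyspark.sql import SparkSession  # imported lazily so the dict helper works without Spark

    builder = SparkSession.builder.appName(app_name)
    for key, value in conf.items():
        builder = builder.config(key, value)
    return builder.getOrCreate()


# Assumed placeholder values; replace them with the ones from your compose setup.
conf = delta_minio_conf("http://localhost:9000", "minioadmin", "minioadmin")
```

With `pyspark` installed, `build_session("playground", conf)` would return a session that can read and write `s3a://` paths in Delta format, provided the bucket exists and the values match your setup.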

Extras

Airbyte

Clone and run the Airbyte repo locally; follow the instructions at https://docs.airbyte.com/deploying-airbyte/local-deployment.

CLI

You may need to install the Airbyte CLI (octavia-cli); see https://docs.airbyte.com/understanding-airbyte/airbyte-cli and https://github.com/airbytehq/airbyte/blob/master/octavia-cli/README.md

Lightdash

Clone and run the Lightdash repo locally; follow the instructions at https://docs.lightdash.com/getting-started/quickstart.
