Trying out Apache Iceberg with Apache Flink using Docker Compose

Use the docker-compose.yml file to create a MariaDB database and an Apache Flink Job and Task manager to work with.

Make sure to add your AWS credentials to the docker-compose.yml file first, so that it will be able to write to s3.

docker compose up -d

Once the containers are running, submit the job to Flink using:

docker exec -it jobmanager /opt/flink/bin/sql-client.sh embedded -f /opt/flink/job.sql

If you open your browser to http://localhost:8081 you'll see the Flink UI with your job running, saving the data from the database to s3 using the Iceberg format

The data in s3 will be in a folder named after the database, in Parquet format.

You can use the AWS CLI to verfiy the data is there:

aws s3 ls s3://my-test-bucket/iceberg/my_database/my_products/data/

If you happen to use DuckDB, you can query the resulting parquet file on s3 to verify the data:

Assuming duckdb is installed, provide it with AWS details:

SET s3_region='us-east-1';
SET s3_access_key_id='AKAIXXXXXXXXXXXX';
SET s3_secret_access_key='XXXXXXXXXXXX';

Then you can query one or more parquet files using:

SELECT * FROM 's3://my-test-bucket/iceberg/my_database/my_products/data/00000-0-b3a04103-6ef1-49fa-9c7b-62194183c3fd-00001.parquet';

You should see an output just like the following table

┌───────┬───────────┬───────────────┐
│  id   │   name    │     price     │
│ int32 │  varchar  │ decimal(10,2) │
├───────┼───────────┼───────────────┤
│     3 │ Product C │         39.99 │
│     2 │ Product B │         29.99 │
│     1 │ Product A │         19.99 │
└───────┴───────────┴───────────────┘

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
jars		jars
jobs		jobs
sql		sql
LICENSE		LICENSE
README.md		README.md
docker-compose.yml		docker-compose.yml
flink.png		flink.png
s3.png		s3.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

jars

jars

jobs

jobs

sql

sql

LICENSE

LICENSE

README.md

README.md

docker-compose.yml

docker-compose.yml

flink.png

flink.png

s3.png

s3.png

Repository files navigation

Trying out Apache Iceberg with Apache Flink using Docker Compose

About

License

gordonmurray/apache_flink_and_iceberg

Folders and files

Latest commit

History

Repository files navigation

Trying out Apache Iceberg with Apache Flink using Docker Compose

About

Topics

Resources

License

Stars

Watchers

Forks