Skip to content

tum-db/tableaupublic

Repository files navigation

Tableau Public workbook workloads

This repository aims to allow extracting SQL workloads from Tableau Public.

Setup

The following assumes MacOS.

Install the Tableau Public app:
http://public.tableau.com/s/download

Install jq to parse the log files postgres to extract the CSV data:

brew install jq

Install python hyper API for conversion scripts:

pip install --user tableauhyperapi

Extracting Data and Queries

To extract the queries from a workbook, open it in the Tableau GUI and click through all tabs. The workbooks.sh automates this for some interesting public workbooks.

Then, use the convert.sh script to convert the Tableau data files to CSVs and dump the schema.

Afterwards, you can load the schema and the data:

-- create the Extract table that tableau uses
\i ahlyAfrica/excel_direct_42363_552682708300.sql
-- and load the csv data
\i ahlyAfrica/load.sql

Now, you can execute queries:

\i ahlyAfrica/queries.sql

Licenses

Original public_bi_benchmark by CWI Data Architectures Group. Tableau API by Tableau. Everything licensed user MIT.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published