Skip to content

hayatoy/dataflow-tutorial

Repository files navigation

Cloud Dataflow (Python!) Tutorial for Beginners

How to use

  • Create Billing account on Google Cloud Platform
  • Enable Dataflow API
  • Open Datalab

Recommended Datalab settings;
datalab create dftutorial --disk-size-gb 10 --no-create-repository --no-backups

If you want to use Jupyter Notebook..

  • Install Dataflow Python SDK
    pip install google-cloud-dataflow
  • Set up default credential. just run the following command.
    gcloud auth application-default login