Skip to content

Latest commit

 

History

History
18 lines (12 loc) · 611 Bytes

goal_of_book.md

File metadata and controls

18 lines (12 loc) · 611 Bytes

Goal of this book: Data Algorithms with Spark

  1. Keep it SIMPLE!

  2. Goal of this book: enable writing efficient & simpler PySpark code for data algorithms using Spark

  3. A lot of working PySpark code is provided so that the reader can understand how to use basic transformations on using RDDs and DataFrames

  4. As much as possible, I have avoided writing complex code and functions: keep it simple so that you can debug easily and your co-workers can understand them.

  5. CUT-and-PASTE: you may take portions of the code and tailor it to your needs