
mllib

Here are 185 public repositories matching this topic...

In this tutorial, I explain SparkContext using the map and filter methods with lambda functions in Python, and create RDDs from objects and external files. I also cover transformations and actions on RDDs and pair RDDs, building PySpark DataFrames from RDDs and external files, querying DataFrames with Spark SQL, and machine learning with PySpark MLlib.

  • Updated Jan 21, 2020
  • Jupyter Notebook
