Skip to content

soniclavier/bigdata-notebook

Repository files navigation

Hadoop and ML repository

A repository to hold all my Hadoop and Machine Learning related codes.

Visit my blog at : www.vishnuviswanath.com

Contents

  1. Flink Streaming
  2. Spark ML, Streaming, SQL and GraphX
  3. Kafka Streams
  4. StormKafka streaming application POC
  5. Flume custom source and config files
  6. Hadoop MapReduce old api joins,custom types etc
  7. Solutions for kaggle problems using numpy or graphlab