Skip to content

Manu10744/Stream-Analytics

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Streaming Analytics for Apache Kafka and Apache Flink

Streaming Analytics Project for the Course "Advanced Analytics and Machine Learning"

  • Summer Term 2021 | Ludwig-Maximilians-Universität München

Collaborators:

  • Giacomo May
  • Manuel Neumayer

Dataset:

  • ~14.500 Tweets from the Twitter Streaming API
LRZ Cloud:

Apache Kafka and Flink are evaluated using VMs on the LRZ Cloud:


The purpose of this project is to analyze and compare two famous Streaming Platforms, Apache Kafka and Apache Flink, regarding metrics like Throughput, Latency, Processing Speed and Scalability in a both Non-Parallel and Parallel Streaming Scenario.

The results are documented in a conference paper.

Terminology

  • Throughput: Amount of MBs sent per unit time (e.g. second)
  • Latency: Amount of elapsed time between the point of sending a stream object and receiving it

Useful reads

About

Performance Analysis of Apache Kafka and Apache Flink Streaming using VMs on the LRZ Cloud.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published