Skip to content

isaacmg/twitterStream

Repository files navigation

Custom Twitter Stream and Analysis with Apache Flink

Build Status

This repository is meant to show how to implement Twitter streaming with Apache Flink and output tweets to KafkaTableSink. By default Apache Flink gets all Tweets in real time, however by implementing the EndPointInitializer you can get just the relevant tweets. This is what Filter.java does. To use Filter.java for instance, you would do something like the following

List relevantWords<String> = new ArrayList<Strings>;
relevantWords.add("politics");
List<Location> relevantLocations = new ArrayList<Location>;
List<Long> userIDS = new ArrayList<Long>;
Filter f = new Filter(relevantWords,relevantLocations,relevantPeople);
// Alternatively you could also pass null
Filter f = new Filter(relevantWords,null,null);

Feel free to reach out if you have any questions/issues.

Article on using Flink to build a dashboard

Additional Information

About

Streaming specific Tweets with Apache Flink

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published