Processing Tweets with Kafka Streams in Scala

The example application consists of two services written in Scala: an ingestion service and an aggregation service. The ingestion service subscribes to the Twitter Streaming API and receives fresh tweets filtered by a list of terms. Each raw tweet is sent to the Kafka topic 'tweets' as JSON. The aggregation service consumes the raw tweets, parses them, and aggregates word counts in tumbling time windows. Kafka Streams uses an embedded RocksDB instance to maintain local state. Every change to the aggregate is propagated to the topic 'aggregate'.
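To make the tumbling-window aggregation more concrete, here is a minimal sketch in plain Scala that uses only the standard library. It is not the code of the aggregation service and it does not use the Kafka Streams API; the names `Tweet`, `windowStart`, `words` and `count` are hypothetical and exist only for this illustration. It assumes non-negative timestamps in epoch milliseconds and a simple split on non-word characters as the tokenizer.

```scala
object TumblingWordCount {
  // Hypothetical minimal tweet model: event time in epoch milliseconds plus the text.
  final case class Tweet(timestampMs: Long, text: String)

  // Start of the tumbling window that contains the given timestamp.
  // Tumbling windows have a fixed size and do not overlap, so every
  // (non-negative) timestamp belongs to exactly one window.
  def windowStart(timestampMs: Long, windowSizeMs: Long): Long =
    timestampMs - (timestampMs % windowSizeMs)

  // Assumed tokenizer: lowercase the text and split on non-word characters.
  def words(text: String): Seq[String] =
    text.toLowerCase.split("\\W+").filter(_.nonEmpty).toSeq

  // Count the occurrences of each word per window, keyed by (window start, word).
  def count(tweets: Seq[Tweet], windowSizeMs: Long): Map[(Long, String), Long] =
    tweets
      .flatMap(t => words(t.text).map(w => (windowStart(t.timestampMs, windowSizeMs), w)))
      .groupBy(identity)
      .map { case (key, occurrences) => key -> occurrences.size.toLong }
}
```

Calling `TumblingWordCount.count(tweets, 60000L)` groups the words into one-minute windows. In a streaming setting the counts are updated incrementally as tweets arrive, rather than computed over a finished batch as in this sketch.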

Both services share the same SBT project and are packaged into the same fat jar, which includes all dependencies. This makes it easy to share code in this small example project. Both applications read the application.conf at runtime via the Settings object. I wrote a small build script that compiles the services, builds the Docker images, and runs the containers.



jpzk/twitterstream
