A monorepo microservice setup with full support for CI/CD, Kafka as a message broker, and inter-service APIs using a service discovery endpoint.

go-kafka-example

Prerequisites

  • Docker
  • ZooKeeper
  • Kafka
  • AWS Copilot

This is a monorepo setup with support for CI/CD. The applications in the monorepo are based on go-template.

The producer publishes messages to 2 Kafka topics: issued-token and new-user-created. The consumer consumes these messages. This is a working setup for message brokering in Go using Kafka in a microservice environment.
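
As a rough illustration of that flow, here is a minimal sketch using the segmentio/kafka-go client. The actual services are generated from go-template and may use a different Kafka client; the broker address, payload, and group ID below are assumptions.

```go
package main

import (
	"context"
	"log"

	"github.com/segmentio/kafka-go"
)

func main() {
	ctx := context.Background()

	// Producer side: publish a record to the new-user-created topic.
	// The Hash balancer sends records with the same key to the same partition.
	writer := &kafka.Writer{
		Addr:     kafka.TCP("localhost:9092"),
		Topic:    "new-user-created",
		Balancer: &kafka.Hash{},
	}
	defer writer.Close()

	err := writer.WriteMessages(ctx, kafka.Message{
		Key:   []byte("user-123"),
		Value: []byte(`{"id":"user-123","email":"user@example.com"}`),
	})
	if err != nil {
		log.Fatal(err)
	}

	// Consumer side: read records from the same topic as part of a consumer group.
	reader := kafka.NewReader(kafka.ReaderConfig{
		Brokers: []string{"localhost:9092"},
		GroupID: "consumer-svc",
		Topic:   "new-user-created",
	})
	defer reader.Close()

	msg, err := reader.ReadMessage(ctx) // commits the offset for the group automatically
	if err != nil {
		log.Fatal(err)
	}
	log.Printf("partition=%d offset=%d value=%s", msg.Partition, msg.Offset, msg.Value)
}
```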

The consumer also exposes an API that makes an inter-service API call to the producer to fulfil the request.

The applications are deployed on AWS ECS using AWS Copilot. They run as 2 services in the same cluster and are therefore able to use the service discovery DNS for inter-service communication.

Run the application

  1. Start ZooKeeper: zookeeper-server-start /usr/local/etc/kafka/zookeeper.properties
  2. Start the Kafka service: kafka-server-start /usr/local/etc/kafka/server.properties
  3. Start the consumer: cd consumer && go run cmd/server/main.go
  4. Start the producer: cd ../producer && go run cmd/server/main.go

Optional

Use a Kafka visualizer tool. I use this one

java -jar ~/wednesday/kafka-visualizer/rest/target/rest-1.0-SNAPSHOT.jar --zookeeper=localhost:2181 --kafka=localhost:9092 --env=UAT

Kafka CLI commands

  • describe all consumer groups, topics, and partition details
kafka-consumer-groups --bootstrap-server localhost:9092 --all-groups --describe

Config

Kafka's power shows at scale, which is possible because a topic can have multiple partitions. Use the config files found in docs/config to leverage this to some extent. I have 8 partitions set up.
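
If you create the topics yourself rather than relying on auto-creation, a command along these lines sets up 8 partitions per topic (topic name taken from this repo; on older Kafka versions you may need --zookeeper localhost:2181 instead of --bootstrap-server):

kafka-topics --bootstrap-server localhost:9092 --create --topic new-user-created --partitions 8 --replication-factor 1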

Here is a small write-up explaining how Kafka works:

  • In Kafka the main queue is broken down into many subqueues. Each of these subqueues is called a partition.
  • A server that holds one or more partitions is called a broker. Each item in a partition is called a record.
  • The field that decides which partition a record is stored in is called the key.
  • If no key is specified, a random partition is assigned.
  • A group of partitions handling the same kind of data is called a topic.
  • An offset is a sequential number assigned to each record within a partition.
  • A record in a topic is identified by a partition number and an offset.
  • Having one consumer per partition guarantees ordering within a partition (and hence per key). Consumers can be scaled easily and without much performance or cost impact.
  • This is because Kafka only needs to maintain the latest offset read by each consumer; here we let Kafka handle committing.
  • Typically consumers read one record at a time and pick up where they left off after a restart; here we handle committing manually (see the sketch after this list).
  • It's quite common to have consumers read all the records from the beginning on startup.
  • Consumers in a consumer group do not share partitions, so each consumer reads different records from the other consumers.
  • Multiple consumer groups are useful when different applications need to read the same content.
  • Kafka has retention policies; for example, records older than the configured retention period (say 24 hours) are cleaned up.
  • Kafka can also store all records on persistent storage. This makes it fault-tolerant and durable, so if a broker goes down it can recover when it comes back up.
  • Replication factor: Kafka replicates partitions so that when a broker goes down, a backup partition takes over and processing can resume. A replication factor of 3 means there are 3 copies of a partition: 1 leader and 2 backups, which means we can tolerate up to 2 brokers going down at the same time.
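
To make the committing notes above concrete, here is a hedged sketch of a consumer-group reader that fetches a record, processes it, and only then commits the offset manually. It uses segmentio/kafka-go; the repo itself may use a different client, and the group ID and topic are assumptions.

```go
package main

import (
	"context"
	"log"

	"github.com/segmentio/kafka-go"
)

func main() {
	ctx := context.Background()

	// Readers that share a GroupID split the topic's partitions between them,
	// so two instances of this program would each own a subset of the 8 partitions.
	reader := kafka.NewReader(kafka.ReaderConfig{
		Brokers: []string{"localhost:9092"},
		GroupID: "consumer-svc",
		Topic:   "issued-token",
	})
	defer reader.Close()

	for {
		// FetchMessage does not commit; the group's offset only advances after
		// CommitMessages, so a crash before the commit re-delivers the record on restart.
		msg, err := reader.FetchMessage(ctx)
		if err != nil {
			log.Fatal(err)
		}

		log.Printf("topic=%s partition=%d offset=%d key=%s", msg.Topic, msg.Partition, msg.Offset, msg.Key)

		if err := reader.CommitMessages(ctx, msg); err != nil {
			log.Fatal(err)
		}
	}
}
```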

Accessing the APIs

The producer and the consumer services come with out-of-the-box support for GraphQL Playground. However, if you would like to generate Postman collections, you can use the graphql-testkit utility written by the folks @wednesday-solutions.

Inter-Service APIs

If service discovery endpoints are configured for the consumer and producer services, inter-service requests are possible. Check out this example of inter-service communication between the consumer and the producer services: the consumer exposes a route (/ping) which, on a GET request, makes a GET request to the producer service's (/producer-svc/ping-what) route. The consumer interprets the producer's response and serves the result.

Provide the producer's service discovery endpoint as an environment variable (PRODUCER_SVC_ENDPOINT), which is used by the consumer. When developing locally, set the environment variable to the local endpoint of the producer server.
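
A rough sketch of that flow using only the standard library is shown below. The real services are generated from go-template and likely wire this up through their own HTTP framework; the listen port is an assumption.

```go
package main

import (
	"io"
	"log"
	"net/http"
	"os"
)

func main() {
	// e.g. PRODUCER_SVC_ENDPOINT=http://localhost:9000 locally, or the Copilot
	// service discovery DNS name when running on ECS.
	producerEndpoint := os.Getenv("PRODUCER_SVC_ENDPOINT")

	http.HandleFunc("/ping", func(w http.ResponseWriter, r *http.Request) {
		// Call the producer's route through the service discovery endpoint.
		resp, err := http.Get(producerEndpoint + "/producer-svc/ping-what")
		if err != nil {
			http.Error(w, err.Error(), http.StatusBadGateway)
			return
		}
		defer resp.Body.Close()

		// Relay the producer's response back to the caller.
		w.WriteHeader(resp.StatusCode)
		io.Copy(w, resp.Body)
	})

	log.Fatal(http.ListenAndServe(":9001", nil))
}
```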

Built-in retry mechanism

If there is an issue while processing an incoming message, we write the message to a side topic. The consumer of the side topic retries messages a fixed number of times with an exponential back-off interval. If the message still isn't processed after that, it's sent to a dead-letter queue.
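
A minimal sketch of the main consumer's error path is below, using segmentio/kafka-go. The side-topic name, the process helper, and the client itself are assumptions rather than the repo's actual code.

```go
package main

import (
	"context"
	"errors"
	"log"

	"github.com/segmentio/kafka-go"
)

// process stands in for the real message handler.
func process(msg kafka.Message) error {
	if len(msg.Value) == 0 {
		return errors.New("empty payload")
	}
	return nil
}

func main() {
	ctx := context.Background()

	reader := kafka.NewReader(kafka.ReaderConfig{
		Brokers: []string{"localhost:9092"},
		GroupID: "consumer-svc",
		Topic:   "issued-token",
	})
	defer reader.Close()

	// Side topic: its own consumer retries with exponential backoff and, after the
	// configured number of attempts, forwards the message to the dead-letter topic.
	sideTopic := &kafka.Writer{
		Addr:  kafka.TCP("localhost:9092"),
		Topic: "issued-token-retry",
	}
	defer sideTopic.Close()

	for {
		msg, err := reader.ReadMessage(ctx)
		if err != nil {
			log.Fatal(err)
		}

		if err := process(msg); err != nil {
			// Hand the failed record to the side topic instead of blocking the main topic.
			if werr := sideTopic.WriteMessages(ctx, kafka.Message{Key: msg.Key, Value: msg.Value}); werr != nil {
				log.Printf("failed to write to side topic: %v", werr)
			}
		}
	}
}
```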

To prevent message bloat, and so that each of the individual consumers can handle updating the count, the scheduler updates the count in Redis keyed by a hash of (topic, message). We don't use the Kafka key because it may have been randomly generated, in which case you keep getting a unique hash; the (topic, message) hash should be unique. If the same message is being sent multiple times, something is wrong.

When retrying for the first time, we schedule a job that executes after expBackOff(retryCount + 1) to drop that key from the cache.
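
A hedged sketch of that bookkeeping: the counter key is a hash of (topic, message) rather than the Kafka key, and a delayed job drops it after the back-off window. The go-redis client and time.AfterFunc scheduling below are assumptions; the repo's scheduler may work differently.

```go
package main

import (
	"context"
	"crypto/sha256"
	"encoding/hex"
	"math"
	"time"

	"github.com/redis/go-redis/v9"
)

func expBackOff(retryCount int64) time.Duration {
	return time.Duration(math.Pow(2, float64(retryCount))) * time.Second
}

// retryKey hashes topic+message so identical messages map to the same counter,
// even when the Kafka key is randomly generated.
func retryKey(topic string, message []byte) string {
	sum := sha256.Sum256(append([]byte(topic+":"), message...))
	return "retry:" + hex.EncodeToString(sum[:])
}

func main() {
	ctx := context.Background()
	rdb := redis.NewClient(&redis.Options{Addr: "localhost:6379"})

	key := retryKey("issued-token", []byte(`{"id":"user-123"}`))

	// Bump the retry count for this (topic, message) pair.
	retryCount, err := rdb.Incr(ctx, key).Result()
	if err != nil {
		panic(err)
	}

	// On the first retry, schedule a job that drops the key once the
	// back-off window for the next attempt has passed.
	if retryCount == 1 {
		time.AfterFunc(expBackOff(retryCount+1), func() {
			rdb.Del(context.Background(), key)
		})
	}

	// Keep the process alive long enough for the scheduled drop to run (sketch only).
	time.Sleep(expBackOff(retryCount+1) + time.Second)
}
```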
