Distributed counter

This is a distributed service, consisting of multiple isolated processes which can count the number of items, grouped by tenants that are delivered through an HTTP restful interface.

Resource	Description
`POST /items`	add new items
`GET /items/tenantID/count`	return number of items for given tenant

Setup

Build & run coordinator with 3 counters

$ make up

or specify different number of counters

$ make up COUNTERS=5

or if you want to debug

$ make dev

Run tests

$ make test

Show logs from containers

$ make log

Run system simulation

$ make simulate

You can copy example config file to .env and change values in it

$ cp .env.dist .env

# Http port on which coordinator server will listen
HTTP_PORT=8080

# Debugger port
DEBUG_PORT=40000

Design

System consists of

Coordinator which provides an RESTful API.
N number of counters which can be called only from coordinator itself.

Overview

The requirements are focused mostly on data consistency and handling child node failures.
I decided to choose one matching scenario described in the given article (http://book.mixu.net/distsys/abstractions.html).

AP design was not the case because of the consistency requirement.
CP cannot be fully achieved with only one callable main coordinator. Mechanism to choosing one of the counters as master after coordinator failure would be something worth consideration.
CA and Two-Phase commit protocol approach is something I choose for this system.

Disadvantages

Request is synchronous (blocking).
Possibility of deadlock between transactions.
It is not partition tolerant.

The above drawback may lead to system performance bottleneck too. It is sacrifice of efficiency. Using this approach it was also possible to achieve three of the ACID principles: atomicity, consistency and read-write isolation.

Flow

Add counter

When a new counter instance is added it sends request to coordinator to obtain data from other counters.
If a counter goes down and recover it will get data the same way. This ensures data consistency.

Add items

Coordinator sends unique message to all counters.
Counters must make a decision if they can save items.
If one or more counters refuse all will receive request to forget about previous message.

Get count

To get count coordinator sends request to one random counter.
Docker handles requests balancing in that case. It will not call dead nodes.

Health checks

Coordinator performs counters health checks every 10 seconds. Todo: make health check interval configurable.
If a counter not respond or respond with an error it is marked as dead and it is not query-able.
After 4 more unsuccessful responses coordinator removes that counter. Todo: make number of recovery tries configurable.
Docker performs coordinator health checks every 30 seconds.

Possible improvements

RPC or sockets could be used instead of HTTP for communication between coordinator and counters.
Different distributed algorithm like paxos or raft could be implemented for our system to be partition tolerant.

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.img		.img
coordinator		coordinator
counter		counter
.env.dist		.env.dist
.gitignore		.gitignore
.travis.yml		.travis.yml
Dockerfile		Dockerfile
Dockerfile.dev		Dockerfile.dev
Makefile		Makefile
README.md		README.md
docker-compose.yml		docker-compose.yml

agolebiowska/distributed-counter

Folders and files

Latest commit

History

Repository files navigation

Distributed counter

Setup

Design

System consists of

Overview

Flow

Add counter

Add items

Get count

Health checks

Possible improvements

About

Resources

Stars

Watchers

Forks

Languages