kume

kume is a visualization of image segmentation and color quantization using k-means clustering

Background

The k-means clustering unsupervised algorithm is used to partition a data set into k groups. Each data point is assigned to a particular cluster based on mean positions.

The algorithm randomly initializes k data points as the initial means with the standard Forgy method.

First, each pixel is assigned to the cluster with the smallest color distance to its centroid. Then, the centroids of each cluster are recomputed. Since the centroid is the mean data point of the cluster's pixels, it is the cluster's average color, and one of the image's prominent colors. The algorithm repeatedly finds better centroids by reassigning and repartitioning the data set until the assignments no longer change.

Image segmentation is the process of partitioning an image into sets of pixel segments, and is often applied in medical imaging, surveillance, and recognition tasks like face recognition. K-means clustering is an iterative algorithm that can be applied to accomplish image segmentation by color quantization.

Color quantization is the method of reducing the number of distinct colors used in an image to only the dominant colors. Each pixel in the image is assigned and reassigned to k clusters until the distance between the pixel and its cluster's centroid cannot be further minimized.

The CIE L*a*b* color space describes all visible colors to humans. The L* component describes lightness, the a* channel for red-green, and b* for blue-yellow. Changes in CIELAB channels are intended to mimic the responses of the human eye. CIELAB is perceptually uniform because uniform changes in L*a*b* components correspond to uniform changes in color as perceived by humans & matched by Euclidean distance.

Usage

Choose a value of k and a sample image. Image pixels are plotted to an HTML5 Canvas. The visual 2D representation shows the colors in the sRGB gamut (x-axis a*, y-axis b*) and has some pixel overlap since it omits a separate component for L*.

k colors are initialized as centroids. Each centroid is represented by its mean color for the cluster with an SVG & tooltip. Hover over a centroid to see information about the color.

Clusters and centroids are repeatedly assigned and updated until convergence. D3 handles animating each iteration.

When there are no more new centroid assignments, the cluster data is used to draw a quantized version of the sample image using only k prominent colors.

Technologies

D3v4 for plotting & color space conversion

Planned Features

Handling uploads & scaling
Visualizing & interacting with the Voronoi cells created by the algorithm
Interpolating pixels from the image - Inspiration
Chromaticity diagram with a fixed illuminant for a truer representation
3D Plotting color space components - Inspiration
Better-performing initialization methods for more consistent results
- e.g. k-means++

References

K-means 1, 2, 3, 4, 5
Color spaces 1, 2, 3, 4
Image segmentation 1
Delta E*ab CIE76 color distance
Canvas pixel manipulation 1

Name		Name	Last commit message	Last commit date
Latest commit History 44 Commits
public/assets		public/assets
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
index.html		index.html
package.json		package.json
webpack.config.js		webpack.config.js

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

public/assets

public/assets

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

index.html

index.html

package.json

package.json

webpack.config.js

webpack.config.js

Repository files navigation

kume

Background

Usage

Technologies

Planned Features

References

About

Releases

Packages

Languages

License

agarun/kume

Folders and files

Latest commit

History

Repository files navigation

Background

Usage

Technologies

Planned Features

References

About

Topics

Resources

License

Stars

Watchers

Forks

Languages