Skip to content

KristianEka/gas-turbine-emision-clustering

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 
 
 

Repository files navigation

gas-turbine-emision-clustering

Implementing machine learning by comparing the accuracy of the clustering algorithm on turbine gas emissions

Datasets 💾

Algorithms 🤖

  • K-Medoids (PAM)
  • CLARA

Package 📦︎

  • readxl
  • Amelia
  • ggplot2
  • GGally
  • knitr
  • caret (confusion matrix)
  • openintro
  • dplyr
  • cluster (cluster analysis)
  • factoextra (cluster visualization)
  • clValid (cluster validation)

Conclusion 💻︎

image

  • PAM Connectivity is lower than CLARA, that means PAM is better than CLARA
  • PAM Dunn Index is bigger than CLARA, that means PAM is better than CLARA
  • PAM Silhouette is closer to one than CLARA, that means PAM is better than CLARA

However, in terms of speed, CLARA clustering is faster because it is designed to cluster with large amounts of data.

About

Implementing machine learning by comparing the accuracy of the clustering algorithm on turbine gas emissions

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published