Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Part 3: Clustering #30

Open
stemlock opened this issue Nov 10, 2021 · 0 comments
Open

Part 3: Clustering #30

stemlock opened this issue Nov 10, 2021 · 0 comments

Comments

@stemlock
Copy link
Contributor

  1. General Note: May be useful to add some content on dimensionality reduction (e.g., PCA, SVD, LCA).
  2. K-Means Clustering: May want to touch upon the problem of determine the optimal number of clusters and some common heuristics used (e.g., elbow method, silhouette)
  3. Agglomerative Clustering: Change the n_clusters parameter in the model from 3 to 2.
  4. Agglomerative Clustering: In the visualization, we assign a label to the scatter plots, but never call plt.legend() to add the legend.
  5. Challenge DBSCAN: When comparing the inferred clusters to the set of labels in the blobs data, we do not call len(set(labels)) to compare with the inferred number of clusters as we do in the moons data.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant