Skip to content

karlosos/data_mining

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Data Mining

For development

  1. Create virtual environment with virtualenv .venv.
  2. Activate venv with .venv\Scripts\activate.bat.
  3. Launch jupyter lab with jupyter-lab.

Lab 1

  • basic data selection
  • visualization
  • pandas
  • iris, zoo and autos datasets

Lab 2

  • k-nearest neighbors
  • kd-tree and ball tree
  • generating n-dimensional data linearly separable
  • generating checkerboard

Lab 3

  • k-means
  • fixing permutations - clusterization
  • jaccard
  • PCA visualization
  • Gaussian Mixture
  • Agglomerative Clustering
  • zoo dataset
  • image compression with clusterization

Lab 4

  • markov model
    • words as states
    • letters as states
  • prime ministers exposes dataset

Lab 5