leedongwei/data-scooping

data-scooping: flyweight data mining


... but data mining is so fly


Semi-supervised Learning

Algorithms

  • Naive-Bayes
  • Rocchio
  • LELC (PU Learning by Extracting Likely positive and negative micro-Clusters)
    • Li, X., Yu, P. S., Liu, B., & Ng, S. K. (2009, May). Positive Unlabeled Learning for Data Stream Classification. In SDM (Vol. 9, pp. 257-268).
  • Rocchio-SVM & Rocchio-Clustering-SVM
    • Li, X., & Liu, B. (2003, August). Learning to classify texts using positive and unlabeled data. In IJCAI (Vol. 3, pp. 587-592).
  • Similarity Weight SVM
    • Xiao, Y., Liu, B., Yin, J., Cao, L., Zhang, C., & Hao, Z. (2011, July). Similarity-based approach for positive and unlabeled learning. In IJCAI (Vol. 22, No. 1, p. 1577).
  • Spy-EM
    • Liu, B., Lee, W. S., Yu, P. S., & Li, X. (2002, July). Partially supervised classification of text documents. In ICML (Vol. 2, pp. 387-394).
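
Several of the algorithms listed above share a first step: pulling "reliable negative" examples out of the unlabeled set before training a final classifier. The snippet below is a minimal sketch of that step in the style of the Rocchio stage described by Li & Liu (2003). It is not code from this repository: the function name `rocchio_reliable_negatives`, the toy data, and the defaults `alpha=16` and `beta=4` are assumptions chosen for illustration.

```python
import numpy as np


def rocchio_reliable_negatives(P, U, alpha=16.0, beta=4.0):
    """Return indices of rows in U that look like reliable negatives.

    P: positive examples, U: unlabeled examples (rows are feature vectors).
    The unlabeled set is provisionally treated as negative when building
    the two Rocchio prototype vectors.
    """
    def unit_rows(X):
        # Scale every row to unit length; all-zero rows are left as zeros.
        norms = np.linalg.norm(X, axis=1, keepdims=True)
        return X / np.where(norms == 0, 1.0, norms)

    P_n = unit_rows(np.asarray(P, dtype=float))
    U_n = unit_rows(np.asarray(U, dtype=float))

    # Prototype vectors for the positive and the provisional negative class.
    c_pos = alpha * P_n.mean(axis=0) - beta * U_n.mean(axis=0)
    c_neg = alpha * U_n.mean(axis=0) - beta * P_n.mean(axis=0)

    def cosine(X, c):
        # Rows of X are already unit length, so only c needs normalizing.
        c_norm = np.linalg.norm(c)
        return X @ c / (c_norm if c_norm > 0 else 1.0)

    # An unlabeled row is kept as a reliable negative when it is at least
    # as similar to the negative prototype as to the positive one.
    return np.flatnonzero(cosine(U_n, c_pos) <= cosine(U_n, c_neg))


# Toy data (hypothetical): two positives near the x-axis, three unlabeled rows.
P = np.array([[1.0, 0.0], [0.9, 0.1]])
U = np.array([[1.0, 0.1], [0.0, 1.0], [0.1, 0.9]])
reliable_negative_idx = rocchio_reliable_negatives(P, U)
```

In a two-step PU pipeline, the rows selected here would typically be combined with the positive set to train a second-stage classifier such as an SVM.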

Data Sets