# data-mining

Here are 5,115 public repositories matching this topic...

This project compares traditional machine learning methods for tabular data classification, such as ensemble methods, decision trees, and Naive Bayes, with NLP classification methods such as Multinomial Naive Bayes, RNNs, and Transformers. It uses survey data from the CDC's Behavioral Risk Factor Surveillance System (BRFSS).

  • Updated May 12, 2024
  • Jupyter Notebook
Awesome-FL
desbordante-core

Desbordante is a high-performance data profiler that can discover many different patterns in data using various algorithms. It also supports running data-cleaning scenarios built on these algorithms. Desbordante has a console version and an easy-to-use web application.

  • Updated May 12, 2024
  • C++
