New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature Request: Apriori algorithm (working code) #2872
Comments
Hi Thanks a lot for your enthousiasme. What's the relevant publication. Algorithms included in scikit-learn must be literature standards and come from publications with a high citation count. Cheers Gaël -------- Original message -------- From: dfrusdn notifications@github.com Date:19/02/2014 04:01 (GMT+01:00) To: scikit-learn/scikit-learn scikit-learn@noreply.github.com Subject: [scikit-learn] Feature Request: Apriori algorithm (working code)
(#2872) http://codereview.stackexchange.com/questions/38101/optimizing-apriori-algorithm-python-pandas This algorithm is useful for pattern mining. It does not have all the needed features, but would be a good point to start. — |
Apriori has no shortage of pedigree in data mining ( On 19 February 2014 16:57, Gael Varoquaux notifications@github.com wrote:
|
@jnothman Scikit-learns front page states it provides "Simple and efficient tools for data mining and data analysis" Associative rule learning is a good addition to the data mining component The problems with that algorithm would be the |
Ok I wasn't completely sure that we were talking of this one. Gael -------- Original message -------- From: jnothman notifications@github.com Date:19/02/2014 07:10 (GMT+01:00) To: scikit-learn/scikit-learn scikit-learn@noreply.github.com Cc: Gael Varoquaux gael.varoquaux@normalesup.org Subject: Re: [scikit-learn] Feature Request: Apriori algorithm (working code)
(#2872) On 19 February 2014 16:57, Gael Varoquaux notifications@github.com wrote:
|
A pandas dependency is not acceptable; I for one cannot read the code you posted. What will be the interface for this algorithm? I'm not convinced that it can fit the existing API unless it's presented as a kernel approximation algorithm. |
I think it only fits in the context of something like CBA which classifies documents by learning association rules that map feature groups to target labels. In that context, the learnt rule-set constitutes the model... but not all rules in the general apriori are utilised. |
#2662 is a generalization of this feature request, closing. |
I would like to contribute my Apriori algorithm found here:
http://codereview.stackexchange.com/questions/38101/optimizing-apriori-algorithm-python-pandas
This algorithm is useful for pattern mining.
It does not have all the needed features, but would be a good point to start.
The text was updated successfully, but these errors were encountered: