Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Word Frequency calculation in a phrase list might be wrong #32

Open
vertika-19 opened this issue Dec 10, 2018 · 0 comments
Open

Word Frequency calculation in a phrase list might be wrong #32

vertika-19 opened this issue Dec 10, 2018 · 0 comments

Comments

@vertika-19
Copy link

You are calculating frequency distribution of words from Phrase list.
Phrase list is a set, so a word will be present just once in it. Using Counter(chain.from_iterable(phrase_list)) to find freq_distribution will only take care of the words in a set, which I think is wrong.
Candidate keywords which might occur many times in the text will be there just once in the phrase list.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant