Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Not list out the snippets #9

Closed
lampts opened this issue Mar 13, 2017 · 3 comments
Closed

Not list out the snippets #9

lampts opened this issue Mar 13, 2017 · 3 comments

Comments

@lampts
Copy link

lampts commented Mar 13, 2017

Thank for such a wonderful tool to do visualisation.

I got the error after running on small dataset as attached. There are 1 mention including apple term but the snippets on current is empty.

screen shot 2017-03-13 at 2 47 43 pm

What's wrong with that?
Regards,
Laam

@JasonKessler
Copy link
Owner

Hi Laam,

Thanks for reporting this error, and for the compliment!

I'm having trouble seeing the attached dataset. It would be great if you could upload the actual HTML file produced.

The mention counts displayed on the visualization were determined in Python via spaCy's (or whatever NLP engine you used's) tokenizer, while the excerpts listed are found through a Javascript regex, which may be less robust to special characters.

Jason

@lampts
Copy link
Author

lampts commented Mar 14, 2017

Yes, it seems the mismatched between mentions by spacy and javascript regex to get related snippets. I can share you another file which has the same issue:
(1) query exxon: there is no snippets related to the Past (x axis)
(2) terms: brexit, ipo are not clickable

The attached is for reproducible.

FINZINE-Visualization-trendy-vs-past-v2.html.zip

@JasonKessler
Copy link
Owner

Thanks a lot for including the full output. That really helped in debugging this issue. I found it, and there will be a fix in the next release (which is coming in a few minutes).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants