Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Literature review regarding word embeddings #1

Open
alexhebing opened this issue Mar 5, 2020 · 4 comments
Open

Literature review regarding word embeddings #1

alexhebing opened this issue Mar 5, 2020 · 4 comments
Assignees

Comments

@alexhebing
Copy link
Contributor

alexhebing commented Mar 5, 2020

The two themes in this project will be:

  1. using word embeddings in sentiment analysis. As a starting point, see:

Note that the above two articles were part of the application because they are cited a lot, and were not read extensively (yet)

  1. the potential of word embeddings in working multi-lingually. For example:

Task
Explore the literature with regard to this themes and cook up a direction to take the project in.

Please note I created an Everhour task to track time on (5% of the total project time - I'll try to come up with a number of hours soon).

@alexhebing
Copy link
Contributor Author

alexhebing commented Apr 14, 2020

@BeritJanssen FYI: first set of data available in SurfDrive: '.../DigitalHumanitiesLabatUU/Reader Responses to translated literature/data/initial set (Harry Potter and Dinner)/.

@BeritJanssen
Copy link
Member

@alexhebing in the Dinner scrapings, I find this error to occur a lot in the review csv's: 'review_1366787834;https://www.goodreads.com/review/show/1366787834;20510036-the-dinner;English;Aug 17, 2015;Lauren Davis;en;liked it;3;The narrator's voice was wonderfully written -- highly unreliable and with snark to spare. I would have given the book more stars were it not for the implausible set-up. By this I mean: I find it hard to believe that anyone, let alone a highly public politician, would meet at a highly public and terribly posh restaurant to discuss the horrific murder he has just discovered his child, along with that child's cousin, committed. Because the premise struck me as ridiculous, it tainted my view of the The narrator's voice was wonderfully written -- highly unreliable and with snark to spare. I would have given the book more stars were it not for the implausible set-up. By this I mean: I find it hard to believe that anyone, let alone a highly public politician, would meet at a highly public and terribly posh restaurant to discuss the horrific murder he has just discovered his child, along with that child's cousin, committed. Because the premise struck me as ridiculous, it tainted my view of the rest of the book. ...more'

I.e., if you look closely, the text repeats.

@alexhebing
Copy link
Contributor Author

@BeritJanssen : great catch, many thanks! Moved to its own issue and fixing.

@alexhebing
Copy link
Contributor Author

alexhebing commented Apr 16, 2020

I have now done a new scraping and updated the files in SurfDrive (it actually was a lot faster than earlier this week, probably because it is so early in the morning and it's no so busy on the information highway).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants