Broken Code in Section 5.3.1 #62

kaybenleroll · 2019-04-21T16:22:40Z

The code scraping in section 5.3.1 no longer works as most of the code in the package tm.plugin.webmining is not up-to-date.

I tried switching the GoogleFinanceSource to YahooFinanceSource but that did not work either.

I am sure there are alternatives, but I figured it is best reported here first.

The text was updated successfully, but these errors were encountered:

juliasilge · 2019-05-04T16:55:06Z

Thank you very much for this report! 🙌 I want to acknowledge it and let you know we are aware and looking for a replacement data source to use in the book.

Just to record it here, ideally we would want to find something that:

allows us to demonstrate how to tidy() a document-term matrix
is an appropriate use case for the Loughran and McDonald sentiment lexicon

This may be too high an ask, though, and we need to break these apart and integrate these two bits of information separately. @dgrtwo

kaybenleroll · 2019-05-09T13:52:51Z

Not at all Julia, happy to help! Let me know if you need any help with this - happy to help out any way I can. That book is really useful and has helped me a lot, so happy to contribute back. :)

nattalides · 2020-01-25T15:04:26Z

Same issue - after a bit of search it looks like the service from Yahoo and Google has been deprecated so probably best remove that bit.

@dgrtwo @juliasilge Do you think it would be better/easier to have a stored Corpus/VCorpus/WebCorpus financial article dataset as part of {tidytext} removing dependencies from other packages. This will enable to demonstrate both of the bullet points you raised.

DesmondChoy · 2020-06-03T08:54:19Z

Thank you very much for this report! 🙌 I want to acknowledge it and let you know we are aware and looking for a replacement data source to use in the book.

Just to record it here, ideally we would want to find something that:

allows us to demonstrate how to tidy() a document-term matrix

is an appropriate use case for the Loughran and McDonald sentiment lexicon

This may be too high an ask, though, and we need to break these apart and integrate these two bits of information separately. @dgrtwo

How about company's earnings call transcripts?
I stumbled upon a site that seems to provide these for free: https://news.alphastreet.com/ (Note: I'm not affiliated with them in any way)

juliasilge mentioned this issue Apr 15, 2020

Chapter 1 missing introduction on getting self-generated texts into R #75

Open

juliasilge mentioned this issue Oct 27, 2021

tm.plugin.webminin is no longer working for Chapter 5.3.1 #99

Closed

juliasilge mentioned this issue Jul 13, 2023

Update 05-document-term-matrices.Rmd #110

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Broken Code in Section 5.3.1 #62

Broken Code in Section 5.3.1 #62

kaybenleroll commented Apr 21, 2019

juliasilge commented May 4, 2019 •

edited

kaybenleroll commented May 9, 2019

nattalides commented Jan 25, 2020 •

edited

DesmondChoy commented Jun 3, 2020

Broken Code in Section 5.3.1 #62

Broken Code in Section 5.3.1 #62

Comments

kaybenleroll commented Apr 21, 2019

juliasilge commented May 4, 2019 • edited

kaybenleroll commented May 9, 2019

nattalides commented Jan 25, 2020 • edited

DesmondChoy commented Jun 3, 2020

juliasilge commented May 4, 2019 •

edited

nattalides commented Jan 25, 2020 •

edited