Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

how to run Javascript tag before extracting data from each page #67

Open
hamedf62 opened this issue Oct 12, 2019 · 2 comments
Open

how to run Javascript tag before extracting data from each page #67

hamedf62 opened this issue Oct 12, 2019 · 2 comments

Comments

@hamedf62
Copy link

hi and good day

as i need to extract comments from a page, as long texts are hided and need to push "read more" link at first, it done by javascript code.

so i need to run a scripts in each page before extracting data.

kindly advice how its possible...

warm regards

@omar-elmaria
Copy link

I am also facing the same issue. Mr. developer, your reply would be highly appreciated :))

@pixgarden
Copy link

BY default, I don't think rcrawler is the perfect tool to do that, I would recommend using selenium or puppeteer
both can be used via an R package too.

It might be possible to get the data by looking at which URL is being called (inside the network panel of your browser) when you click on loadmore. It depends on how the website is built.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants