Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Rcrawler is only saving internal HTML pages #61

Open
Mlabrams1 opened this issue Jun 2, 2019 · 0 comments
Open

Rcrawler is only saving internal HTML pages #61

Mlabrams1 opened this issue Jun 2, 2019 · 0 comments

Comments

@Mlabrams1
Copy link

Mlabrams1 commented Jun 2, 2019

When utilizing the network analysis functionality, only the internal HTML pages identified in the Index file are stored as copies. This should store a copy of all HTML pages crawled, including those in NetwIndex, correct?

Rcrawler(Website = "https://github.com/salimk/Rcrawler/issues/new", MaxDepth = 2, no_cores = 4, no_conn = 4 , NetworkData = TRUE, NetwExtLinks =TRUE, statslinks = TRUE)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant