Rcrawler is only saving internal HTML pages #61

Mlabrams1 · 2019-06-02T22:43:16Z

When using the network analysis functionality, Rcrawler saves copies of only the internal HTML pages listed in the Index file. Shouldn't it store a copy of every HTML page crawled, including those in NetwIndex?

Rcrawler(Website = "https://github.com/salimk/Rcrawler/issues/new", MaxDepth = 2, no_cores = 4, no_conn = 4, NetworkData = TRUE, NetwExtLinks = TRUE, statslinks = TRUE)
