Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Configurable database names #426

Open
wants to merge 4 commits into
base: master
Choose a base branch
from
Open

Conversation

dgoiko
Copy link

@dgoiko dgoiko commented Jan 9, 2020

It is possible to configure the database names from the CrawlControler constructor.

It allowed me to create multiple CrawlerControllers on the same working directory without having the URLs mixed between them.

Substituting #420 because I broke the branch

It is possible to configure the database names from the CrawlControler constructor.
@Chaiavi
Copy link
Contributor

Chaiavi commented Jan 19, 2020

This pull request makes sense to me, I think it should be merged

Please note that I didn't look deeply into the code, but this feature is a good one

Modified to make InProcessPagesDB configurable.
@dgoiko
Copy link
Author

dgoiko commented Jan 24, 2020

Thanks for your note. Just for the record, the only changes performed are changing constants which are used as arguments for sleepycat are now constructor arguments for the calling classes. I've added those constructors maintaining old ones for retro-compatibility.

InProcessPagesDB was not configurable since I didn't need it, but now that someone showed interest for this I've just added it to the code

I was thinking about including it into CrawlConfig variable, however, I thought it would be harded to get merged (more code to review by maintainer).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants