New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Docs take a long time to build #1420
Comments
Yep. I changed the default automl stopping criterion to I like the solutions you listed! Plus one of my own:
I recommend we go with option 2, but with option 3 in mind. |
#1627 was closed as a duplicate, but I think there could still be something there that wasn't covered in this issue, so posting here: I noticed that docs have been taking much longer to build. I think this is likely because the automl docs were changed in c871f3b to use the fraud dataset, instead of the breast cancer data set (+ elsewhere?) to showcase infer_problem_types, since the breast cancer dataset only has numeric columns. I suspect this is a different issue / reason for the even-longer build time of docs, from the previous 20 minutes to now >30 minutes, and could be worth mentioning! @dsherry FYI |
Another possible solution is to use multiple processors to build the docs: https://www.sphinx-doc.org/en/master/man/sphinx-build.html#cmdoption-sphinx-build-j |
Update following discussion with @dsherry. Adding in the This is what a successful build looks like for ReadtheDocs, taking a little over 20 minutes to complete. The differences between the HTML and Latex build times suggests that building the Jupyter notebooks themselves do not take a lot of time, which is good. However, we're also finding instances where the build fails like this. We noticed that for some reason, ReadtheDocs is running the full sequence of commands twice, which causes the build to take much longer (well over 30 minutes each to create the HTML and latex files), and causes the doc build to fail. I'll follow up with the ReadtheDocs support team to see why this is happening and how we can fix this, and I'll update with those results here when I get feedback. |
@bchen1116 contacted support and they said
I believe the "pull request building feature" referenced here is this, confirming. |
Update: |
@bchen1116 is this closeable now? |
Closing now, as there's been no issue with slow doc builds. |
As of late our docs take ~14 minutes to build on circle-ci whereas they took about 6 minutes to build in the previous release. The root cause of this slow-down seems to be that woodwork is inferring some categorical variables as text which then causes AutoML to use the TextFeaturizer. However, even if ww fixes the categorical vs text inference, the time to build the docs will inevitably increase as we write more documentation. This makes it hard for developers to iterate on the docs locally.
Possible solutions:
The text was updated successfully, but these errors were encountered: