Exit with code 1 due to network error: UnknownContentError while converting html to pdf #2187

no1stunna · 2015-02-02T17:11:24Z

There is html file:

https://gist.github.com/no1stunna/36637bac02569ad6744e

When i do

    wkhtmltopdf issue.html test.pdf

There is an error:

Loading pages (1/6)
Counting pages (2/6)
Resolving links (4/6)
Loading headers and footers (5/6)
Printing pages (6/6)
Done
Exit with code 1 due to network error: UnknownContentError

Most of the time its working well. But for some generated google urls like this one it throws such an error.

Thanks a lot.

The text was updated successfully, but these errors were encountered:

ctrlaltdylan · 2015-04-27T22:49:47Z

Bumping having this same issue

kenorb · 2015-05-13T21:34:45Z

The same here.

$ wkhtmltopdf page.html page.pdf
Loading pages (1/6)
Warning: Failed to load file://connect.facebook.net/en_US/sdk.js (ignore)
Warning: SSL error ignored                                        
Counting pages (2/6)                                               
Resolving links (4/6)                                                       
Loading headers and footers (5/6)                                           
Printing pages (6/6)
Done                                                                        
Exit with code 1 due to network error: UnknownContentError

kenorb · 2015-05-13T21:49:11Z

See: QNetworkReply Class

QNetworkReply::UnknownContentError: 299: an unknown error related to the remote content was detected

In #1948 we can read:

It looks like you are trying to load a dynamic resource which is not found (see explanation) -- you might want to see what it is.

However it doesn't solve the problem, because there is no way to ignore such error and continue. Currently it prevents generation of the output file.

team4music · 2015-06-10T23:55:08Z

Same problem here =(

brafdlog · 2015-11-11T14:57:43Z

Same here :(

kai789 · 2016-02-03T21:43:30Z

I hit UnknownContentError as well. It seems like when I reach the 16-18 page mark, with header+footer, it causes the error.

I moved my problem into a new issue: #2778

cm8 · 2016-03-06T22:23:03Z

I hit this error for a single page without any header or footer and without javascript running.

To debug this problem try running the conversion with --no-images which runs fine. I then saved the file locally to find out which image was offending. To test with a local copy do not forget to set e.g. <base href=".."/> in the <head> section of the html appropriately.

If the document is large, use grep -A2 "<img" to find urls to check. Use wget on these urls and inspect their content. This yields an img resource on archive.org, with an html response body, an image url which was broken at archival time.

archive.org replays http status codes for archived resources, in this case it is/was 400: Bad Request
QT webkit transforms this into UnknownContentError, presumably trying to interpret the html response body (the error message) as an image, despite code 400

After a review I can "ignore" this problem, since the pdf in question renders fine without the img resource. However I do not want to ignore non-zero return status in a batch invocation of wkhtmltopdf (to find and break on more serious errors).

The specific problem described above can be coupled to the load-media-error class.

wkhtmltopdf defaults to --load-media-error-handling ignore
the 'ignore' handlers do not trigger non-zero exit codes
UnknownContentError triggered from loading <img> sources (or, more generally any page requisite with an image/* mime type) should be handled like any other load-media-error

(using statically compiled wkhtmltopdf 0.12.3-dev (with patched qt))

bronson · 2016-03-14T14:07:11Z

Yep, I'm getting this error consistently with 0.12.1 and 0.12.3:

(EDIT: I mean, it consistently happens hundreds of times when generating thousands of PDFs. It doesn't happen on every PDF)

$ wkhtmltopdf http://prop.gisweb.com/print/geosearch/m/tCentralFlorida out.pdf
Loading pages (1/6)
QFont::setPixelSize: Pixel size <= 0 (0)                     ] 48%
Counting pages (2/6)
Resolving links (4/6)
Loading headers and footers (5/6)
Printing pages (6/6)
Done
Exit with code 1 due to network error: UnknownContentError

So I upgraded to 0.13.0-alpha-7b36694 and I'm getting:

$ wkhtmltopdf http://prop.gisweb.com/print/geosearch/m/tCentralFlorida out.pdf
Loading page (1/2)
Printing pages (2/2)
Done
Exit with code 1 due to network error: InternalServerError

Sigh. In both cases the PDF appears to be correctly created. I guess I'll hack my app to ignore error 1 for now... Hoping someone finds a better solution.

rk · 2016-03-30T15:15:37Z

I've had this error happen when using the --post key value param on a command that normally works (v0.12.3).

Loading pages (1/6)
content-type missing in HTTP POST, defaulting to application/x-www-form-urlencoded. Use QNetworkRequest::setHeader() to fix this problem.
content-type missing in HTTP POST, defaulting to application/x-www-form-urlencoded. Use QNetworkRequest::setHeader() to fix this problem.
Counting pages (2/6)
Resolving links (4/6)
Loading headers and footers (5/6)
Printing pages (6/6)
Done
Exit with code 1 due to network error: UnknownContentError

I was able to confirm the exact same request worked with jQuery $.post(), for both the cover page and the content. I have 1 external linked resource (a logo on the cover page) referenced by HTTP, so there shouldn't be a problem there.

hbarrington · 2016-05-05T21:04:59Z

We just started receiving this error as well. We're still trying to debug the root cause in our case but passing the --no-images parameter seems to resolve it for us as well. We're not sure why.

brafdlog · 2016-05-05T23:15:34Z

This happened to me when there was a missing image in the page. One the
image was removed from the page it was fixed.
On Fri, 6 May 2016 at 00:05 Hunter Barrington notifications@github.com
wrote:

We just started receiving this error as well. We're still trying to debug
the root cause in our case but passing the --no-images parameter seems to
resolve it for us as well. We're not sure why.

—
You are receiving this because you are subscribed to this thread.
Reply to this email directly or view it on GitHub
#2187 (comment)

omrqs · 2017-03-02T22:53:31Z

This error occurs when url are defined as //url... in place of https://url....
Keep the protocol.
wkhtmltopdf try find double slash as local path...

Solved for me.

seanthebean · 2017-07-23T04:46:22Z

Anyone know of any way to diagnose which resource is causing the problem? (Using the process of elimination can be very tedious/difficult if the PDF includes a lot of resources, or they're being programmatically included.)

Glepooek · 2017-08-23T08:11:08Z

was this problem solved?

DionataNunesGarcia · 2017-10-23T17:26:37Z

I have the same error, when it reaches 18 or more pages and including the footer and the header with html with URL, returns this error, and with menas pages it works normally, but until now I could not solve the problem, I give a URL to the footer and the header always returns the error "Exit with code 1 due to network error: UnknownContentError".
Does anyone already know how to fix it? I already tried everything I researched.

sbont · 2018-02-16T14:02:45Z

@DionataNunesGarcia me too, large documents seem to cause this error too.
Have you found a way to solve this issue?

gokulk16 · 2018-05-23T06:07:35Z

Same error. there should be an option created for handling this ContentNotFoundError.
Also in most cases, those are images.

alloylab · 2018-06-15T12:32:32Z

What flavor of linux are y'all running? What version of wkhtmltopdf? Is a patched QT version?
Is the certificate chain valid for https url?

Do you have openssl & ca-certificates packaged installed?

alloylab · 2018-07-05T18:09:34Z

is this still an issue on 0.12.5?

BenjaminRbt · 2018-07-06T09:33:48Z

@alloylab I was going to ask. Glad to see that some people want to see this issue out.

Tomsgu · 2018-07-19T15:55:11Z

SSL issues were fixed in 0.12.5. It should be also easier to debug the problem, when ContentNotFoundError will appear.

Tomsgu · 2018-07-25T08:35:42Z

Closing as there is no response. Please feel free to open a new issue with a complete example if you have some of the problems from this issue.

rainabba · 2018-10-17T23:42:14Z

I'm seeing this with 12.3 on WSL/Ubuntu 18.04, Windows 1803. When I pull these pages in browser, everything is 200 or 304 as expected.

singhravi1 · 2018-12-01T14:46:01Z

Got this error when i tried to create a 70 page pdf. Any possible solution?

rainabba · 2018-12-01T15:03:56Z

For me, the issue is resolved with 12.4

singhravi1 · 2018-12-01T15:09:22Z

i'm using 0.12.5 version. But still having this issue. Any idea?

edit: ubuntu 16.04

theredled · 2019-01-07T15:07:34Z

Same

rafaeljusto · 2019-11-21T12:41:03Z

This PR will probably solve this issue:
#4461

kenorb mentioned this issue May 13, 2015

Support for src=/ and src=// #2359

Closed

kai789 mentioned this issue Feb 4, 2016

UnknownContentError, limits pdf to about 16 pages #2778

Closed

AvrossHsiao mentioned this issue Mar 15, 2018

Update Steak AvrossHsiao/GoogleSearchCrawlerToJPG#2

Open

gokulk16 mentioned this issue May 23, 2018

Exit with code 1 due to network error: ContentNotFoundError #3917

Closed

alloylab added the NeedInfo label Jun 15, 2018

Tomsgu closed this as completed Jul 25, 2018

This was referenced Apr 8, 2020

incorporate fix to UnknownContentError propertyguru/wkhtmltopdf#1

Merged

Patch to fix UnknownContentError propertyguru/wkhtmltopdf-amd64#1

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Exit with code 1 due to network error: UnknownContentError while converting html to pdf #2187

Exit with code 1 due to network error: UnknownContentError while converting html to pdf #2187

no1stunna commented Feb 2, 2015

ctrlaltdylan commented Apr 27, 2015

kenorb commented May 13, 2015

kenorb commented May 13, 2015

team4music commented Jun 10, 2015

brafdlog commented Nov 11, 2015

kai789 commented Feb 3, 2016

cm8 commented Mar 6, 2016

bronson commented Mar 14, 2016

rk commented Mar 30, 2016

hbarrington commented May 5, 2016

brafdlog commented May 5, 2016

omrqs commented Mar 2, 2017

seanthebean commented Jul 23, 2017

Glepooek commented Aug 23, 2017

DionataNunesGarcia commented Oct 23, 2017

sbont commented Feb 16, 2018

gokulk16 commented May 23, 2018

alloylab commented Jun 15, 2018

alloylab commented Jul 5, 2018

BenjaminRbt commented Jul 6, 2018

Tomsgu commented Jul 19, 2018

Tomsgu commented Jul 25, 2018

rainabba commented Oct 17, 2018

singhravi1 commented Dec 1, 2018

rainabba commented Dec 1, 2018

singhravi1 commented Dec 1, 2018 •

edited

theredled commented Jan 7, 2019

rafaeljusto commented Nov 21, 2019

Exit with code 1 due to network error: UnknownContentError while converting html to pdf #2187

Exit with code 1 due to network error: UnknownContentError while converting html to pdf #2187

Comments

no1stunna commented Feb 2, 2015

ctrlaltdylan commented Apr 27, 2015

kenorb commented May 13, 2015

kenorb commented May 13, 2015

team4music commented Jun 10, 2015

brafdlog commented Nov 11, 2015

kai789 commented Feb 3, 2016

cm8 commented Mar 6, 2016

bronson commented Mar 14, 2016

rk commented Mar 30, 2016

hbarrington commented May 5, 2016

brafdlog commented May 5, 2016

omrqs commented Mar 2, 2017

seanthebean commented Jul 23, 2017

Glepooek commented Aug 23, 2017

DionataNunesGarcia commented Oct 23, 2017

sbont commented Feb 16, 2018

gokulk16 commented May 23, 2018

alloylab commented Jun 15, 2018

alloylab commented Jul 5, 2018

BenjaminRbt commented Jul 6, 2018

Tomsgu commented Jul 19, 2018

Tomsgu commented Jul 25, 2018

rainabba commented Oct 17, 2018

singhravi1 commented Dec 1, 2018

rainabba commented Dec 1, 2018

singhravi1 commented Dec 1, 2018 • edited

theredled commented Jan 7, 2019

rafaeljusto commented Nov 21, 2019

singhravi1 commented Dec 1, 2018 •

edited