'WARC-Identified-Payload-Type' allowed for request, revisit, continuation? #49

Open
wumpus opened this issue Jan 30, 2019 · 2 comments

wumpus commented Jan 30, 2019

In the 1.1 spec, section 5.19, 'WARC-Identified-Payload-Type' is allowed for anything with a well-defined payload.

That makes sense for response, resource, and conversion.

That doesn't make sense for request, revisit, and continuation.

Member

ato commented Jan 30, 2019

It seems useful to allow it for requests, as the software creating the WARC file may want to identify the content type of the request payload. For example, when JavaScript running in a browser constructs a mystery payload that is recorded by a tool like warcprox.
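As a rough illustration of the request case, here is a minimal Python sketch of how a capture tool might fill in the field when the declared Content-Type looks unreliable. The helper names `identify_payload_type` and `request_record_headers` are hypothetical, not part of warcprox or any WARC library, and the JSON-only sniffing is just an assumption to keep the example short.

```python
import json


def identify_payload_type(payload: bytes, declared: str) -> str:
    """Best-guess media type for a payload (hypothetical helper).

    Falls back to the declared type, minus parameters, when the payload
    does not parse as a JSON object or array.
    """
    fallback = declared.split(";")[0].strip()
    try:
        parsed = json.loads(payload.decode("utf-8"))
    except (UnicodeDecodeError, ValueError):
        return fallback
    # Bare scalars such as b"42" are valid JSON but ambiguous, so only
    # objects and arrays are reported as JSON here.
    if isinstance(parsed, (dict, list)):
        return "application/json"
    return fallback


def request_record_headers(target_uri: str, payload: bytes, declared: str) -> dict:
    """Build a partial header set for a WARC 'request' record (illustrative only)."""
    return {
        "WARC-Type": "request",
        "WARC-Target-URI": target_uri,
        "WARC-Identified-Payload-Type": identify_payload_type(payload, declared),
    }
```

A request body of `{"a": 1}` sent as `text/plain` would be recorded with `WARC-Identified-Payload-Type: application/json`, while the HTTP headers inside the record keep the original Content-Type.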

Author

wumpus commented Jan 30, 2019

Ah, yes, that's a good one. There are a lot of JSON request payloads out there sent with Content-Type text/plain. A revisit could potentially be in the same situation.

Continuation records have a conflicting status. In clause 7, "Segments other than the first should not contain other optional fields" prohibits WARC-Identified-Payload-Type, which conflicts with 5.19.
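To summarize where the thread stands, here is a small Python sketch that encodes the reading discussed above as a lookup. It reflects this conversation only, not a ruling from the spec; the function name and the status strings are made up for illustration.

```python
# Record types for which the thread considers the field sensible.
# This set reflects the discussion above, not normative spec text.
SENSIBLE_TYPES = {"response", "resource", "conversion", "request", "revisit"}


def identified_payload_type_status(warc_type: str) -> str:
    """Return the thread's view on WARC-Identified-Payload-Type for a record type."""
    if warc_type in SENSIBLE_TYPES:
        return "ok"
    if warc_type == "continuation":
        # 5.19 would allow the field, but clause 7 says segments other
        # than the first should not contain other optional fields.
        return "conflict"
    # warcinfo, metadata, etc. were not discussed in this thread.
    return "not discussed"
```

A writer or validator could use something like this to warn, rather than fail, when it sees the field on a continuation record, since the wording in clause 7 is "should not".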
