

Feature Request: Chunked/resumable downloads (use Range: byte... request header) #2576

Open
aseanwatson opened this issue Dec 30, 2023 · 1 comment


aseanwatson commented Dec 30, 2023

Is your feature request related to a problem? Please describe.

It seems like onedrive just does a single GET to read file content. If the file is big enough, the request times out and the download fails.

Describe the solution you'd like

What follows is a set of detailed suggestions. Obviously, whoever is kind enough to implement this should treat them as a "serving suggestion" rather than hard requirements.

See downloadFile.

  1. Store .partial files in a temporary folder (not in the folder tree being synchronized). Have your database track in-progress temporary files, including id, eTag, expected length, and hash. It's probably smart to name it by the id to handle renames during download (but renames/moves change the eTag, so maybe it's not worth it).
  2. Have downloadFile check whether there's an in-progress download; if not, create a zero-byte file and register it in the database. (If it exists in the database but has a different eTag/hash/length from what the caller wants, delete the temporary file and database row and create new ones.)
  3. Open the temporary file in append mode and get its length.
  4. For the GET request, add If-Match and Range request headers. (Note the request header syntax is "bytes=start-end" with an equals sign and an inclusive end; the "bytes start-end/total" form with a space is the response's Content-Range.) The range can be:
    • from the current length of the temporary file to the end (e.g.: "bytes=" ~ str(currentLength) ~ "-"), or
    • a fixed-size chunk ("bytes=" ~ str(currentLength) ~ "-" ~ str(min(expectedLength, currentLength + maxChunkSize) - 1)).
  5. You will want to handle a 206 (partial content) response and double-check the Content-Range (the server is allowed to send back a wider range than requested, so you may need to skip bytes you already have). You can also confirm that the total size the server reports matches what you expect.
  6. If you're using the maxChunkSize option above, you should loop until you read all the content. Otherwise, if you get a 200/206 with all its content, the file is downloaded.
  7. Once you get the right-size file, verify the hash and move it to the correct location/name. (Not sure if this is unlink destination + link temp to destination + unlink temp, or something that preserves existing inode-based metadata: hard links/permissions/crtime/etc.) In monitor mode, you need to think about local changes to the file while it's being downloaded.
  8. If the request is interrupted (timeout, client killed, network disconnected, etc.), just try again when/if you can.
  9. If you get a 412 (eTag mismatch) or a 416 (bad range) response, the file changed in the service while you were downloading it. Delete the temporary file and queue a new download with the updated eTag/length/hash.
  10. If the file is deleted in the service while downloading, you'll get a 404; delete the temporary file.
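The steps above can be sketched roughly as follows. This is illustrative Python (the onedrive client itself is written in D): `build_range_header`, `parse_content_range`, and `resume_download` are hypothetical names, and a real implementation would need the database bookkeeping, hash verification, and retry/backoff limits described above.

```python
import re
import urllib.error
import urllib.request


def build_range_header(current_length, expected_length=None, max_chunk_size=None):
    """Build a Range request header resuming at current_length (step 4)."""
    if expected_length is None or max_chunk_size is None:
        return f"bytes={current_length}-"       # open-ended: to end of file
    end = min(expected_length, current_length + max_chunk_size) - 1
    return f"bytes={current_length}-{end}"      # Range ends are inclusive


def parse_content_range(value):
    """Parse a 'Content-Range: bytes start-end/total' response header (step 5)."""
    m = re.fullmatch(r"bytes (\d+)-(\d+)/(\d+|\*)", value)
    if m is None:
        raise ValueError(f"unexpected Content-Range: {value!r}")
    start, end, total = m.group(1, 2, 3)
    return int(start), int(end), None if total == "*" else int(total)


def resume_download(url, etag, temp_path, expected_length, max_chunk_size=1 << 22):
    """Sketch of steps 3-9: append to the .partial file until it is complete."""
    with open(temp_path, "ab") as f:            # append mode; tell() = current size
        while f.tell() < expected_length:
            req = urllib.request.Request(url, headers={
                "If-Match": etag,
                "Range": build_range_header(f.tell(), expected_length, max_chunk_size),
            })
            try:
                with urllib.request.urlopen(req) as resp:
                    body = resp.read()
                    if resp.status == 206:
                        start, _, _ = parse_content_range(resp.headers["Content-Range"])
                        # The server may answer with a wider range than requested:
                        # skip any bytes already on disk (start <= f.tell() here).
                        f.write(body[f.tell() - start:])
                    else:  # 200: server ignored Range and sent the whole file
                        f.truncate(0)
                        f.write(body)
            except urllib.error.HTTPError as e:
                if e.code in (404, 412, 416):
                    # File changed or vanished remotely (steps 9-10): the caller
                    # should delete temp_path and its database row, then re-queue.
                    raise
                continue  # transient error (step 8): retry; real code needs backoff
```

Note the `- 1` when computing the chunk end: Range byte positions are inclusive, so a 4096-byte chunk starting at 0 is `bytes=0-4095`.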

Describe alternatives you've considered

I could use a browser or download manager to get big files.

Additional context

No response

@abraunegg
Owner

@aseanwatson
Thanks for your suggestion and the detail above. Resumable downloads are currently on the slate for v2.5.x implementation; the feature just had no formal request.

@abraunegg abraunegg added this to the v2.5.x milestone Dec 30, 2023
@abraunegg abraunegg changed the title Chunked/resumable downloads (use Range: byte... request header) Feature Request: Chunked/resumable downloads (use Range: byte... request header) Jan 9, 2024