Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Issues when downloading many files #40

Open
ramarty opened this issue Dec 20, 2023 · 3 comments
Open

Issues when downloading many files #40

ramarty opened this issue Dec 20, 2023 · 3 comments
Assignees
Labels
bug Something isn't working help wanted Extra attention is needed

Comments

@ramarty
Copy link
Contributor

ramarty commented Dec 20, 2023

I was downloading data for multiple days and the "retrieving" failed a few times. The function still worked, as it's designed to retry. But not sure if there's something going on where it's trying to download too much at one time? Not sure if limiting to, say, 2-3 concurrent downloads would help this issue? (I never seem to get these red bars when downloading just a couple tiles at once).

Screen Shot 2023-12-20 at 6 43 16 PM

@ramarty ramarty added bug Something isn't working help wanted Extra attention is needed labels Dec 20, 2023
@ramarty
Copy link
Contributor Author

ramarty commented Dec 21, 2023

My internet was really bad when this happened - when I switched to better internet it didn't happen! But maybe still something to think about?

@Holly-Transport
Copy link

Holly-Transport commented Dec 29, 2023

I also encountered many instances of file download timing out, like this (I used a smaller country, Jamaica):
image

@g4brielvs g4brielvs self-assigned this Jan 3, 2024
@g4brielvs
Copy link
Member

g4brielvs commented Jan 3, 2024

I was downloading data for multiple days and the "retrieving" failed a few times. The function still worked, as it's designed to retry. But not sure if there's something going on where it's trying to download too much at one time? Not sure if limiting to, say, 2-3 concurrent downloads would help this issue?

@ramarty @Holly-Transport Thanks (and happy new year)!

That's something to consider. There seems to be 2 things at play. The number of parallel downloads may be too ambitious. In the next release, we can change the default and expose the parameter for the user to choose. Also, especially in slower connections, the default httpx.Client may be timing out too soon. This issue would persist even if we decreased the number of parallel downloads. We can increase the default timeout and also expose it to the user.

All in all, connection issues will always be an upstream constraint imposed by NASA LAADS DAAC, but we can try to find the sweet spot. What do you think?

@g4brielvs g4brielvs mentioned this issue Feb 22, 2024
3 tasks
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working help wanted Extra attention is needed
Development

No branches or pull requests

3 participants