Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

S3 to DC tool fails by hanging indefinitely #105

Open
alexgleith opened this issue Oct 15, 2020 · 7 comments
Open

S3 to DC tool fails by hanging indefinitely #105

alexgleith opened this issue Oct 15, 2020 · 7 comments
Assignees

Comments

@alexgleith
Copy link
Contributor

It's hard to reproduce, but if you install odc-apps-dc-tools without aiohttp==3.6.2, for example, the tool will just hang.

@Kirill888
Copy link
Member

Are you using "allow pre-release installs" flags? We should only install our libs with those flags.
I think for aio libs it's best to pin them even if this is not the cause of this problem.

@alexgleith
Copy link
Contributor Author

This is the requirements used:

--pre
--extra-index-url https://packages.dea.ga.gov.au/
datacube[performance,s3]==1.8.3
aiobotocore[boto3,awscli]==1.1.1
odc-apps-dc-tools

Adding in pinned aiohttp makes it work.

@Kirill888
Copy link
Member

same as #76, don't use --pre globally, right now it's aiohttp that is breaking, but next some other thing will, --pre should only apply to odc-apps-dc-tools really, and you can avoid using --pre altogether by specifying >={dev_version} on odc-apps-dc-tools.

@alexgleith
Copy link
Contributor Author

Please create a PR on the requirements files in here: https://github.com/opendatacube/datacube-docker

But it is patching the symptom, and doesn't resolve that the tool should fail faster if it's broken and not hang indefinitely.

@Kirill888
Copy link
Member

have you debugged which library it is hanging in?

Nothing I can do if your environment has incompatible versions of unreleased third party software that happen to misbehave when used that way.

@woodcockr
Copy link
Member

I reported this previously. It's not a library issue. Something to do with asyncio race condition when something fails (something related to network, outside the core logic flow anyway) was as far as I could theorize but my skills with aio are extremely limited and I couldn't figure out how to debug it further. Uncaught exception maybe? its quite painful to reproduce but it happens a fair bit, unfortunately you can run the same thing twice and have it not happen for one of them. quite frustrating for debugging.

@Kirill888
Copy link
Member

I have not seen it happen when using non-alpha versions of aiohttp, aiobotocore installed with compatible botocore

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants