You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
After a number of queries (approximately 1000) to S3 via duckdb and httpfs the connection begins to fail with error :
duckdb.duckdb.IOException: IO Error: Connection error for HTTP HEAD to 'https://my-bucket.s3.amazonaws.com/prod/bronze/salesforce/main.parquet'
To reproduce
Difficult to reproduce because it seems tied to the number of connections made to S3. It is very consistent. My pipelines start running at 12pm UTC and after ~ 8-10 hours the connections start returning the error.
Expected behavior
No response
Screenshots
No response
Operating system
duckdb v0.10.1
Mage in docker on EC2.
Additional context
How I am hacking around this?
I am managing the issue by restarting the container everyday.
What have I done to help myself?
I have found these issues on duckdb however they fail to resolve my issue:
Mage version
mage v0.9.67
Describe the bug
After a number of queries (approximately 1000) to S3 via duckdb and httpfs the connection begins to fail with error :
To reproduce
Difficult to reproduce because it seems tied to the number of connections made to S3. It is very consistent. My pipelines start running at 12pm UTC and after ~ 8-10 hours the connections start returning the error.
Expected behavior
No response
Screenshots
No response
Operating system
Additional context
How I am hacking around this?
I am managing the issue by restarting the container everyday.
What have I done to help myself?
I have found these issues on duckdb however they fail to resolve my issue:
read_parquet('s3://...', union_by_name=True)
throws "Connection error for HTTP HEAD" butunion_by_name=False
does not duckdb/duckdb#11695Code producing error
traceback
stacktrace
Additonal info
Slack thread
The text was updated successfully, but these errors were encountered: