Add custom retry and timeout options for partition reads and queries #261
Labels
api: spanner
Issues related to the googleapis/python-spanner API.
priority: p2
Moderately-important priority. Fix may not be included in next release.
type: feature request
‘Nice-to-have’ improvement, new feature or different behavior or design.
I am building an application in Apache Beam and Python that runs in Google DataFlow. I am using the
ReadFromSpanner
method inapache_beam.io.gcp.experimental.spannerio
. This works for most of my Spanner tables but the really large ones that are >40m rows tend to fail due to the following error:From my understanding this error comes from the
ReadFromSpanner
operation as it's workers have timed out.To solve this I have tried the following:
--experiments=shuffle_mode=service
flag as suggested in Google's Common Error Guidancen1-standard-1
up ton1-standard-32
A potential solution is to edit the timeout control; I have seen this being available in Java but not in Python. How can I edit timeout control in Python or is there any other solution to this issue?
The text was updated successfully, but these errors were encountered: