docs: pandas DataFrame samples are more standalone #224

tswast · 2021-06-30T21:17:05Z

In response to customer issue 179797311

This updates the code samples on https://cloud.google.com/bigquery/docs/bigquery-storage-python-pandas#objectives to include relevant imports.

Also:

Add shared fixture for project_id
Use create_bqstorage_client instead of manually creating one. Comment that this is the default.

Note: the samples in main_test.py are still there. We'll need to remove those once the docs have been updated.

snippet-bot · 2021-06-30T21:17:10Z

Here is the summary of changes.

You are about to add 4 region tags.

samples/to_dataframe/read_query_results.py:17, tag bigquerystorage_pandas_tutorial_read_query_results
samples/to_dataframe/read_table_bigquery.py:17, tag bigquerystorage_pandas_tutorial_read_table
samples/to_dataframe/read_table_bqstorage.py:18, tag bigquerystorage_pandas_tutorial_read_session
samples/to_dataframe/read_table_bqstorage.py:23, tag bigquerystorage_pandas_tutorial_read_session

This comment is generated by snippet-bot.
If you find problems with this result, please file an issue at:
https://github.com/googleapis/repo-automation-bots/issues.
To update this comment, add snippet-bot:force-run label or use the checkbox below:

Refresh this comment

shollyman · 2021-07-01T16:39:33Z

samples/to_dataframe/read_query_results.py

+            # Optionally, explicitly request to use the BigQuery Storage API. As of
+            # google-cloud-bigquery version 1.26.0 and above, the BigQuery Storage
+            # API is used by default.
+            create_bqstorage_client=True,


This seems like it's prone to get people smashing into the guardrail of dependency management even more?

If they're using a version of the BQ client library that doesn't have this on by default, I suspect that getting dependencies updated is a non-trivial matter. And the explicit bqstorage examples can help them probe with less intermediate magic.

Sigh. Even now with BQ Storage as an optional "extra", the package manager doesn't give these users much help. At least now newer versions of the BQ library provide info in the error message about what package versions they need to install.

I'm tempted more and more just to make BQ Storage a required dependency. We have enough gRPC-based libraries now that I'm not as worried about pulling in grpcio as a dependency (in fact, I think we already are, anyway).

I'm a fan, since we're still not great about even documenting the optional dependencies properly. I'd go so far as to even consider arrow as mandatory as well, as anecdotally we see people tripping on dependencies more than feedback about dependency graph being too large etc.

shollyman · 2021-07-01T16:44:39Z

samples/to_dataframe/read_table_bqstorage.py

+    stream = read_session.streams[0]
+    reader = bqstorageclient.read_rows(stream.name)
+
+    # Parse all Arrow blocks and create a dataframe. This call requires a


Now that you get the schema on the first readrows response, passing the session around should be unnecessary. Worth addressing this in the client before finishing out this sample?

True. #168

Yeah, I think that simplifying this sample is good motivation for working on that feature.

…-samples

🤖 I have created a release \*beep\* \*boop\* --- ### [2.6.1](https://www.github.com/googleapis/python-bigquery-storage/compare/v2.6.0...v2.6.1) (2021-07-20) ### Bug Fixes * **deps:** pin 'google-{api,cloud}-core', 'google-auth' to allow 2.x versions ([#240](https://www.github.com/googleapis/python-bigquery-storage/issues/240)) ([8f848e1](https://www.github.com/googleapis/python-bigquery-storage/commit/8f848e18379085160492cdd2d12dc8de50a46c8e)) ### Documentation * pandas DataFrame samples are more standalone ([#224](https://www.github.com/googleapis/python-bigquery-storage/issues/224)) ([4026997](https://www.github.com/googleapis/python-bigquery-storage/commit/4026997d7a286b63ed2b969c0bd49de59635326d)) --- This PR was generated with [Release Please](https://github.com/googleapis/release-please). See [documentation](https://github.com/googleapis/release-please#release-please).

docs: pandas DataFrame samples are more standalone

d44d700

tswast requested a review from a team as a code owner June 30, 2021 21:17

tswast requested review from engelke and removed request for a team June 30, 2021 21:17

product-auto-label bot added the api: bigquerystorage Issues related to the googleapis/python-bigquery-storage API. label Jun 30, 2021

google-cla bot added the cla: yes This human has signed the Contributor License Agreement. label Jun 30, 2021

product-auto-label bot added the samples Issues that are directly related to samples. label Jun 30, 2021

tswast mentioned this pull request Jun 30, 2021

remove samples/to_dataframe/main_test.py #225

Closed

tswast added 4 commits June 30, 2021 16:18

fix region tag

94df063

fix region tag

d422182

remove unused imports

87ff7fa

blacken

fdd8b6b

tswast requested review from a team and shollyman and removed request for a team July 1, 2021 15:10

shollyman approved these changes Jul 1, 2021

View reviewed changes

tswast added 2 commits July 9, 2021 16:21

Merge remote-tracking branch 'upstream/master' into b179797311-pandas…

ac19d47

…-samples

remove session from call to rows/to_dataframe

0b7fc64

tswast merged commit 4026997 into googleapis:master Jul 13, 2021

tswast deleted the b179797311-pandas-samples branch July 13, 2021 19:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

docs: pandas DataFrame samples are more standalone #224

docs: pandas DataFrame samples are more standalone #224

tswast commented Jun 30, 2021

snippet-bot bot commented Jun 30, 2021 •

edited

shollyman Jul 1, 2021

tswast Jul 7, 2021

shollyman Jul 9, 2021

shollyman Jul 1, 2021

tswast Jul 7, 2021

docs: pandas DataFrame samples are more standalone #224

docs: pandas DataFrame samples are more standalone #224

Conversation

tswast commented Jun 30, 2021

snippet-bot bot commented Jun 30, 2021 • edited

shollyman Jul 1, 2021

Choose a reason for hiding this comment

tswast Jul 7, 2021

Choose a reason for hiding this comment

shollyman Jul 9, 2021

Choose a reason for hiding this comment

shollyman Jul 1, 2021

Choose a reason for hiding this comment

tswast Jul 7, 2021

Choose a reason for hiding this comment

snippet-bot bot commented Jun 30, 2021 •

edited