Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Support for Snowflake Connector's query pushdown #496

Open
Ioankall opened this issue Jul 14, 2023 · 1 comment
Open

Support for Snowflake Connector's query pushdown #496

Ioankall opened this issue Jul 14, 2023 · 1 comment
Labels
question Further information is requested

Comments

@Ioankall
Copy link

Snowflake Connector supports Query Pushdown - a mechanism that turn normal spark transformations into a snowflake query plan. It's a very useful feature as you save all the networking cost of pulling data from huge tables to where your Spark Application is running.

I tried to run the Profiler on a Spark DataFrame that's been loaded from a Snowflake table with the Pushdown mechanism enabled. I got the following error:

23/07/14 11:59:50 INFO SnowflakeTelemetry$: Pushdown fails because of operation: stateful_approx_count_distinct @ AggregationStatement
message: pushdown failed for aggregate expression
isKnown: false

Is anyone in the community familiar with this mechanism/error?

@Ioankall Ioankall added the question Further information is requested label Jul 14, 2023
@jhchee
Copy link
Contributor

jhchee commented Jul 24, 2023

@Ioankall This sounds like a problem on Snowflake driver rather than deequ/Spark.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants