
Running sql using sparkconnect should not print full stack trace #1012

Draft
wants to merge 8 commits into master

Conversation


@b1ackout b1ackout commented May 10, 2024

Describe your changes

  1. Added a try/except around handle_spark_dataframe. When pyspark.sql.utils.AnalysisException occurs, only the Spark error is shown, not the full stack trace.
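The change can be sketched roughly like this (an illustration, not the actual JupySQL code; `handle_spark_query`, `bad_query`, and the local `AnalysisException` stand-in are hypothetical names so the example runs without pyspark installed):

```python
class AnalysisException(Exception):
    """Local stand-in for pyspark.sql.utils.AnalysisException."""


def handle_spark_query(run_query):
    """Run a Spark query callable; on analysis errors, return only
    Spark's own error text instead of the full Python traceback."""
    try:
        return run_query()
    except AnalysisException as e:
        # Surface just the Spark error message, no stack trace.
        return f"Spark error: {e}"


def bad_query():
    raise AnalysisException("Table or view not found: missing_table")


print(handle_spark_query(bad_query))
# → Spark error: Table or view not found: missing_table
```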

Issue number

Closes #1011

Checklist before requesting a review


📚 Documentation preview 📚: https://jupysql--1012.org.readthedocs.build/en/1012/

@b1ackout b1ackout requested a review from edublancas as a code owner May 10, 2024 10:16
raise exceptions.MissingPackageError("pyspark not installed")

return SparkResultProxy(dataframe, dataframe.columns, should_cache)
try:


please integrate this with short_errors:

short_errors = Bool(

by default it should raise the exception; if short_errors is True, then just print it
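The reviewer's suggestion might look roughly like this (an illustrative sketch; `handle_query` and its signature are hypothetical, and the `short_errors` parameter stands in for JupySQL's `Bool` traitlet of the same name):

```python
class AnalysisException(Exception):
    """Local stand-in for pyspark.sql.utils.AnalysisException."""


def handle_query(run_query, short_errors=False):
    """Run a query callable. With short_errors=False (default) the
    exception propagates with its full traceback; with short_errors=True
    only the concise Spark message is printed."""
    try:
        return run_query()
    except AnalysisException as e:
        if short_errors:
            print(f"Spark error: {e}")  # concise message only
            return None
        raise  # default: surface the full exception
```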

return SparkResultProxy(dataframe, dataframe.columns, should_cache)
try:
    return SparkResultProxy(dataframe, dataframe.columns, should_cache)
except AnalysisException as e:


this except is redundant; the except Exception as e clause can catch all exceptions

@edublancas edublancas added the feature Adds a new feature label May 21, 2024
@b1ackout b1ackout marked this pull request as draft May 23, 2024 07:34
@@ -559,6 +559,7 @@ def is_non_sqlalchemy_error(error):
    # Pyspark
    "UNRESOLVED_ROUTINE",
    "PARSE_SYNTAX_ERROR",
    "AnalysisException",

After looking through the code, I think adding AnalysisException here will solve the issue, since PARSE_SYNTAX_ERROR works as expected. AnalysisException covers all these error conditions.
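A rough sketch of how a marker-based check like `is_non_sqlalchemy_error` might classify such errors (the marker list mirrors the diff above, but the function body here is an assumption, not JupySQL's actual implementation):

```python
# Known Pyspark error markers, as listed in the diff above.
SPARK_ERROR_MARKERS = (
    "UNRESOLVED_ROUTINE",
    "PARSE_SYNTAX_ERROR",
    "AnalysisException",
)


def is_non_sqlalchemy_error(error):
    """Return True if the error message contains a known
    non-SQLAlchemy (e.g. Pyspark) error marker."""
    msg = str(error)
    return any(marker in msg for marker in SPARK_ERROR_MARKERS)


print(is_non_sqlalchemy_error("AnalysisException: [TABLE_OR_VIEW_NOT_FOUND]"))
# → True
```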


I just need to test it somehow. I will try to package jupysql and install it in a Spark environment.


you can install like this:

pip install git+https://github.com/b1ackout/jupysql@running-sql-using-sparkconnect-should-not-print-full-stack-trace
