AnalysisException/DATATYPE_MISMATCH error when generating summary dataframe from sources that have a column named "summary" #264

artruk · 2024-04-18T15:45:22Z

Expected Behavior

Current Behavior

Generating summary dataframe using DataAnalyzer seems to fail whenever the source being analyzed has a column named "summary"

Steps to Reproduce (for bugs)

import dbldatagen as dg

df = spark.range(10).withColumnRenamed("id", "summary")
summary_df = dg.DataAnalyzer(sparkSession=spark, df = df).summarizeToDF()

Context

Your Environment

dbldatagen version used:
Databricks Runtime version:
Cloud environment used:

The text was updated successfully, but these errors were encountered:

ronanstokes-db · 2024-05-21T21:05:57Z

we'll add a fix to this in the next hotfix.

In the meantime you can rename the "summary" field to something else - but avoid using leading underscores as these may conflict with internal column names

ronanstokes-db · 2024-05-22T21:31:27Z

Fixed in hotfix as of 05/22/24

ronanstokes-db self-assigned this May 21, 2024

ronanstokes-db added the bug Something isn't working label May 21, 2024

ronanstokes-db linked a pull request May 21, 2024 that will close this issue

Feature hotfixes #274

Merged

11 tasks

ronanstokes-db closed this as completed in #274 May 22, 2024

ronanstokes-db added the Fixed label May 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AnalysisException/DATATYPE_MISMATCH error when generating summary dataframe from sources that have a column named "summary" #264

AnalysisException/DATATYPE_MISMATCH error when generating summary dataframe from sources that have a column named "summary" #264

artruk commented Apr 18, 2024

ronanstokes-db commented May 21, 2024

ronanstokes-db commented May 22, 2024

AnalysisException/DATATYPE_MISMATCH error when generating summary dataframe from sources that have a column named "summary" #264

AnalysisException/DATATYPE_MISMATCH error when generating summary dataframe from sources that have a column named "summary" #264

Comments

artruk commented Apr 18, 2024

Expected Behavior

Current Behavior

Steps to Reproduce (for bugs)

Context

Your Environment

ronanstokes-db commented May 21, 2024

ronanstokes-db commented May 22, 2024