Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hive table does not match column names present in the parquet data #603

Open
swathimocharla opened this issue Jan 11, 2022 · 0 comments
Open

Comments

@swathimocharla
Copy link

Hi,
We creating a hdfs sink connector based with a partition.field.name. This partition field is present in the middle of the avro data. HDFS Connector creates hive table structure as with the "PARTITIONED BY" as mentioned in the partition.field.name.

The issue here is, the location of the partition column in the parquet data file does not match with the hive table and this is causing wrong data to be read when accessing the particular column.

This seems to be working fine in the older versions of HDFS connect (5.5.2), was there any change made recently? We are seeing this issue on 10.1.1

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant