Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enhanced error handling for HiveWriter #358

Open
hannesmiller opened this issue Oct 5, 2017 · 3 comments
Open

Enhanced error handling for HiveWriter #358

hannesmiller opened this issue Oct 5, 2017 · 3 comments

Comments

@hannesmiller
Copy link
Contributor

An enhancement request if possible...

When an exception is thrown from the underlying format writer (Parquet, Orc) ... it would be nice if we could trap the exception higher up the stack in the HiveSink and report the offending column and rethrow with the column in the message.

This would greatly help when trying to find data issues coming from the source (JdbcSource).

@sksamuel
Copy link
Contributor

sksamuel commented Oct 5, 2017

What happens at the moment ?

@hannesmiller
Copy link
Contributor Author

Just to be clear it's NOT an EEL bug.

Whatever the Parquet exception is...it is propagated up the call stack.

In this case I think it gave a number format exception showing the value which is a string - if I knew which column it was I could the query Oracle to find the offending value and check the type.

There are other cases where it's failing to convert a Timestamp and showing the value but in this case there around 6 Timestamp columns in the source.

Something for the future maybe - It's a potential time saver.😄

@sksamuel
Copy link
Contributor

sksamuel commented Oct 5, 2017

Oh I see, improve the error message.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants