Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug]: When optimize fails to commit and then deletes a file, the optimize task will fail when retrying #910

Closed
1 task done
shendanfengg opened this issue Dec 9, 2022 · 1 comment · Fixed by #911 · May be fixed by #2482
Closed
1 task done
Assignees
Labels
priority:blocker security, data-loss, correctness, etc. type:bug Something isn't working

Comments

@shendanfengg
Copy link
Contributor

shendanfengg commented Dec 9, 2022

What happened?

When optimize fails to commit and then deletes a file, the optimize task will fail when retrying.
cgi-bin_mmwebwx-bin_webwxgetmsgimg_ MsgID=4164885277312037946 skey=@crypt_6bd6fcc2_fff7adbb1382a57fa2072b9d3eab009c mmweb_appid=wx_webfilehelper
We can see that the first commit failed and a file was deleted afterwards.
cgi-bin_mmwebwx-bin_webwxgetmsgimg_ MsgID=2928279045979481811 skey=@crypt_6bd6fcc2_fff7adbb1382a57fa2072b9d3eab009c mmweb_appid=wx_webfilehelper
Failed to get status for file: hdfs://nameservice1/user/hive/warehouse/realtime arctic v3.db/zl ual nonrealtime deviceprofile/hive/hdfs par=20221206/1670310415839 5957073387926017638/1-B-6002581311138640498-00000-9064569570000000001.parquet. caused by File does not exist: hdfs://nameservice1/user/hive/warehouse/realtime arctic v3.db/zl uanonrealtime device profile/hive/hdfs par=20221206/1670310415839 5957073387926017638/1-B-6002581311138640498-00000-906456957-0000000001.parquet
When optimize retries, optimize loads the deleted file again, causing the task to fail.

Affects Versions

0.3.x

What engines are you seeing the problem on?

AMS, Optimizer

How to reproduce

Don't know how to reproduce the problem

Relevant log output

No response

Anything else

No response

Code of Conduct

  • I agree to follow this project's Code of Conduct
@zhoujinsong
Copy link
Contributor

Overwrite hive table failed when committing iceberg table success and committing HMS failed.
We should ignore committing errors to HMS when committing iceberg successfully.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
priority:blocker security, data-loss, correctness, etc. type:bug Something isn't working
Projects
None yet
3 participants