Failures in OpenlineageIn Function #192
Comments
@Kishor-Radhakrishnan I apologize for the delay in getting back to you. You can enable a configuration setting that will remove the spark-plan if it exceeds a certain size. Set If you don't care to see the spark plan text / json in your properties, you could set maxQueryPlanSize even smaller to ensure you always get lineage events through even when you have a large number of inputs (that take up more bytes in the message going to event hub).
@wjohnson I tried setting it to a much lower value (10000). We are still seeing failures.
Just for the sake of testing, can you make it a much smaller size @Kishor-Radhakrishnan? Try setting it to 50 and let us know the outcome.
I tried that. We are still seeing many errors.
Would you be able to share the latest logs after adding this setting? You should see something like this in the OpenLineageIn logs |
Yes, I am seeing that in the logs. But it still looks like many events are exceeding the Event Hub limits.
Latest exception logs |
@Kishor-Radhakrishnan thank you for your patience! These last logs helped us identify an error in the OpenLineageIn code that removed the spark plan in one variable but failed to remove the spark plan in another variable. That other variable was the one actually sending data to Event Hub! I've put the changes in this branch https://github.com/microsoft/Purview-ADB-Lineage-Solution-Accelerator/tree/hotfix/maxQueryPlanOLIn Would you be able to build this branch and deploy to your environment and confirm that maxQueryPlanSize is being respected for OpenLineageIn and PurviewOut? Thank you again for all of your patience.
Unfortunately, we still have many failures with the same issue, although the failure count appears to have gone down. 4/19/2023, 3:51:51.4891988 PM (Local time) Result: Error in OpenLineageIn function: The message (id:232abc39-61f0-45c6-8644-f53d68c84ecd, size:36280210 bytes) is larger than is currently allowed (1048576 bytes). (eventhubmaestroadbpct6)
Hi @Kishor-Radhakrishnan, we are facing a similar issue with Spark jobs in my organization. Did you manage to make it work? If yes, how? Thanks
This will be fixed in the next release, where we will remove the spark plan and then the column lineage information if the payload is still larger than the 1 MB payload limit. There will be future consideration for reducing mount points as in #219. It's still possible that there will be sections of the payload that result in too much information such as:
But only the mount points issue has been encountered so far. It still needs to be determined how to solve the mount point issue.
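For readers following along, the order described above (drop the spark plan first, then column lineage, only while the payload is still over the limit) could be sketched roughly as below. This is an illustration only, not the accelerator's actual code: `shrink_event` and `payload_size` are hypothetical helpers, the facet locations (`run.facets["spark.logicalPlan"]` and `outputs[].facets["columnLineage"]`) are assumptions about where those sections sit in the event, and the size measured here is the serialized JSON body, which may differ slightly from the message size Event Hub reports.

```python
import json

# Limit quoted in the error message earlier in this thread.
EVENT_HUB_LIMIT_BYTES = 1_048_576


def payload_size(event: dict) -> int:
    """Size in bytes of the event serialized as UTF-8 JSON."""
    return len(json.dumps(event).encode("utf-8"))


def shrink_event(event: dict, limit: int = EVENT_HUB_LIMIT_BYTES) -> dict:
    """Return a copy of the event, dropping optional sections until it fits.

    Hypothetical helper: the facet names below are assumptions, not
    taken from the accelerator's source.
    """
    slim = json.loads(json.dumps(event))  # deep copy via a JSON round trip

    if payload_size(slim) <= limit:
        return slim

    # Step 1: drop the spark plan (assumed to live under run.facets).
    slim.get("run", {}).get("facets", {}).pop("spark.logicalPlan", None)
    if payload_size(slim) <= limit:
        return slim

    # Step 2: drop column lineage from every output (assumed facet name).
    for output in slim.get("outputs", []):
        output.get("facets", {}).pop("columnLineage", None)

    # The event may still be too large here, e.g. because of mount points.
    return slim
```

Note that the function can still return an oversized event, which matches the open mount point question above; a caller would need to check `payload_size` again before sending.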
We have implemented Purview in many Databricks workspaces, and some lineage is missing in the UI. While troubleshooting, we can see that the function app is failing multiple times because of the Event Hub message size limit. We suspect this is causing the lineage gaps.
Is there any option to overcome this Event Hub limitation so that lineage events are not lost?