Demo Install - DB compute setup requires addition steps #206

Open
batemansogq opened this issue Jun 7, 2023 · 2 comments
Labels
bug Something isn't working

Comments

@batemansogq

Describe the bug
The Demo install process requires additional manual steps before the Databricks (DB) cluster will run.

To Reproduce
Steps to reproduce the behavior:

  1. Update Settings.sh to include a unique Purview name (otherwise the install hits the "already exists" issue).
  2. Run the Demo installation shell script.
  3. The DB cluster fails to start with an init script "not found" error.

Steps to fix:

  1. Update Settings.sh to include a unique Purview name (otherwise the install hits the "already exists" issue).
  2. Run the Demo installation shell script.
  3. Using the DB CLI, upload the init script and jar as per the connector instructions: https://github.com/microsoft/Purview-ADB-Lineage-Solution-Accelerator/blob/main/deploy-base.md#install-openlineage-on-your-databricks-cluster
  4. Update the DB cluster Libraries to include the Maven mssql-connector.
  5. Update the storageServiceName & storageContainerName values in the DB "abfss-in-abfss-out-sample", as the references don't work within the current set.
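
For anyone repeating fix steps 3 and 4, the CLI calls might look roughly like the sketch below. It is a dry run that only prints the commands, and it assumes the legacy Databricks CLI (`databricks fs`, `databricks libraries install`). The DBFS folder, local file names, cluster ID and Maven coordinates are all placeholders I made up for illustration, not values from the demo deployment; check the connector instructions linked in step 3 for the real paths.

```shell
#!/usr/bin/env bash
# Dry-run sketch: prints the Databricks CLI commands instead of running them.
# Every default below is a hypothetical placeholder, not taken from the demo.
CLUSTER_ID="${CLUSTER_ID:-YOUR_CLUSTER_ID}"
DBFS_DIR="${DBFS_DIR:-dbfs:/databricks/openlineage}"
INIT_SCRIPT="${INIT_SCRIPT:-./open-lineage-init-script.sh}"
JAR="${JAR:-./openlineage-spark.jar}"
MAVEN_COORDS="${MAVEN_COORDS:-GROUP:ARTIFACT:VERSION}"

print_fix_commands() {
  # Step 3: create the target folder, then upload the init script and the jar.
  echo "databricks fs mkdirs ${DBFS_DIR}"
  echo "databricks fs cp --overwrite ${INIT_SCRIPT} ${DBFS_DIR}/"
  echo "databricks fs cp --overwrite ${JAR} ${DBFS_DIR}/"
  # Step 4: attach the mssql connector to the cluster as a Maven library.
  echo "databricks libraries install --cluster-id ${CLUSTER_ID} --maven-coordinates ${MAVEN_COORDS}"
}

print_fix_commands
```

Set the variables in the environment, review the printed commands, then run them by hand once the values match your workspace.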

Expected behavior
The Demo installation should work as per the instructions.

Logs

Spark config: https://gist.github.com/batemansogq/c29c2bcfb04b3e966fad5ac4648feb1d

Screenshots
[Screenshot: demo install - DB failure]

[Screenshot: demo install - settings update]

[Screenshot: demo install - spark config]

Desktop: I have the standard MS build.

Additional context
This work has been completed in the MS non-prod tenancy; reach out to me via email for access.

batemansogq added the "bug" label Jun 7, 2023
@wjohnson
Collaborator

wjohnson commented Jun 8, 2023

Hi, @batemansogq - Thank you for using the solution accelerator! Would you help me understand what you believe fixed your cluster start issue?

It looks like you took three steps:

  1. Added purviewName to the settings.sh
  2. Updated the DB cluster Libraries to include the Maven mssql-connector
  3. Updated the storageServiceName & storageContainerName values in the DB "abfss-in-abfss-out-sample", as the references don't work within the current set

However, I'm not certain how these would have affected your cluster initialization.

Thank you for any additional feedback.

@batemansogq
Author

batemansogq commented Jun 9, 2023 via email
