Offer Database Hosting Locally instead of AWS s3 #9156

williamAddison · 2023-08-02T18:54:35Z

What is the goal / desired outcome?

Pachyderm currently uploads things automatically to an AWS S3 database and maintains the data there. I notice that there is an option to route the output repo of a pipeline to a MySQL database but there isn't an option to host the entire database on a MySQL database or other alternative.

I bring this up as a security concern for people who don't want to use an outside service of a database and would prefer to host it themselves locally so that they could manage and protect their data their way. This would make pachyderm more service oriented instead of standalone.

If there is a way to accomplish this today via workaround, what does that require?

This would be a fairly simple workaround where instead of creating a pod that connects and manages a MySQL database instead of a AWS S3 gateway (There is already a POSTFRES pod in pachyderm that is used for AWS s3 connection and management and the easiest solution I could think of is to change the functionality of that)

(Optional) What is your proposal for a feature to solve this?

Environment?:

Kubernetes version (use kubectl version):
Pachyderm CLI version (use pachctl version):
Cloud provider (e.g. aws, azure, gke) or local deployment (e.g. minikube vs dockerized k8s):
OS (e.g. from /etc/os-release):
Others:

The text was updated successfully, but these errors were encountered:

williamAddison added the feature request label Aug 2, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Offer Database Hosting Locally instead of AWS s3 #9156

Offer Database Hosting Locally instead of AWS s3 #9156

williamAddison commented Aug 2, 2023

Offer Database Hosting Locally instead of AWS s3 #9156

Offer Database Hosting Locally instead of AWS s3 #9156

Comments

williamAddison commented Aug 2, 2023