New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Batch import from CSV via client #161
Comments
Happy to take this on, unless @binoychitale feels its an organic progression of their work. |
No I think you can pick this :) |
Thanks @davidkuhta! Here's what I'm thinking for CSV formats, but let me know if you disagree or otherwise think there's a better way to structure things. We'd support two CSV formats. One for batch inserting vertices and vertex properties, and one for batch inserting edges and edge properties. VerticesFormat: e.g.:
Would create vertices with properties EdgesFormat: e.g.:
Would create an edge with a |
@ysimonson agree with the overall concept, but a couple of thoughts came to mind:
|
For this we'd use the bulk insert API. Datastores provide different guarantees on validation and transactionalization. Good point on (3), since most datasets don't use UUIDs. I think right now the keys in the CSV can realistically only be UUIDs. For users that don't have UUID keys, they'll need to generate a new CSV with them, and maintain a mapping of key -> UUID. I've opened an issue to better support this though. It's not clear to me what the best solution is yet, but does seem like something that should be addressed. |
Once #157, it'd be great to add support for batch importing from a CSV file. CSV would act as a decent lowest-common-denominator format. In the future, other formats could be added as well (e.g. RDF XML) based off demand.
The text was updated successfully, but these errors were encountered: