Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature request]: Standardized data structure for datasets #97

Open
1 of 6 tasks
laserkelvin opened this issue Jan 13, 2024 · 0 comments
Open
1 of 6 tasks

[Feature request]: Standardized data structure for datasets #97

laserkelvin opened this issue Jan 13, 2024 · 0 comments
Labels
code maintenance Issue/PR for refactors, code clean up, etc. data Issues related to data loading, pipelining, etc. enhancement New feature or request

Comments

@laserkelvin
Copy link
Collaborator

Feature/behavior summary

A consistent, standardized data structure would make new datasets significantly easier to implement and maintain, as well as easier for model and task development by setting reasonable expectations of attribute names, etc.

Request attributes

  • Would this be a refactor of existing code?
  • Does this proposal require new package dependencies?
  • Would this change break backwards compatibility?
  • Does this proposal include a new model?
  • Does this proposal include a new dataset?
  • Does this proposal include a new task/workflow?

Related issues

#89 was where some of these discussions were had, and originated from #85

Solution description

There are two possible ways of implementing this: a flat DataSample structure which may comprise a graph or point cloud, leaving it a little ambiguous; a base AbstractDataSample class, and have PointCloudSample and GraphSample structures.

Not 100% sure how batching will look yet, but perhaps a Batch structure should also be introduced.

Additional notes

No response

@laserkelvin laserkelvin added enhancement New feature or request data Issues related to data loading, pipelining, etc. code maintenance Issue/PR for refactors, code clean up, etc. labels Jan 13, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
code maintenance Issue/PR for refactors, code clean up, etc. data Issues related to data loading, pipelining, etc. enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant