Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DATA: Request for Real World Datasets and Pipelines To Test Our Filters #716

Open
nyoungbq opened this issue Oct 5, 2023 · 0 comments
Open
Labels
Data Involving Datasets for testing good first issue Good for newcomers help wanted Extra attention is needed

Comments

@nyoungbq
Copy link
Contributor

nyoungbq commented Oct 5, 2023

We always need datasets and pipelines to use in test cases to try to identify bugs and better optimize bottlenecks for real world use cases.

PLEASE NOTE THAT PROVIDED DATASETS AND PIPELINES WILL BE OPENSOURCE AS THEY ARE PUBLICLY AVAILABLE IN OUR REPOSITORY

Steps for Submitting:

  1. Create a branch on your fork of the repository named data/data_submission
  2. Add a new file named SUBMISSION.md at the root level and add the following:
# Data Submission

Name: [your-name-here]
DataSet: [link to where we can find the Data] <- leave blank if not applicable
Pipeline: [link to where we can find the Pipeline] <- leave blank if not applicable

Information:
write a short paragraph about what it is, what its for, how it should be used, etc.
  1. Create a pull request from your branch to our repository | Create a Pull Request From Fork
  2. In the description of the PR add information about where the dataset/pipeline came from applications and acknowledgement that the data will be made public such as
I hereby acknowledge that the information is mine or I have received permission from the owner and I provide it with the understanding it will be made public.
@nyoungbq nyoungbq added help wanted Extra attention is needed Data Involving Datasets for testing good first issue Good for newcomers labels Oct 5, 2023
@nyoungbq nyoungbq changed the title DATA: Request for Real World Datasets and Pipelines To Test Our Filters With DATA: Request for Real World Datasets and Pipelines To Test Our Filters Oct 5, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Data Involving Datasets for testing good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

1 participant