Ignore Approach and Implementation #52

Open
SensibleWood opened this issue Jun 17, 2022 · 0 comments
Labels
  • data quality: Problem with transposing source data that needs to be fixed
  • data: Issue relates to the tooling data collected from data sources
  • investigation required: Go do some work to assess what to do next

Comments

@SensibleWood
Collaborator

SensibleWood commented Jun 17, 2022

User Story

As a user of tooling-related data, I want to ignore anything that appears to be boilerplate code, unmaintained or archived.

As a maintainer of tooling-related data, I want to reduce the number of queries run against projects that appear to be boilerplate code, unmaintained or archived.

Detailed Requirement

Because the data collection approach is "coarse-grained", there is plenty of opportunity for "dross" to clutter up the tooling dataset. Some examples:

  • Tools removed from sources.
  • Dead repositories with no history or anything particularly interesting about them.
  • Repositories with zero stars that have not changed in eons.
  • Repositories that are tagged as OpenAPI-related but have nothing to do with OpenAPI.

We therefore need to decide on:

  • The policy for ignoring this stuff.
  • An implementation in the gulp build to sift it out.
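
As a starting point for discussion, the examples above could be expressed as a small predicate that a gulp task calls on each tool record. This is only a sketch under stated assumptions: the field names (`foundInSources`, `archived`, `stars`, `lastCommit`, `name`, `description`), the thresholds in `defaultPolicy` and the function names are all hypothetical and do not come from the existing build. The keyword check in particular is a crude stand-in for "nothing to do with OpenAPI".

```javascript
// Hypothetical ignore policy. Every threshold and field name below is an
// assumption for illustration; none of it is defined by the issue or the build.
const defaultPolicy = {
  maxInactiveDays: 730, // "not changed in eons" read here as roughly two years
  minStars: 1,          // fewer stars than this counts as "zero stars"
  keywords: ['openapi', 'swagger'],
};

const DAY_MS = 24 * 60 * 60 * 1000;

// Returns the reasons a tool record should be ignored.
// An empty array means the record is kept.
function ignoreReasons(tool, policy = defaultPolicy, now = new Date()) {
  const reasons = [];

  // Tool no longer appears in any of the sources it was collected from.
  if (tool.foundInSources === false) reasons.push('removed-from-sources');

  // Repository is flagged as archived.
  if (tool.archived === true) reasons.push('archived');

  // Too few stars and no recent change. A missing or invalid date yields NaN,
  // which is treated as "unknown" rather than stale.
  const inactiveDays = (now - new Date(tool.lastCommit)) / DAY_MS;
  const stale = Number.isFinite(inactiveDays) && inactiveDays > policy.maxInactiveDays;
  if (stale && (tool.stars ?? 0) < policy.minStars) reasons.push('stale-and-unstarred');

  // Crude relevance heuristic: no policy keyword in the name or description.
  const text = `${tool.name ?? ''} ${tool.description ?? ''}`.toLowerCase();
  if (!policy.keywords.some((keyword) => text.includes(keyword))) {
    reasons.push('possibly-unrelated');
  }

  return reasons;
}

// Splits a list of tool records into those to keep and those to ignore,
// recording the reasons so the decision can be reviewed later.
function partitionTools(tools, policy = defaultPolicy, now = new Date()) {
  const kept = [];
  const ignored = [];
  for (const tool of tools) {
    const reasons = ignoreReasons(tool, policy, now);
    if (reasons.length === 0) kept.push(tool);
    else ignored.push({ tool, reasons });
  }
  return { kept, ignored };
}
```

Returning reasons rather than a bare boolean keeps the policy auditable: the ignored records could be written to a separate file so that false positives are easy to spot, and records already in that file could be skipped on later runs to reduce the number of queries.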

SensibleWood added the data quality, investigation required and data labels on Jun 17, 2022