Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

filter out repositories with samples #227

Open
yegor256 opened this issue Mar 21, 2024 · 2 comments
Open

filter out repositories with samples #227

yegor256 opened this issue Mar 21, 2024 · 2 comments
Assignees
Labels
enhancement New feature or request help wanted Extra attention is needed

Comments

@yegor256
Copy link
Owner

yegor256 commented Mar 21, 2024

There are many popular repositories in Github, which contain Java tutorials and samples (for example: leeowenowen/rxjava-examples). They are rather popular, that's why they get into our dataset. Would be great to find a way to filter them out at the discover-repos.rb script. Maybe by some Github tags that they usually have.

Maybe we can use some ML/LLM techniques for such a filtering (relying on the description of repositories and the content of their README files)?

This can help, maybe:

@yegor256 yegor256 added enhancement New feature or request help wanted Extra attention is needed labels Mar 21, 2024
@h1alexbel
Copy link
Contributor

@yegor256 please assign me

@yegor256
Copy link
Owner Author

@h1alexbel go ahead!

h1alexbel added a commit to h1alexbel/cam that referenced this issue Apr 15, 2024
yegor256 added a commit that referenced this issue Apr 15, 2024
start(#227): fetch topics for repos too, document /steps in steps/README.md
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

2 participants