-
Notifications
You must be signed in to change notification settings - Fork 284
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Drop anemone and use Spidr for Repo discovery #10947
base: master
Are you sure you want to change the base?
Conversation
Doesn't look crazy? Only dep is nokogiri, which we have anyway, tested on modern rubies. Why not. |
cc6dee6
to
934292d
Compare
I am seeing significant performance difference with what's on the PR vs the existing workflow..Looking at ways to speed this up..Will push updates when I get the performance sorted. Update: Should be good to go with latest commit. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Implementation wise I think you should separate the crawler and Docker search into separate classes. Perhaps even the file crawl as well. Right now it's confusing.
app/lib/katello/repo_discovery.rb
Outdated
|
||
def process_page_urls(urls) | ||
urls.each do |url| | ||
url = url.to_s |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this is a bad idea. The URL object is way more valuable and I'd only add it to @to_follow
with to_s
.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
We don't necessarily use the URL object properties once we have the list of URLs in the page. It's just matching strings after that point..Becomes easier to pass that around and work with it IMO?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated it in latest push to pass URL object.
To Do:
What are the changes introduced in this pull request?
Considerations taken when implementing this change?
What are the testing steps for this pull request?
bundle install
Go to Content > Product > Repo discovery
Run repo discovery.