[Feature] Image Scraping #3748

Open
myalow opened this issue May 21, 2023 · 3 comments · May be fixed by #4422

Comments

@myalow
Contributor

myalow commented May 21, 2023

Is your feature request related to a problem? Please describe.
For a while, I've been considering using stash as a replacement for Hydrus, but the main thing standing in my way is that there's no way for me to scrape tags for images.

Describe the solution you'd like
I'd like to see images on parity with scenes and performers, where I can scrape them both through a Stash-box instance, as well as local scrapers. I have ideas on how it could be handled on stash-box's side, but that's a separate issue. I do, however, think MD5 hashes will work fine in lieu of PHashes for images.

Describe alternatives you've considered
There really is no alternative besides writing an external script to semi-automate this, lol.

Additional context
My main use would be for hentai, where most scrapers can just search md5:<hash> for whatever image on a booru. That's not as much of an option when it comes to how IRL media is catalogued & distributed, which is why I think Stash-Box integration would still be beneficial. Perhaps for images, stashbox can have a stashID for a given image, and then a list of MD5 hashes and tags tied to said stashID? Again though, for my use-case, I just want to be able to scrape images from hashes en masse. I welcome any replies that can build upon how I see this being implemented.
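To make the idea above concrete, here is a minimal Python sketch of the two pieces described: hashing a local image to build an `md5:<hash>` booru search term, and a record that ties one stashID to a list of MD5 hashes and tags. Every name here (`md5_of_file`, `booru_md5_query`, `ImageEntry`, `lookup_by_md5`) is hypothetical and only for illustration; none of it is an existing Stash or stash-box API.

```python
import hashlib
from dataclasses import dataclass, field
from pathlib import Path
from typing import Iterable, Optional


def md5_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex MD5 digest of a file, reading it in chunks."""
    digest = hashlib.md5()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def booru_md5_query(md5_hex: str) -> str:
    """Build the md5:<hash> search term that many boorus accept."""
    return f"md5:{md5_hex}"


@dataclass
class ImageEntry:
    """Hypothetical stash-box record: one stashID, many file hashes, shared tags."""
    stash_id: str
    md5_hashes: set = field(default_factory=set)
    tags: set = field(default_factory=set)


def lookup_by_md5(entries: Iterable[ImageEntry], md5_hex: str) -> Optional[ImageEntry]:
    """Return the first entry that lists this MD5 hash, or None if there is none."""
    for entry in entries:
        if md5_hex in entry.md5_hashes:
            return entry
    return None
```

Scraping en masse would then amount to hashing every image in a folder and running each hash through a lookup like this, or through the `md5:<hash>` search of a booru.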

@ghost

ghost commented May 22, 2023

This would be the continuation of #2885.
But I personally stopped working on it. Maybe someone will pick it up one day.

As someone who uses Stash mostly for images, I think you're probably better off with Hydrus for now.
Most of the work on Stash goes toward video, and it quickly becomes frustrating to use it for images, on mobile for example.
Image scraping could be used to get data from boorus, or simply from a social media post, but neither of those use cases is targeted by Stash (so far).
I also had to write a lot of custom CSS to make it semi-decent to use, as the card mode really isn't made for browsing and the wall view doesn't let you see image tags easily. That was the purpose of this PR: #2970

But all of this is a matter of opinion, of course. Just know that I have been told countless times on Discord that there is almost no benefit to image scraping if you consider Stash to be made only for galleries and scenes. Image is just a gallery browser without the grouping.
Finally, I made a PR (merged, #2837) that allows custom localization.
With that you can flavor your Stash instance with h-themed terms, like having "characters" with "races" instead of "performer" and "nationality".

@Dounial

Dounial commented Oct 4, 2023

I might have misunderstood your exact use case, but if you want to scrape pictures by looking for similar ones, you could maybe modify one of these extensions:
Which browser do you use? On Chrome and Chromium-based browsers you can use this:
Chrome (&Chromium)
On Firefox you can use this:
Firefox
And if you really use Edge (which I doubt) you can use this:
Edge

If you combine it with another extension that launches an external application (my recommendation below):
External Application Launcher for Chromium
you could easily make a scraper out of it with some coding skills. It would be even better because then it would be your own code: you could share it, and you would know exactly what it does since you wrote it. :)

Hope this helps at least a bit. If it's not welcome, just ask and I will delete this comment.

@toddhow

toddhow commented Dec 19, 2023

Personally I just want the necessary functions and endpoints to be made that would allow the development of 3rd party scrapers.

@DogmaDragon DogmaDragon linked a pull request Feb 18, 2024 that will close this issue
3 participants