glyh/JavSPn

 
 

JavSP

Jav Scraper Package

A JAV (Japanese Adult Video) scraper that aggregates metadata from multiple websites

By extracting the movie ID (bangou) from the file name, JavSP pulls data from multiple websites, merges it, classifies the movies according to predefined rules, and creates metadata for software such as Emby, Jellyfin, and Kodi.
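As a rough illustration of the first step, the sketch below pulls an ID out of a file name with a regular expression. It is not JavSP's actual recognizer: the function name `extract_movie_id` and the pattern (a short alphabetic prefix, an optional separator, then a run of digits) are assumptions chosen for the example, and real file names will need more rules than this.

```python
import re
from pathlib import Path
from typing import Optional

# Hypothetical pattern: 2-6 letters, an optional separator, then 2-5 digits.
# Real-world IDs come in more shapes than this simplified pattern covers.
_ID_PATTERN = re.compile(r"(?<![A-Za-z])([A-Za-z]{2,6})[-_ ]?(\d{2,5})(?!\d)")


def extract_movie_id(filename: str) -> Optional[str]:
    """Return a normalized ID such as 'ABC-123', or None if nothing matches."""
    stem = Path(filename).stem  # drop the file extension
    match = _ID_PATTERN.search(stem)
    if match is None:
        return None
    prefix, number = match.groups()
    return f"{prefix.upper()}-{number}"
```

Normalizing to upper case with a hyphen gives every downstream step one canonical form to work with, whatever the file was originally named.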

Docker & WebUI: Due to limited time and energy, there is no Docker support yet. A user interface is also not one of this project's primary goals. If you need Docker support, you may want to give JavSP-Docker a try.

Features That Differ from Upstream

  • Optionally crawls stage photos.
  • Supports cropping covers using either (1) face detection with YuNet or (2) body segmentation with PP-HumanSeg.
  • Uses cx_Freeze for packaging, which produces a smaller build.
  • Adheres to pip packaging standards, so it can also be used as a library.

Features

This is a non-exhaustive list of implemented and planned features; it is refined over time.

  • Recognize movie ID automagically
  • Dealing with movies separated into multiple parts
  • Summarize information from multiple sites to generate NFO file.
  • Daily automated tests for the website crawlers
  • Parallel data scraping
  • Downloading HD covers
  • AI based cover crop for atypical covers
  • Checking for new versions and self-updating
  • Translating titles and descriptions
  • Matching local subtitles
  • Using thumb to create folder cover
  • Keeping genre consistency across different websites
  • Different modes of operation (scrape and move, scrape only)
  • Optional: allow the user to intervene when there is a conflict during scraping.
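The "summarize information from multiple sites to generate NFO file" item can be pictured with a minimal sketch like the one below. It is an assumption-laden illustration, not JavSP's merge logic: the helper names `merge_metadata` and `to_nfo`, the "first non-empty value wins" rule, and the field names are all hypothetical. The only external convention it relies on is that Kodi-style movie NFO files use a `<movie>` root element.

```python
import xml.etree.ElementTree as ET
from typing import Dict, List, Optional


def merge_metadata(results: List[Dict[str, Optional[str]]]) -> Dict[str, str]:
    """Merge per-site results; earlier sites take priority (assumed rule)."""
    merged: Dict[str, str] = {}
    for result in results:
        for field, value in result.items():
            # Keep the first non-empty value seen for each field.
            if value and field not in merged:
                merged[field] = value
    return merged


def to_nfo(metadata: Dict[str, str]) -> str:
    """Serialize merged fields as a bare <movie> element."""
    root = ET.Element("movie")
    for field, value in metadata.items():
        ET.SubElement(root, field).text = value
    return ET.tostring(root, encoding="unicode")


# Two hypothetical site results: the first lacks a plot, the second fills it in.
site_a = {"title": "Title from site A", "plot": None}
site_b = {"title": "Title from site B", "plot": "Plot from site B"}
nfo = to_nfo(merge_metadata([site_a, site_b]))
```

Ordering the input list by site priority keeps the merge rule simple: whichever source is trusted most goes first, and later sources only fill gaps.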

Installation

  • For the impatient

    Visit the GitHub Releases page and download the latest portable version of JavSP (Windows only).

  • Building from source

    • Ensure you have Python >= 3.9

    • Run the following commands:

      git clone --recurse-submodules https://github.com/glyh/JavSPn.git
      cd JavSPn
      poetry install
      poetry run javspn
      

Usage

You can edit the configuration file config.ini to control how JavSP behaves.

JavSP also accepts CLI flags, which take priority over config.ini. Run JavSP -h to see a list of supported flags.
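The "flags override the config file" behavior follows a common pattern that can be sketched with the standard library. This is only an illustration of the precedence rule: the `[scan]` section, the `input_dir` option, and the `--input-dir` flag are hypothetical names, not JavSP's real settings (see JavSP -h for those).

```python
import argparse
import configparser
from typing import Dict, List, Optional


def load_settings(ini_text: str, argv: Optional[List[str]] = None) -> Dict[str, str]:
    """Read defaults from INI text, then let CLI flags override them."""
    config = configparser.ConfigParser()
    config.read_string(ini_text)
    # Hypothetical section name; start from whatever the file provides.
    settings = dict(config["scan"]) if config.has_section("scan") else {}

    parser = argparse.ArgumentParser()
    # default=None lets us tell "flag not given" apart from a real value.
    parser.add_argument("--input-dir", dest="input_dir", default=None)
    args = parser.parse_args(argv)

    # Only flags the user actually passed replace values from the file.
    for key, value in vars(args).items():
        if value is not None:
            settings[key] = value
    return settings


ini = "[scan]\ninput_dir = /from/ini\n"
from_file = load_settings(ini, [])
from_cli = load_settings(ini, ["--input-dir", "/from/cli"])
```

Using `None` as the flag default is the key design choice: a flag that was never passed leaves the file's value untouched.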

For more detailed instructions, please visit the JavSP Wiki.

Please file an issue if you run into any problems using this software. 😊

Bug report

If you encounter a bug that has not already been reported by other users (you can check by searching the issues page), don't hesitate to file an issue.

Contribution

No need to buy me a coffee, LoL. If you would like to help, you can do so in any of these ways:

  • Help write and improve the Wiki

  • Help complete the unit tests (not necessarily by coding; test cases and insightful observations are also welcome)

  • Help translate the genres

  • Open a pull request for a bug fix or new feature

  • Give me a star (Thank you!)

License

This project is subject to both the GPL-3.0 License and the Anti 996 License. In addition, using this software implies that you accept the following terms:

  • I will only use this software for academic purposes.

  • I won't advertise this project on any Chinese social media such as Weibo or WeChat.

  • I will follow local government regulations when using this software.

  • I will not monetize this software or make a profit from it.
