tag_image

AI image tagging and captioning, using mmdetection framework and BLIP.

install

requires python 3.9+

pip3 install -r ./requirements
# for reference, this project has been developed at ecac3a77becc63f23d9f6980b2a36f86acd00a8a, so checkout this specific commit if you have problems...
git clone https://github.com/open-mmlab/mmdetection
pip3 install -e ./mmdetection

usage

./tag_image.py --help
usage: tag_image.py [-h] [--img_path IMG_PATH] [--config_path CONFIG_PATH]
                    [--tag_threshold TAG_THRESHOLD] [--server SERVER]
                    [--add_caption]

tags images using mmdetection and add caption using BLIP.

options:
  -h, --help            show this help message and exit
  --img_path IMG_PATH   path to the input image, ignored if --server is
                        specified.
  --config_path CONFIG_PATH
                        path to the configuration json
  --tag_threshold TAG_THRESHOLD
                        tagging_detection threshold, discard results with
                        score < threshold, default=0.7f
  --server SERVER       start server on the given iface:port, if set --img
                        is ignored
  --add_caption         also generate caption for the image. ignored if
                        --server is specified.

configuration

the default configuration can be edited to use other models.

commandline

./tag_image.py --tag_threshold 0.5 --img_path ./sample_images/tennis.jpg --add_caption
....
{'status': 'success', 'time_msec': 1684269941755, 'data': {'tags': {'person': 2, 'sports_ball': 1, 'tennis_racket': 1}, 'caption': 'two pictures of a man holding a trophy and a tennis ball', 'file_name': 'tennis.jpg'}}

through REST api

using REST api server, start server first

# http://localhost:8080/docs for docs
./tag_image.py --server 0.0.0.0:8080

then call process_image endpoint

curl -L -F "threshold=0.5" -F "add_caption=True" -F "job_id=prova" -F "file_name=ducks.jpg" -F "file=@./sample_images/ducks.jpg" http://127.0.0.1:8080/process_image
{"status":"success","time_msec":1684269861711,"data":{"tags":{"bird":2},"file_name": "ducks.jpg", "caption":"ducks standing on a rock near the water and a river"},"req_id":"1684269859207291701","job_id":"prova"}

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
api		api
models		models
sample_images		sample_images
.gitignore		.gitignore
README.md		README.md
config.json		config.json
requirements.txt		requirements.txt
tag_image.py		tag_image.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

api

api

models

models

sample_images

sample_images

.gitignore

.gitignore

README.md

README.md

config.json

config.json

requirements.txt

requirements.txt

tag_image.py

tag_image.py

Repository files navigation

tag_image

install

usage

configuration

commandline

through REST api

About

Releases

Packages

Languages

valerino/ai_tag_images

Folders and files

Latest commit

History

Repository files navigation

tag_image

install

usage

configuration

commandline

through REST api

About

Topics

Resources

Stars

Watchers

Forks

Languages