PySemSeg

PySemSeg is a library for training Deep Learning Models for Semantic Segmentation in Pytorch. The goal of the library is to provide implementations of SOTA segmentation models, with pretrained versions on popular datasets, as well as an easy-to-use training loop for new models and datasets. Most Semantic Segmentation datasets with fine-grained annotations are small, so Transfer Learning is crucial for success and is a core capability of the library. PySemSeg can use visdom or tensorboardX for training summary visualialization.

Installation

Using pip:

pip install git+https://github.com/petko-nikolov/pysemseg

Models

FCN [paper] - FCN32, FCN16, FCN8 with pre-trained VGG16
UNet [paper]
Tiramisu (FC DenseNets)[paper] - FC DenseNet 56, FC DenseNet 67, FC DensetNet 103 with efficient checkpointing
DeepLab V3 [paper] - Multi-grid, ASPP and BatchNorm fine-tuning with pre-trained resnets backbone
DeepLab V3+ [paper]
RefineNet [paper] - [Upcoming ...]
PSPNet [paper] - [Upcoming ...]

Datasets

Pascal VOC
CamVid
Cityscapes [Upcoming ...]
ADE20K [Upcoming ...]

Train a model from command line

The following is an example command to train a VGGFCN8 model on the Pascal VOC 2012 dataset. In addition to the dataset and the model, a transformer class should be passed (PascalVOCTransform in this case) - a callable where all input image and mask augmentations and tensor transforms are implemented. Run pysemseg-train -h for a full list of options.

pysemseg-train \
   --model VGGFCN8 \
   --model-dir ~/models/vgg8_pascal_model/ \
   --dataset PascalVOCSegmentation \
   --data-dir ~/datasets/PascalVOC/ \
   --batch-size 4 \
   --test-batch-size 1 \
   --epochs 40 \
   --lr 0.001 \
   -- optimizer SGD \
   -- optimizer-args '{"weight_decay": 0.0005, "momentum": 0.9}' \
   --transformer PascalVOCTransform \
   --lr-scheduler PolyLR \
   --lr-scheduler_args '{"max_epochs": 40, "gamma": 0.8}'

or pass a YAML config

pysemseg-train --config config.yaml

model: VGGFCN32
model-dir: models/vgg8_pascal_model/
dataset: PascalVOCSegmentation
data-dir: datasets/PascalVOC/
batch-size: 4
test-batch-size: 1
epochs: 40
lr: 0.001
optimizer: SGD
optimizer-args:
    weight_decay: 0.0005
    momentum: 0.9
transformer: PascalVOCTransform
no-cuda: true
lr-scheduler: PolyLR
lr-scheduler-args:
    max_epochs: 40
    gamma: 0.8

Load and predict with a trained model

To use a checkpoint for inference you have to call load_model with a checkpoint, the model class and the transformer class used during training.

import torch.nn.functional as F
from pysemseg.transforms import CV2ImageLoader
from pysemseg.utils import load_model
from pysemseg.models import VGGFCN32
from pysemseg.datasets import PascalVOCTransform

model = load_model(
    './checkpoint_path', 
    VGGFCN32, 
    PascalVOCTransform
)

image = CV2ImageLoader()('./image_path')
logits = model(image)
probabilities = F.softmax(logits, dim=1)
predictions = torch.argmax(logits, dim=1)

Name		Name	Last commit message	Last commit date
Latest commit History 121 Commits
pysemseg		pysemseg
.gitignore		.gitignore
.pylintrc		.pylintrc
LICENSE		LICENSE
README.rst		README.rst
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

pysemseg

pysemseg

.gitignore

.gitignore

.pylintrc

.pylintrc

LICENSE

LICENSE

README.rst

README.rst

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

PySemSeg

Installation

Models

Datasets

Train a model from command line

Load and predict with a trained model

About

Releases

Packages

Languages

License

petko-nikolov/pysemseg

Folders and files

Latest commit

History

Repository files navigation

PySemSeg

Installation

Models

Datasets

Train a model from command line

Load and predict with a trained model

About

Topics

Resources

License

Stars

Watchers

Forks

Languages