Automatic labeling tool for image segmentation based on a detection-labeled dataset. Additionally provides tools for fine-tuning a smoke segmenter.

Smoke Segmentation Project

Smoke segmentation data is scarce online, and the datasets that do exist are either poorly segmented or limited in size. The objective of this repository is to facilitate the creation of an expansive image dataset specifically tailored for smoke segmentation tasks, leveraging a semi-automatic labeling approach based on the foundation model SAM. The repository offers a comprehensive suite of tools, notably including a segmentation UI, designed for an oracle to filter out (or blacklist) poorly predicted pseudo-ground-truth segmentation masks. Moreover, it encompasses methodologies and tools essential for training smoke segmentation models.

Fine-Tuned Models

Two models have been trained on our smoke segmentation dataset to compare their effectiveness on the task of smoke segmentation.

Results

Model             Pre-trained on   mIU      Smoke IU   FPS
BiSeNet (R18)     Cityscapes       80.81%   69.48%     21.4
PIDNet (Small)    CamVid           81.64%   70.69%     25.0
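
For context, mean IU is the per-class intersection-over-union averaged over the dataset's classes, while the Smoke IU column is the IU of the smoke class alone. Below is a minimal Python sketch of that computation from a confusion matrix; it is an illustration, not the repository's evaluation code, and the numbers are made up.

import numpy as np

def iou_per_class(conf_mat: np.ndarray) -> np.ndarray:
    # conf_mat[i, j] counts pixels of ground-truth class i predicted as class j.
    tp = np.diag(conf_mat)
    fp = conf_mat.sum(axis=0) - tp
    fn = conf_mat.sum(axis=1) - tp
    return tp / np.maximum(tp + fp + fn, 1)

# Toy 2-class example (background, smoke); values are illustrative only.
conf = np.array([[900, 50],
                 [ 30, 120]])
iou = iou_per_class(conf)
print(f"smoke IU: {iou[1]:.4f}, mean IU: {iou.mean():.4f}")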

Details - Demo

The resulting predictions, along with a metrics diagram, follow below, showcasing the BiSeNet (R18 backbone) model's performance and its training potential.



Figure 1. The left image shows the input, the middle image is the combination of the input with the predicted segmentation mask, and the right image is the (pseudo) ground truth mask acquired by SAM.


Figure 2. Mean IU plot computed on the test set.

Specs & OS

All the experiments referenced were conducted on a system equipped with an Intel i5-12500H CPU, 38.9 GB of memory, and an RTX 4060 GPU (laptop version), running Ubuntu 22.04.3 LTS.

D-Fire - External Dataset

[D-Fire] is an image dataset of fire and smoke occurrences with more than 21,000 images, designed for machine learning and object detection algorithms. The bounding box labels are stored inside .txt files in YOLO format (class + cxcywh).
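
For illustration, here is a minimal Python sketch (not part of this repository) that parses one such label file and converts the normalized cxcywh boxes to pixel xyxy coordinates; the image size arguments come from the corresponding JPG.

from pathlib import Path

def read_yolo_boxes(label_path, img_w, img_h):
    # Each line of a YOLO label file is "<class> <cx> <cy> <w> <h>",
    # with box coordinates normalized to [0, 1].
    boxes = []
    for line in Path(label_path).read_text().splitlines():
        if not line.strip():
            continue
        cls, cx, cy, w, h = line.split()
        cx, w = float(cx) * img_w, float(w) * img_w
        cy, h = float(cy) * img_h, float(h) * img_h
        boxes.append((int(cls), cx - w / 2, cy - h / 2, cx + w / 2, cy + h / 2))
    return boxes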

S-Smoke

We define S-Smoke as any dataset consisting of D-Fire's images together with segmentation ground truth masks produced for the task of image smoke segmentation, where the masks are generated by a pre-trained [SAM] model prompted with the bounding box labels of the D-Fire dataset. An oracle then filters out bad predictions.
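
As a sketch of the general idea, the segment-anything package can be prompted with one of D-Fire's boxes (in pixel xyxy format) to obtain a pseudo-ground-truth mask. The checkpoint path, model type, and file names below are assumptions; this is not the repository's labeling pipeline.

import cv2
import numpy as np
from segment_anything import sam_model_registry, SamPredictor

# Assumed checkpoint location/type; adjust to the SAM weights you downloaded.
sam = sam_model_registry["vit_h"](checkpoint="./models/sam_vit_h_4b8939.pth")
predictor = SamPredictor(sam)

image = cv2.cvtColor(cv2.imread("smoke_example.jpg"), cv2.COLOR_BGR2RGB)
predictor.set_image(image)

# One detection box from D-Fire, already converted to pixel xyxy coordinates.
box = np.array([120, 80, 400, 300])
masks, scores, _ = predictor.predict(box=box, multimask_output=False)
pseudo_mask = masks[0]  # boolean (H, W) array: the candidate smoke mask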

Our produced dataset is named S-Smoke-var0 and consists of:

  • 2400 training instances
  • 287 test instances

Data Directory Structure

This is the data's directory structure, located in ./datasets.

datasets
├── data_info
├── D-Fire
|   ├── test
|   |   ├── images
|   |   └── det_labels
|   └── train
|       ├── images
|       └── det_labels
└── S-Smoke
    ├── curated
    |   ├── test
    |   |   ├── images
    |   |   ├── seg_labels
    |   |   └── combined
    |   └── train
    └── raw
        ├── test
        |   ├── images
        |   ├── seg_labels
        |   └── combined
        └── train
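
As a quick sanity check (an illustration, not a provided tool), the raw split can be inspected as follows, assuming the collapsed train directories mirror the test layout and that images keep the .jpg extension.

from pathlib import Path

root = Path("./datasets/S-Smoke/raw")
for split in ("train", "test"):
    n_images = len(list((root / split / "images").glob("*.jpg")))
    n_labels = len(list((root / split / "seg_labels").iterdir()))
    print(f"{split}: {n_images} images, {n_labels} segmentation labels")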

Model Directory Structure

This directory structure has to be built manually.

models
├── PIDNet
|   ├── finetuned
|   └── pretrained
|       └── PIDNet_S_Camvid_Test.pt
└── TorchSeg
    ├── finetuned
    └── pretrained
        └── cityscapes-bisenet-R18.pth

Download the pre-trained weights and place them in their corresponding directories.

Preparation

Directory

The developer has to manually create the preceding directory structures prior to using any of the provided tools. The D-Fire directory has to be filled with the corresponding files from the dataset. D-Fire's labels and images are expected to be in text and JPG formats, named with the corresponding .txt and .jpg file extensions. Download and decompress the D-Fire dataset.
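
One possible way to script the skeleton creation (a sketch, not a provided tool) is with pathlib, mirroring the trees listed above and assuming the collapsed train splits match the test splits.

from pathlib import Path

dataset_dirs = ["datasets/data_info"]
dataset_dirs += [f"datasets/D-Fire/{split}/{sub}"
                 for split in ("train", "test") for sub in ("images", "det_labels")]
dataset_dirs += [f"datasets/S-Smoke/{state}/{split}/{sub}"
                 for state in ("raw", "curated") for split in ("train", "test")
                 for sub in ("images", "seg_labels", "combined")]
model_dirs = [f"models/{family}/{kind}"
              for family in ("PIDNet", "TorchSeg") for kind in ("finetuned", "pretrained")]

for d in dataset_dirs + model_dirs:
    Path(d).mkdir(parents=True, exist_ok=True)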

This is what the decompressed D-Fire directory will look like initially:

D-Fire
├── test
|   ├── labels
|   └── images
└── train
    ├── labels
    └── images

To align the paths with the currently provided scripts, rename the directories named labels to det_labels.
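
For example (one possible way, not a provided script):

from pathlib import Path

dfire_root = Path("./datasets/D-Fire")
for split in ("train", "test"):
    labels_dir = dfire_root / split / "labels"
    if labels_dir.is_dir():
        # Rename e.g. D-Fire/train/labels to D-Fire/train/det_labels.
        labels_dir.rename(dfire_root / split / "det_labels")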

Now download a pretrained SAM model through [link] and place it inside the ./models directory.

Dependencies

Run:

sudo apt-get install ninja-build
python3 -m pip install -r requirements.txt

Finally install [Apex].

Next Steps

First, visit [data_tools] to perform the semi-automatic annotation, carefully following all instructions in the corresponding README.md. After that, you will have your own S-Smoke dataset and can train your own segmentation models through [primary_segmenter].

Citation