In paper SpineNet was trained with three protocols. I only trained with protocol B with 8 Tesla V100 gpus.
- protocol A: random scale between[0.8,1.2];250 epochs;DropBlock
- protocol B: random scale between[0.5,2.0];350 epochs
- protocol C: random scale between[0.5,2.0];500 epochs;stochastic depth;Swish
Backbone | Resolution | my code(B) | paper(A) | paper(B) | paper(C) | Training Time |
---|---|---|---|---|---|---|
SpineNet-49S | 640x640 | 39.2 | —— | 39.9 | 41.5 | 3d 5h |
SpineNet-49 | 640x640 | 42.1 | 40.8 | 42.8 | 44.3 | 4d 2h |
SpineNet-49 | 896x896 | 44.9 | —— | 45.3 | 46.7 | 6d 16h |
SpineNet-96 | 1024x1024 | 46.9 | —— | 47.1 | 48.6 | 11d 7h |
SpineNet-143 | 1280x1280 | 49.2 | —— | 48.1 | 50.7 | 24d 9h |
Backbone | Resolution | box AP(B) | mask AP(B) | paper box AP(B) | paper mask AP(B) | Training Time |
---|---|---|---|---|---|---|
SpineNet-49S | 640x640 | 39.7 | 34.9 | 39.3 | 34.8 | 6d 13h |
SpineNet-49 | 640x640 | 43.3 | 37.8 | 42.9 | 38.1 | 7d 18h |
SpineNet-96 | 1024x1024 | 47.0 | 41.2 | 47.2 | 41.5 | 14d 21h |
SpineNet-143 | 1280x1280 | 48.3 | 41.3 | 48.8 | 42.7 | 26d 19h |
- SpineNet-49S 640
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.392 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.587 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.419 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.209 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.424 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.556 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.326 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.520 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.552 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.332 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.599 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.732
- SpineNet-49 640
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.421 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.621 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.452 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.242 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.458 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.581 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.342 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.545 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.576 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.372 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.625 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.746
- SpineNet-49 896
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.449 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.652 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.484 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.274 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.492 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.599 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.355 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.574 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.609 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.428 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.650 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.756
- SpineNet-96 1024
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.469 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.673 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.507 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.300 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.510 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.608 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.364 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.588 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.624 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.448 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.665 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.768
- SpineNet-143 1280
Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.492 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.692 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.533 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.337 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.537 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.631 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.377 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.609 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.646 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.479 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.686 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.781
- SpineNet-49S 640
Bbox Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.397 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.603 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.429 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.217 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.430 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.550 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.327 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.514 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.540 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.338 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.586 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.705 Segm Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.349 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.567 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.367 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.162 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.379 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.508 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.298 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.456 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.477 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.263 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.525 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.664
- SpineNet-49 640
Bbox Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.433 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.641 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.469 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.260 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.468 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.585 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.344 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.541 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.568 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.377 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.613 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.728 Segm Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.378 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.606 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.399 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.195 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.412 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.539 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.314 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.481 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.502 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.294 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.552 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.682
- SpineNet-96 1024
Bbox Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.470 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.683 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.515 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.305 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.502 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.616 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.364 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.579 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.608 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.429 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.640 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.750 Segm Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.412 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.649 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.440 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.241 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.443 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.562 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.332 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.516 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.540 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.349 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.577 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.698
- SpineNet-143 1280
Bbox Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.483 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.689 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.529 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.330 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.527 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.607 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.364 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.578 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.606 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.445 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.652 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.729 Segm Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.413 Average Precision (AP) @[ IoU=0.50 | area= all | maxDets=100 ] = 0.654 Average Precision (AP) @[ IoU=0.75 | area= all | maxDets=100 ] = 0.444 Average Precision (AP) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.263 Average Precision (AP) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.452 Average Precision (AP) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.540 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 1 ] = 0.322 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets= 10 ] = 0.504 Average Recall (AR) @[ IoU=0.50:0.95 | area= all | maxDets=100 ] = 0.525 Average Recall (AR) @[ IoU=0.50:0.95 | area= small | maxDets=100 ] = 0.363 Average Recall (AR) @[ IoU=0.50:0.95 | area=medium | maxDets=100 ] = 0.569 Average Recall (AR) @[ IoU=0.50:0.95 | area= large | maxDets=100 ] = 0.657