DJW c16313bb6a 第一次提交 9 kuukautta sitten
..
README.md c16313bb6a 第一次提交 9 kuukautta sitten
faster-rcnn_r50-caffe-c4_ms-18k_voc0712.py c16313bb6a 第一次提交 9 kuukautta sitten
faster-rcnn_r50_fpn_1x_voc0712-cocofmt.py c16313bb6a 第一次提交 9 kuukautta sitten
faster-rcnn_r50_fpn_1x_voc0712.py c16313bb6a 第一次提交 9 kuukautta sitten
retinanet_r50_fpn_1x_voc0712.py c16313bb6a 第一次提交 9 kuukautta sitten
ssd300_voc0712.py c16313bb6a 第一次提交 9 kuukautta sitten
ssd512_voc0712.py c16313bb6a 第一次提交 9 kuukautta sitten

README.md

Pascal VOC

The Pascal Visual Object Classes (VOC) Challenge

Abstract

The Pascal Visual Object Classes (VOC) challenge is a benchmark in visual object category recognition and detection, providing the vision and machine learning communities with a standard dataset of images and annotation, and standard evaluation procedures. Organised annually from 2005 to present, the challenge and its associated dataset has become accepted as the benchmark for object detection.

This paper describes the dataset and evaluation procedure. We review the state-of-the-art in evaluated methods for both classification and detection, analyse whether the methods are statistically different, what they are learning from the images (e.g. the object or its context), and what the methods find easy or confuse. The paper concludes with lessons learnt in the three year history of the challenge, and proposes directions for future improvement and extension.

Results and Models

Architecture Backbone Style Lr schd Mem (GB) Inf time (fps) box AP Config Download
Faster R-CNN C4 R-50 caffe 18k - 80.9 config model | log
Faster R-CNN R-50 pytorch 1x 2.6 - 80.4 config model | log
Retinanet R-50 pytorch 1x 2.1 - 77.3 config model | log
SSD300 VGG16 - 120e - - 76.5 config model | log
SSD512 VGG16 - 120e - - 79.5 config model | log

Citation

@Article{Everingham10,
   author = "Everingham, M. and Van~Gool, L. and Williams, C. K. I. and Winn, J. and Zisserman, A.",
   title = "The Pascal Visual Object Classes (VOC) Challenge",
   journal = "International Journal of Computer Vision",
   volume = "88",
   year = "2010",
   number = "2",
   month = jun,
   pages = "303--338",
}