resnext_coco.md 4.1 KB

ResNext (CVPR'2017)
@inproceedings{xie2017aggregated,
  title={Aggregated residual transformations for deep neural networks},
  author={Xie, Saining and Girshick, Ross and Doll{\'a}r, Piotr and Tu, Zhuowen and He, Kaiming},
  booktitle={Proceedings of the IEEE conference on computer vision and pattern recognition},
  pages={1492--1500},
  year={2017}
}

COCO (ECCV'2014)
@inproceedings{lin2014microsoft,
  title={Microsoft coco: Common objects in context},
  author={Lin, Tsung-Yi and Maire, Michael and Belongie, Serge and Hays, James and Perona, Pietro and Ramanan, Deva and Doll{\'a}r, Piotr and Zitnick, C Lawrence},
  booktitle={European conference on computer vision},
  pages={740--755},
  year={2014},
  organization={Springer}
}

Results on COCO val2017 with detector having human AP of 56.4 on COCO val2017 dataset

Arch Input Size AP AP50 AP75 AR AR50 ckpt log
pose_resnext_50 256x192 0.715 0.897 0.791 0.771 0.935 ckpt log
pose_resnext_50 384x288 0.724 0.899 0.794 0.777 0.936 ckpt log
pose_resnext_101 256x192 0.726 0.900 0.801 0.781 0.939 ckpt log
pose_resnext_101 384x288 0.744 0.903 0.815 0.794 0.939 ckpt log
pose_resnext_152 256x192 0.730 0.903 0.808 0.785 0.940 ckpt log
pose_resnext_152 384x288 0.742 0.904 0.810 0.794 0.940 ckpt log