Microsoft COCO: Common Objects in Context

Lin, Tsung-Yi; Maire, Michael; Belongie, Serge; Bourdev, Lubomir; Girshick, Ross; Hays, James; Perona, Pietro; Ramanan, Deva; Zitnick, C. Lawrence; Dollár, Piotr

Computer Science > Computer Vision and Pattern Recognition

arXiv:1405.0312 (cs)

[Submitted on 1 May 2014 (v1), last revised 21 Feb 2015 (this version, v3)]

Title:Microsoft COCO: Common Objects in Context

Authors:Tsung-Yi Lin, Michael Maire, Serge Belongie, Lubomir Bourdev, Ross Girshick, James Hays, Pietro Perona, Deva Ramanan, C. Lawrence Zitnick, Piotr Dollár

View PDF

Abstract:We present a new dataset with the goal of advancing the state-of-the-art in object recognition by placing the question of object recognition in the context of the broader question of scene understanding. This is achieved by gathering images of complex everyday scenes containing common objects in their natural context. Objects are labeled using per-instance segmentations to aid in precise object localization. Our dataset contains photos of 91 objects types that would be easily recognizable by a 4 year old. With a total of 2.5 million labeled instances in 328k images, the creation of our dataset drew upon extensive crowd worker involvement via novel user interfaces for category detection, instance spotting and instance segmentation. We present a detailed statistical analysis of the dataset in comparison to PASCAL, ImageNet, and SUN. Finally, we provide baseline performance analysis for bounding box and segmentation detection results using a Deformable Parts Model.

Comments:	1) updated annotation pipeline description and figures; 2) added new section describing datasets splits; 3) updated author list
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:1405.0312 [cs.CV]
	(or arXiv:1405.0312v3 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.1405.0312

Submission history

From: Piotr Dollár [view email]
[v1] Thu, 1 May 2014 21:43:32 UTC (6,986 KB)
[v2] Sat, 5 Jul 2014 18:39:56 UTC (7,484 KB)
[v3] Sat, 21 Feb 2015 01:48:49 UTC (7,891 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Microsoft COCO: Common Objects in Context

Submission history

Access Paper:

References & Citations

5 blog links

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Microsoft COCO: Common Objects in Context

Submission history

Access Paper:

References & Citations

5 blog links

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators