View article

[PDF] from thecvf.com

Image retrieval using scene graphs

Authors

Justin Johnson, Ranjay Krishna, Michael Stark, Li-Jia Li, David Shamma, Michael Bernstein, Li Fei-Fei

Publication date

2015

Conference

Proceedings of the IEEE conference on computer vision and pattern recognition

Pages

3668-3678

Description

This paper develops a novel framework for semantic image retrieval based on the notion of a scene graph. Our scene graphs represent objects (" man"," boat"), attributes of objects (" boat is white") and relationships between objects (" man standing on boat"). We use these scene graphs as queries to retrieve semantically related images. To this end, we design a conditional random field model that reasons about possible groundings of scene graphs to test images. The likelihoods of these groundings are used as ranking scores for retrieval. We introduce a novel dataset of 5,000 human-generated scene graphs grounded to images and use this dataset to evaluate our method for image retrieval. In particular, we evaluate retrieval using full scene graphs and small scene subgraphs, and show that our method outperforms retrieval methods that use only objects or low-level image features. In addition, we show that our full model can be used to improve object localization compared to baseline methods.

Total citations

Cited by 1173

201520162017201820192020202120222023202412 34 59 87 135 172 178 193 219 70

Scholar articles

Image retrieval using scene graphs

J Johnson, R Krishna, M Stark, LJ Li, D Shamma… - Proceedings of the IEEE conference on computer …, 2015

Cited by 1173 Related articles All 12 versions