iccv iccv2013 iccv2013-400 iccv2013-400-reference knowledge-graph by maker-knowledge-mining
Source: pdf
Author: Matthijs Douze, Jérôme Revaud, Cordelia Schmid, Hervé Jégou
Abstract: This paper makes two complementary contributions to event retrieval in large collections of videos. First, we propose hyper-pooling strategies that encode the frame descriptors into a representation of the video sequence in a stable manner. Our best choices compare favorably with regular pooling techniques based on k-means quantization. Second, we introduce a technique to improve the ranking. It can be interpreted either as a query expansion method or as a similarity adaptation based on the local context of the query video descriptor. Experiments on public benchmarks show that our methods are complementary and improve event retrieval results, without sacrificing efficiency.