AMMAI: Aggregating local descriptors into a compact image representation

This paper focuses on searching images from a large dataset (10 million images or more). But here we have three parts to optimize.

The representation of an image
The dimension reduction of these representation vectors.
The indexing algorithm.

In this paper, it uses a descriptor called VLAD, which is derived from Bag-of-Words and Fisher Kernel, then aggregates SIFT descriptors.
In the second part, it uses asymmetric distance computation to do approximate nearest neighbors search. Then it uses PCA to do dimension reduction.
We mainly focus on part 1 and part 2. And here is the flow chart[1] to describe what this paper does.

[1] http://users.auth.gr/espyromi/publications/slides/spyromitrosWIAMIS2012slides.pdf

AMMAI

2016年3月15日星期二

Aggregating local descriptors into a compact image representation

沒有留言:

張貼留言

2016年3月15日 星期二

Aggregating local descriptors into a compact image representation

沒有留言:

張貼留言

2016年3月15日星期二