In simpler terms, how to build a SIFT descriptor?

S

Sacerdos2016-05-12 10:27:52

Programming

Sacerdos, 2016-05-12 10:27:52

Based on this article: https://habrahabr.ru/post/106302/
The first points are clear and simple
1) Gauss passes through the image several times. We save some stages of blurring
2) Subtract the "neighboring" received images
3) We examine each pixel for an extremum together with the points surrounding it of the image itself, as well as the images "above" and "below"
All this is done for several "octaves", i.e. the original image is halved, and so on.
And this is where I fail to understand. What does "approximate the DoG function by a second-order Taylor polynomial" mean, in the sense of what exactly to do with the image in terms of implementation, for example, in C#? I don't understand finding derivatives in pixels.

Reply

Answer the question

In order to leave comments, you need to log in

1 answer(s)

D

Dmitry Pavlov, 2016-05-13
@dmitry_pavlov

We'll have to figure out the accompanying math first. As I understand it, the task comes down to writing logic code to determine whether a point in an octave is a key point, so that you can find the key point of an octave.
Or you can try to understand the existing code - see below. here and here .