|
|
|
|
|
| |
Posted by Randall P. Hootman on February 23, 2006, 2:31 pm
Please log in for more thread options
Last night at the HomeBrew Robotics Club (http://www.hbrobotics.org)
here in the Silicon Valley, 4 of our members put on an amazing
demonstration of visual recognition using Lowe's SIFT algorithm.
How they were able to put on the demonstration in the first place was
actually amazing in it's self. They first searched the web to find any
code that would help them, and when that failed, they plowed through the
mathematical paper on the algorithm and them implemented it. That was a
very tedious task.
It was a great presentation. First, one of the fellows showed his
computer (with webcam attached) first train on objects and then
recognizing these various objects by speaking what they were: a doll, a
dollar bill, etc. What was amazing was these objects could be rotated,
partially obsured, distorted, at varied distance, etc. and could still
be recoginized using this algorithm. For example, the computer trained
on both a one dollar bill and a twenty dollar bill. The bills were then
shown to the webcam rotated, crumpled, with a finger overlaying the bill
at varying distances and the computer said what the object was each and
every time. The kicker is that the computer can be trained on many many
many objects and recognize any or all of them. The training data is
stored in a database and the real problem is that it really comes down
to searching the database speed as to how fast objects are recognized.
Next, there was a gentle introduction to the algorithm. The guy showed a
few slides and then a program he had written in Visual Basic that took
you visually through the steps of the algorithm and explained what was
going on in each step.
Next, there was a slightly more technical explaination.
But what really blew the crowd away (not that the crowd of about 50
members wasn't already) was this: the next presenter had taken a CMUJcam
and attached it to a cheap FPGA in which he had implemented some of the
algorithm. While it was not yet recognizing objects (he is taking
development in steps and he has a real job with Xilinks). He pointed the
CMUcam at us, and at 60 FPS, there the crowd was in outlined form. A guy
in the back of the crowd started throwing and spinning a hat in the air.
No problems. The crowd started moving to see how robust this was. No
problem. Truely amazing.
There now are plans of finishing implementing Lowe's SIFT algorithm in
an FPGA, attaching a camera lens to it and selling them to HBRC members
to play with (read debug). The expected cost was somewhere under $50.
That's right. I'll type it again. $50. But I would think it worth it for
twice that much or more.
|
|
Posted by Padu on February 23, 2006, 3:00 pm
Please log in for more thread options
Really interesting stuff. From my weak knowledge on classification, it seems
that the SIFT algorithm is really useful for extracting a feature vector
from the image at hand. Do you know what type of algorithm they used for
classification?
Cheers
Padu
|
|
Posted by Mark H on February 23, 2006, 6:24 pm
Please log in for more thread options Just checking into this group after being gone for a while, thought I
would weigh in on SIFT. I took a computer vision class last spring and
while we didn't implement SIFT (we did implement Hough), we spent quite
some time studying it. It's not for the faint of heart. If anybody is
interested in it, however, I can probably find some notes and post them
here. The technology is currently being used in some photostitching
software which takes a collection of photos and automatically stitches
them together (instead of manually creating a mapping between photos).
It works *really* well, and I'd be shocked if the inventor isn't a
millionaire some day.
|
|
Posted by JGCASEY on February 23, 2006, 3:16 pm
Please log in for more thread options
I agree with Padu, really interesting stuff.
I googled with the subject line,
Lowe's SIFT algorithm
to get more information.
--
JC
|
|
Posted by on February 23, 2006, 3:30 pm
Please log in for more thread options >He pointed the
>CMUcam at us, and at 60 FPS, there the crowd was in outlined form. A
>guy in the back of the crowd started throwing and spinning a hat in the air.
>No problems. The crowd started moving to see how robust this was. No
>problem. Truely amazing.
what is the point of this paragraph? "no problem" ?
are you saying that the CMUcam had no problem outlining
objects at 60pfs?
Rich
|
| Similar Threads | Posted | | More Visual Recognition and the SIFT algorithm | February 24, 2006, 5:53 pm |
| The SIFT visual recognition algorithm | February 24, 2006, 10:07 pm |
| recognition celebrates of the concrete woman | August 22, 2007, 5:18 am |
| recognition upsets Haji bin Ahmad's counter | August 22, 2007, 11:06 pm |
| who witnesses eventually, when Founasse bursts the mutual recognition of the parish | December 27, 2007, 2:24 am |
| Stepper motor Visual Basic | September 20, 2006, 8:52 am |
| Visual Odometry to Estimate Position without GPS?? | October 31, 2006, 4:11 pm |
| it's very visual, I'll pause particularly or Abdel will flick the purchasers | November 13, 2007, 11:19 pm |
| you won't meet me servicing on board your visual organisation | November 14, 2007, 2:59 pm |
| both receiving now, Mohammed and Gul owed the visual moons regarding well operation | November 10, 2007, 4:06 pm |
|
|
|
>CMUcam at us, and at 60 FPS, there the crowd was in outlined form. A
>guy in the back of the crowd started throwing and spinning a hat in the air.