Revision as of 14:04, 18 April 2008 by Slitkouh (Talk)

Image Matching

Maximum likelihood estimates can be used in image matching (edge template matching and gray-level image matching). This can be applied to stereo matching and feature tracking. More about this topic can be found here ... [1]

Maximum likelihood can also be used in image reconstruction or restoration. Surprisingly, I found the usage of this in compression artifact removal also. See this paper [2]

Face Recognition

Fisher Linear Discriminant (FLD) is widely used in face recognition. Here is a paper for reference: [3]. Also variants of FLD are used for face recognition such as DiaFLD [4]. It has been observed that FLD works better than Principal Component Analysis in classifying the facial features.

See also: Face detection vs Face recognition_Old Kiwi

Image Segmentation

Image Segmentation is performed by conventional graphical methods, but many a times, some pixels not belonging to the same object are classified into the same segment. Also, in images where a wide background is separated by a thin boundary line, image segmentation can be performed by obtaining features from FLD. I experimented this personally and found that the results are better than the conventional methods. This paper gives a starting point in doing this [5].

Here is an example example of a clustering method used for image segmentation. Here the distance criterion used is the absolute value of the distance between the pixel values. Pixels and their neighbors are chosen from a four point neighborhood and then evaluated for their distances. By adjusting the threshold used to connect pixels, different levels of segmentation are achieved. Here are some results for this algorithm using a simple image.

Original Image: File:Original Old Kiwi.jpg

Image with a low distance threshold: File:Low Old Kiwi.jpg This strict threshold generated 27,654 connected sets, 36 of which are shown.

Image with a medium distance threshold: File:Med Old Kiwi.jpg This moderate threshold generated 16,747 connected sets, 41 of which are shown.

Image with a high distance threshold: File:High Old Kiwi.jpg This loose threshold generated 11,192 connected sets, of which 23 are shown.

Here we notice that as the threshold is increased, the criteria for merging regions becomes looser and the amount of regions starts to shrink.

Content-based Image/video Retrieval

With the increasing developments in memory technology, there has been a huge increase in the storage capacities of cameras, PDAs etc. This has led to a dramatic growth in personal image collections, and the image database sizes on the whole. The indexing of the databases however is not organized. So, retrieving images based on their names/indexes is very difficult. Hence, the images have to be retrieved by their content.

Pattern recognition plays an important role in retrieving of images. The features like color, texture, brightness, segments, shapes etc. of the images are used for retrieval. The appropriate feature selection is imperative to achieve good accuracy. Some of well-known image retrieval systems are 'imgseek', 'blobworld' [6], 'Bag of features' [7].

Pattern recognition can also be used to retrieve videos based on their content. This is more complex as compared to the image retrieval, because of the additional dimensions (time, motion). Several attempts have been made to develop good video retrieval systems. In some systems, classifying videos into categories like action, sports, animation, news/lectures etc. helps to reduce the computation complexity and the retrieval time, by retrieving the videos from the appropriate categories.

Alumni Liaison

Abstract algebra continues the conceptual developments of linear algebra, on an even grander scale.

Dr. Paul Garrett