This article is part of the series Color in Image and Video Processing.

Open Access Research Article

Unsupervised Video Shot Detection Using Clustering Ensemble with a Color Global Scale-Invariant Feature Transform Descriptor

Yuchou Chang1, DJ Lee1*, Yi Hong2 and James Archibald1

Author Affiliations

1 Electrical and Computer Engineering Department, Brigham Young University, Provo, UT 84602, USA

2 Department of Computer Science, City University of Hong Kong, Kowloon, Hong Kong

For all author emails, please log on.

EURASIP Journal on Image and Video Processing 2008, 2008:860743 doi:10.1155/2008/860743


The electronic version of this article is the complete one and can be found online at: http://jivp.eurasipjournals.com/content/2008/1/860743


Received:1 August 2007
Revisions received:30 October 2007
Accepted:22 November 2007
Published:11 December 2007

© 2008 The Author(s).

This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Abstract

Scale-invariant feature transform (SIFT) transforms a grayscale image into scale-invariant coordinates of local features that are invariant to image scale, rotation, and changing viewpoints. Because of its scale-invariant properties, SIFT has been successfully used for object recognition and content-based image retrieval. The biggest drawback of SIFT is that it uses only grayscale information and misses important visual information regarding color. In this paper, we present the development of a novel color feature extraction algorithm that addresses this problem, and we also propose a new clustering strategy using clustering ensembles for video shot detection. Based on Fibonacci lattice-quantization, we develop a novel color global scale-invariant feature transform (CGSIFT) for better description of color contents in video frames for video shot detection. CGSIFT first quantizes a color image, representing it with a small number of color indices, and then uses SIFT to extract features from the quantized color index image. We also develop a new space description method using small image regions to represent global color features as the second step of CGSIFT. Clustering ensembles focusing on knowledge reuse are then applied to obtain better clustering results than using single clustering methods for video shot detection. Evaluation of the proposed feature extraction algorithm and the new clustering strategy using clustering ensembles reveals very promising results for video shot detection.

Publisher note

To access the full article, please see PDF.