| In this dissertation, I present a flexible image retrieval system that allows users to define queries of arbitrary shapes in a query-by-example environment. To realize this system, I have addressed a number of major issues, described as follows. (1) Query-by-example (QBE) is the most popular query model for content-based image retrieval (CBIR). A typical query contains not only objects of interest but also irrelevant image areas. The latter, referred to as noise, has limited the effectiveness of existing CBIR systems. I define noise-free queries (NFQs), which are composed of only relevant regions identified by the user at the query time. The challenge is how to precompute the feature vectors if we do not know the matching areas at database build time. I present a similarity model based on a sampling-based matching framework. The model handles NFQs effectively, and is robust with respect to scaling, translation, and semantic constraints of the matching objects. (2) To support large image datasets, I introduce a novel indexing technique for this new environment. The technique represents a unified solution to the following problems. (a) Since we cannot assume any shape for user-defined queries, the proposed indexing structure must be able to handle arbitrary-shaped queries. (b) It must be robust to scaling and translation. (c) Image similarity is typically determined by a large number of features. Current indexing approaches fail for high dimensional searching, a phenomenon known as the curse of dimensionality. An effective dimensionality reduction method has to be devised. (d) Many of the existing indexing techniques are not scalable. That is, the search time increases faster than the linear function of the data size. The proposed technique must be efficient for large data sets. (3) To facilitate the proposed indexing procedure, the core areas of NFQs have to be identified. To automate this task, it is necessary to propose an efficient algorithm that is able to detect the optimal core areas.; A prototype has been developed to demonstrate the feasibility and efficiency of the system. I conclude the dissertation with future research directions. |