Bridging the semantic gap: Exploring descriptive vocabulary for image structure

Posted on:2007-08-29

Degree:Ph.D

Type:Dissertation

University:Indiana University

Candidate:Beebe, Caroline

Full Text:PDF

GTID:1458390005983496

Subject:Library science

Abstract/Summary:

PDF Full Text Request

Content-Based Image Retrieval (CBIR) is a technology made possible by the binary nature of the computer. Although CBIR is used for the representation and retrieval of digital images, these systems make no attempt either to establish a basis for similarity judgments generated by query-by-pictorial-example searches or to address the connection between image content and its internal spatial composition. The disconnect between physical data (the binary code of the computer) and its conceptual interpretation (the intellectual code of the searcher) is known as the semantic gap. A descriptive vocabulary capable of representing the internal visual structure of images has the potential to bridge this gap by connecting physical data with its conceptual interpretation. The research project addressed three questions: Is there a shared vocabulary of terms used by subjects to represent the internal contextuality (i.e., composition) of images? Can the natural language terms be organized into concepts? And, if there is a vocabulary of concepts, is it shared across subject pairs? A natural language vocabulary was identified on the basis of term occurrence in oral descriptions provided by 21 pairs of subjects participating in a referential communication task. In this experiment, each subject pair generated oral descriptions for 14 of 182 images drawn from the domains of abstract art, satellite imagery and photo-microscopy. Analysis of the natural language vocabulary identified a set of 1,319 unique terms which were collapsed into 545 concepts. These terms and concepts were organized into a faceted vocabulary. This faceted vocabulary can contribute to the development of more effective image retrieval metrics and interfaces to minimize the terminological confusion and conceptual overlap that currently exists in most CBIR systems. For both the user and the system, the concepts in the faceted vocabulary can be used to represent shapes and relationships between shapes (i.e., internal contextuality) that constitute the internal spatial composition of an image. Representation of internal contextuality would contribute to more effective image search and retrieval by facilitating the construction of more precise feature queries by the user as well as the selection of criteria for similarity judgments in CBIR applications.

Keywords/Search Tags:

Image, CBIR, Vocabulary, Gap, Retrieval

PDF Full Text Request

Related items

1	Vocabulary Tree-based Image Retrieval Technology Research
2	Visual Vocabulary For Image Retrieval Tree Study
3	Vocabulary Tree-based Image Retrieval, Extraction And Research Of Visual Keywords
4	Studying On The CBIR System Based On NSCT
5	Based On Local Sensitivity For CBIR Research
6	The Research Of Image Retrieval Technology Based On Hashing Methods
7	A CBIR Algorithm Based On The Adaptive Adjustment Of Feature Weights
8	Based Relevance Feedback Image Retrieval Techniques And Realization
9	Research On Some Techniques Of Content-based Image Retrieval
10	Research On Image Retrieval Algorithm Based On Bag-of-Words And Vocabulary Tree