Conceptual structures and computational methods for indexing and organization of visual information

Posted on:2004-09-30

Degree:Ph.D

Type:Dissertation

University:Columbia University

Candidate:Jaimes, Alejandro

Full Text:PDF

GTID:1468390011973785

Subject:Engineering

Abstract/Summary:

We address the problem of automatic indexing and organization of visual information through user interaction at multiple levels. Our work focuses on the following three important areas: (1) understanding of visual content and the way users search and index it; (2) construction of flexible computational methods that learn how to automatically classify images and videos from user input at multiple levels; (3) integration of generic visual detectors in solving practical tasks in the specific domain of consumer photography.; In particular, we present the following: (1) novel conceptual structures for classifying visual attributes (the Multi-Level Indexing Pyramid ); (2) a novel framework for learning structured visual detectors from user input (the Visual Apprentice); (3) a new study of human eye movements in observing images of different visual categories; (4) a new framework for the detection of non-identical duplicate consumer photographs in an interactive consumer image organization system; (5) detailed study of duplicate consumer photographs.; In the Visual Apprentice (VA), first a user defines a model via a multiple-level definition hierarchy (a scene consists of objects, object-parts, etc.). Then, the user labels example images or videos based on the hierarchy (a handshake image contains two faces and a handshake) and visual features are extracted from each example. Finally, several machine learning algorithms are used to learn classifiers for different nodes of the hierarchy. The best classifiers and features are automatically selected to produce a Visual Detector (e.g., for a handshake), which is applied to new images or videos.; In the human eye tracking experiments we examine variations in the way people look at images within and across different visual categories and explore ways of integrating eye tracking analysis with the VA framework.; Finally, we present a novel framework for the detection of non-identical duplicate consumer images for systems that help users automatically organize their collections. Our approach is based on a multiple strategy that combines knowledge about the geometry of multiple views of the same scene, the extraction of low-level features, the detection of objects using the VA and domain knowledge.

Keywords/Search Tags:

Visual, Organization, Indexing, User, Multiple

Related items

1	Research And Implementation On Indexing Mechanism For The Ocean Data Organization
2	Feature-based indexing in visual information systems
3	The Research On The Fusion Method Between Information Self Organization And Artificial Organization Of The Network Community
4	Mixed Signature And Dynamic Indexing For Effective Motion Trajectory Representation And Recognition
5	Indexing multimedia collections and user access An analysis of the indexing systems in place at the BBC Archive and the British Film Institute National Archive
6	The Research Of User-based Information Recourses Organization
7	Research On P2P Network Based Vector Gegraphic Data Organization And Indexing Technogoy
8	Study On The Video Content Organization And Indexing
9	Local Visual Information Based Large-Scale Image Retrieval
10	Research On Soccer Video Indexing Algorithm Based On Multiple Features