Font Size: a A A

Integration and Goal-Guided Scheduling of Bottom-Up and Top-Down Computing Processes in Hierarchical Models

Posted on:2012-07-10Degree:Ph.DType:Thesis
University:University of California, Los AngelesCandidate:Wu, TianfuFull Text:PDF
GTID:2458390011954619Subject:Statistics
Abstract/Summary:
Accuracy performance and computational efficiency are two of the most important issues of object detection and parsing in computer vision, and the trade-off between them is usually guided by vision goals. In the literature, to improve accuracy performance, hierarchical models have been widely and successfully used, but often at the expense of increasing the computational burden largely. The explosion of computing costs would practically prevent a computer vision system from scaling to hierarchical models which consist of a large number of nodes. Meanwhile, hierarchical models are studied with zero-one loss used for nodes (i.e., loss cost-insensitive).;The goal of this thesis is to present a framework of integrating and scheduling bottom-up (BU) and top-down (TD) computing processes in a recursively defined hierarchical And-Or graph (AoG) to address, with a numerical study, the vision-goal-guided trade-off between accuracy performance and computational efficiency in both loss cost-insensitive and cost-sensitive situations.;The BU/TD computing processes consist of three types of processes identified for each node A in an AoG: (i) The alpha(A) process detects node A directly based on image features: (ii) The beta(A) process computes node A by binding its child node(s) bottom-up: and (iii) The gamma(A) process predicts node A top-down from its parent node(s). To evaluate their individual information contributions, the three processes are isolated and then trained separately. The learning of the three processes are formulated under the maximum likelihood estimation (MLE) framework. A numerical study of the information contribution is presented with both computer and human experiments. The experimental results show that the three processes contribute to computing node A from images in complementary ways in terms of scale and occlusion conditions.;To improve the accuracy performance, the alpha-beta-gamma computing processes are integrated by breadth-first search (BFS) in object parsing with AoG formulated under the Bayesian framework. The three processes are explicitly connected to the Bayesian inference and the dynamic programming (DP) implementation. With experiments on human face parsing and hierarchical image structure parsing, the results show performance improvement in the same manner consistent with their evaluated information contributions.;Next, to advance computational efficiency of learnt computing processes given allowable bounds on accuracy performance in cost-sensitive object detection, near-optimal decision policies are learnt for computing processes of terminal nodes and And-nodes by minimizing the corresponding risk function which explicitly takes into account the computational cost, the false negative (FN) and false positive (FP) loss costs.;Finally, a theoretical study is proposed to schedule all the alpha-beta-gamma computing processes in an AoG under the best, firstheuristic search framework to adapt computing orders of nodes in an AoG to different vision tasks and image datasets.
Keywords/Search Tags:Computing, Hierarchical models, Accuracy performance, Vision, Node, Computational efficiency, Aog, Top-down
Related items