Research Of Scene Understanding Algorithm Based On Monocular Vision

Posted on:2015-11-21

Degree:Master

Type:Thesis

Country:China

Candidate:H J Shen

Full Text:PDF

GTID:2308330482456042

Subject:Computational Mathematics

Abstract/Summary:

PDF Full Text Request

With the accelerating pace of urban life, car has become a part of peopleâ€™s life, but meanwhile, the frequency of traffic accident increases also, which turns into globally widespread concern. Safety Driving Assist can solve the problems of traffic safety. Scene Understanding is essential to solve the traffic safety problem, which is the research focus in various nations. On the base of analyzing and summarizing the detection methods of Scene Understanding abroad and home, this thesis proposes a new method.This thesis carries on study about Scene Understanding, makes image segmentation of it, and divides it into three categories:sky, roads and obstacles. In order to mark the scene image categories more accurately and quickly, Extreme Learning Machine (ELM) is used to identify and mark the scene region. Firstly, to extract the scene region HOG (Histogram of Oriented Gradient) features, apply BW dimension reduction methods, use ELM classifier for classification, train three ELM classifiers offline (sky, land, obstacles), and then, identify the corresponding scene categories. Although the scene has been made semantic annotation segmentation, the test results are not satisfactory; therefore, further consideration has been given to other image information, to classify semantic segmentation with the application of CRFs (Conditional Random Fields) on the base of initial ELM training result.For conditional random field image segmentation technology is the research hotspot in the field of computer vision in recent years, the core idea of this approach is to construct a Probabilistic Graphical Model (PGM) with observation data, give a potential function corresponding with the model, according to CRFs learning and inference method, get the maximum probability of making the semantic tags as the final mark. The novelty of this model is that it takes pixel gradient characteristics, location characteristics, the front frame of the relationship between car video frame and the edge information into consideration, and takes it as the largest clique potential function to establish model. With the application of Broyden Fletcher Goldfarb Shanno (BFGS) and Loopy Belief Propagation (LBP), the model has been trained and inferred respectively. Proposed algorithm has been tested in public available data sets like MSRC (Microsoft Research Cambridge), ICCV09DATA and real scenes video, the recognition rate was 84.29%.The purpose of this thesis is to identify the associated region and region, as well as scenes semantic image segmentation in context information.

Keywords/Search Tags:

scene understanding, ELM, conditional random fields, quasi Newton iteration method, cyclic belief propagation

PDF Full Text Request

Related items

1	Spoken Language Understanding Research Based On Conditional Random Fields
2	Research On Online Detection Method Of Reputation Fraud Campaign Based On Conditional Random Fields
3	Study Of The Understanding Of The Spatial Layout And Semantic Segmentation Towards Traffic Scene Images
4	Research On Key Technologies Of Hierarchical Scene Understanding
5	Research On Image Understanding Algorithm By Embedding Prior Information In Conditional Random Field Framework
6	Research On Morpheme Analysis Based On Conditional Random Fields In Chinese Natural Language Understanding
7	Research On Short Utterance Semantic Recognition Method Based On Cascaded Conditional Random Fields
8	Research On Low-Complexity And Fast LDPC Decoding Algorithms
9	SAR Image Change Detection Based On Conditional Random Fields
10	A Study On Chinese Personal Name Recognition Based On Conditional Random Fields