Font Size: a A A

Study On Multimodel Retrieval Method Of Content-Based Video Database

Posted on:2006-12-22Degree:DoctorType:Dissertation
Country:ChinaCandidate:N LvFull Text:PDF
GTID:1118360155453693Subject:Communication and Information System
Abstract/Summary:PDF Full Text Request
contents based Video data retrieve is more and more popular in thecomputer multi-media domain , its methods include various techniquestrategies such as image retrieval , key frame retrieval , graphics retrieval ,audio retrieval and etc . Most of these strategies get retrieve results in theway of extracting feature description from video data . Of all strategiesmentioned above , they can be divides into three kinds of methods . One ismanual selection or providing feature descriptions , that is to say selectingretrieval keywords to retrieval system from various objective attribute orfeatures description library in video data . And the system will carry onretrieving operations according to the designed retrieval strategies .Another method is that to extract the physics feature from the image of videodata . It carris on the retrieving operations by user's selections , and makinguse of describable contents from color, texture, shape, motion of videocamera . The last method is that to extract logic features from video data .Currently , most methods of video data retrieving contains only oneprocessing technique , such as image retrieval , key frame retrieval , tonecolor or tone techniques , but there are few retrieval technologies based onsemantic contents in video data . The research that make use of video dataintegrate with ralated data of other forms , and carry on mutli-patternretrieving in a safe and efficient network environment is rare seen . So thisresearch give you a retrieval methods that foucs on the strategy above ,under the standard framework of MPEG-7 , it can build the speech soundfeature library of video data , which according to contents of video audioinformation and any other related multimodel data information .According to the multimodel video retrieval technique in this paper , wecan build a video data semantic-feature library from the point of view ofsemantic feature.Including audio ,subtitle ,music, plot script , news draft andother related information together with portrait, subtitle, speech sound ,technologies of video recognition and script analysis , we can create amultimodel model that extracting semantic feature of video data .Theretrieval platform also contains speech recognition engine and OCR engine .By video data information characteristics of subject meeting , televisionnews ,teleplay and movies , this paper provide a approach that taking thecritical point of speech sound or music as the scene subdivision ,and takingthe critical point of changes in tone color as the subdivision of videosegment .By feature extraction of the tone color or other audio ,we cansubdivide scene , audio frequency analysis is also helpful for video scenesubdivision . Make use of the speech sound relativity measurement, talker'sself-adapting , selecting subtitle frame, the edge extracting , two valulized insubtitle area , small wave packet decode ,the kernel function technique andso on , together with other related knowledge library , we can create videodata comment feature library that based on semantic comments andsubdivision of video scene. This paper also provide some original methods , the retrievaloperation such as people's name, actor's lines and actors can beimplemented by the way of feature match that make use of descriptioninformation of script , subtitle, speech sound ,portrait analyse . We canexecute the content-based retrieval operations according to therepresentative key image frame under the nearest feature line algorithm ,which is mentioned in literature [46] . The research in this paper based on semantic feature extractionimplement the access and retrieval operations toward video data , whichprovide the method of building the data feature library with descriptionstructure , and the library can be created by automatic , semiautomatic ormanual manipulation .Then the video data feature library framework that wediscuss becomes a traditional relation model . One characteristic of thispaper is that the content of video data and feature description library isseparate when designning the structure of video feature library .The contentof multimodel video feature library is a constraint condition that a retrievalsystem carries on various operations . Video data semantic feature librarytakes content and time constraint as checking conditions when it is built . Concerning construction , synchronization and compression of videostream description model ,this paper describes the presentation of motionimage and audio data in 4-D matrix form that include spatiality ,temporalityand presentation requirements featrues . We can view a video stream as theintegration of image and audio interwoven in a temporally close-coupledfashion .The presentation can be used to improve the compression rate andthe QoS for video ( or multimedia ) database systems and transmissionsystems . Safe is a very important factor when video stream is transmitted in thenetwork , so if we want to have an efficient video data retrieval system , weshould build a safe network enviroment . Currently , the main factors thataffect video stream application research are as follows : usability of thecontents ,the processing ability of the terminal , network performance ,characteristics of Consumer , the natural environment of the consumer :availability of bandwidth ,error rate, contents scale, adaptability, interactionand etc . The active network architecture is efficaciously solution for thisproblem . In an active network , some operations are transplanted to everynode , so DoS/DDoS ( Denial-of-Service/Distributed Denial-of-Service )packets can be discriminated and discarded . Upperstream nodes innetwork could discard these useless packets by the notify fromdownstream nodes , in this way the flow of useful packets can get morestream bandwidths . This paper provide you a system framework thatdiscriminate and control distributed attack and its implement strategy . Thissystem framework is build up on the active network enviroment , mainlyinclude three parts : automatic authentication and control strategy based ongathering architecture , active notify and track strategy based on gatheringarchitecture , control and cooperates strategy based on management domainenviroment . The thesis researching goal is to improve the practicability, efficiency,security, stability and interactivity of content-based video database multimoderetrieval methods. The thesis's major contribution and innovation are asfollows: 1.Under the security safeguards mechanism of the network performance,the thesis advances the multimode semantic interpretation-based videodatabase retrieval methods. According to the systematic analysis to datastructure feature of video, audio and text etc., this retrieval method is putforward after summarizing the correlativity among them. It saves the retrievaltime and storage space, reduces the communication cost, and correlated withall kinds of data. 2 . According to the technical analysis and combination of imageprocessing, audio processing and text processing, the thesis establishes thevideo semantic extracting model. This model offers the convenience for themultimode alternate notes, moreover, correlates all kinds of complex dataattributes so as to retrieve the relevant data efficiently. 3.According to the analysis and extraction of the feature information,such as, caption, speech, portrait and script etc., highly correlated with videodata's lexeme, the thesis advances the multimode alternate note methods...
Keywords/Search Tags:content based, multimodel, semantic feature, video retrieval, active network
PDF Full Text Request
Related items