An Improved Algorithm And Its Parallel Optimization On GPUs For Motion Estimation Of Advanced Video Coding

Posted on:2017-01-13

Degree:Master

Type:Thesis

Country:China

Candidate:W Z Zhu

Full Text:PDF

GTID:2348330503489870

Subject:Computer system architecture

Abstract/Summary:

PDF Full Text Request

Motion estimation is one of the key parts of advanced video coding(AVC) standard H.264. With more effective inter prediction techniques, H.264 can tremendously improve the video compression ratio, but introduces more computation complexity at the same time, which is a big challenge to real-time video coding. Recently, GPU(Graphics Processing Unit) become more and more powerful at parallel processing, and CUDA(Compute Unified Device Architecture) based on GPU is a good programming model for realization of parallelizing motion estimation. Thus, it has great significance for real-time video applications, such as network live broadcast,to deep into the parallelism of motion estimation for accelerating video coding.After optimizing motion search algorithm and analyzing motion estimation's parallelism, a parallel optimization solution based on CUDA for H.264 motion estimation is proposed. Firstly, a novel motion search algorithm TLS(Two Level Search) is proposed, which is adapted to CUDA programming model. The first level search, which is a globally coarse-grained search, is to find best motion vector with 4 step size in search window. The second level search, which is a locally fine-grained search, is to find the final best motion vector within 5�5 quare around the position of the first level search's best motion vector. This algorithm greatly reduce the search points, and can fastly get the best motion vector with GPU's strong parallel computing ability. Secondly, an asynchronous processing model is proposed, which is related to residul coding on CPU and motion estimation on GPU. A frame is partitioned into N parts, which has no data dependency on each other. When CPU is processing residul coding of the part n-1, GPU can process motion estimation of the part n, so if residul coding of the part n-1 is finished, CPU can directly process the next part without delay.Experimental results show that after applying the proposed optimization, the speed of motion estimation part using TLS could be at most 40 times faster than the x264 encoder using ESA(Exhaustive Search Algorithm), and the whole coding speed could be at most 30 times faster, with PSNR(Peak Signal to Noise Ratio) error within 1.2d B. As a conclusion, the proposed solution could greatly accelerate the video coding with an acceptable video quality lose.

Keywords/Search Tags:

Advanced Video Coding, Motion Estimation, GPU, Parallel Optimization

PDF Full Text Request

Related items

1	Research On Moving Video Object Segmentation And Advanced Motion Estimation/Motion Compensation Algorithms
2	Advanced Scalable Coding And Motion Estimation Algorithms For Error Resilient Video Compression
3	Research And Parallel Implementation Of Motion Estimation Used In Video Encoding
4	Research On AVS2 High Speed Parallel Motion Estimation Algorithm Based On GPU
5	An Advanced Hierarchical Motion Estimation Scheme For Beyond High Definition Video Coding
6	Research On Optimization Of Advanced Video Coding And Its Extension
7	Optimization And Research On Entropy Coding And Motion Estimation Of AVS Video Encoder
8	The Research Of Fast Motion Estimation Algorithm Based On H.264
9	Estimation Algorithm Based On The H.264 Standard Movement Parallel Design And Realization
10	Research And Optimization On Motion Estimation Technology In Scalable Video Coding