Reduced complexity on video encoding for wireless multimedia sensor networks

YAN, Zhuge (2020) Reduced complexity on video encoding for wireless multimedia sensor networks. PhD thesis, University of Nottingham.

[thumbnail of Zhuge PhD.pdf] PDF (Thesis - as examined) - Repository staff only - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
Download (2MB)

Abstract

A wireless multimedia sensor network (WMSN) requires a video encoding system that should be energy-efficient because of its special characteristics: limited power capacity but long service life without maintenance. Besides, video is one of the most important information contents that a WMSN delivers. Motion estimation plays an important role in predictive coding, which is the key part of video encoding and requires large amount of computation. In order to reduce the computational complexity of motion estimation, the block-matching search algorithm is able to find the motion vectors using fewer search points. An algorithm which adds a predictive search technique to the enhanced modified orthogonal search (EMOS) has been proposed. It improves the efficiency of block-matching search algorithms, and has two self-adapting search patterns for large and small movements. The proposed algorithm requires less search points to work out the movement of blocks and provides acceptable image quality. This algorithm was also tested on field programmable gate array (FPGA) and Arduino platforms.

Moreover, a back propagation neural network model is introduced for predictive block-matching. The proposed back propagation neural network has very simple structure with only 5 inputs, 5 hidden neurons and 1 output architecture. Because of its simplicity, it requires very little computational power which is negligible compared with existing computation complexity. The test results show the prediction accuracy in 10 - 30\% higher then the competing algorithms with a peak signal-to-noise ratro (PSNR) improvement up to 0.3 dB. The above advantages make it a feasible replacement of the current solution.

With the information technology developing dramatically, there is reason to believe that the next generation video encoding standard HEVC will soon be able to run on very cheap platforms. Therefore, a prospective study on HEVC inter prediction acceleration was also carried out. We extracted specific image features that represent prediction unit texture, incorporated a machine learning technique, namely random forest, in HEVC intra prediction mode selection, to improve the performance of inter coding of HEVC. Benchmarking with other existing algorithms, our method extracts very specific features of image texture changes in terms of angle. Therefore the proposed method can achieve very high prediction accuracy. Having similar reduction in complexity, the proposed algorithm will be demonstrated to have a higher video quality compared with similar algorithms.

Item Type: Thesis (University of Nottingham only) (PhD)
Supervisors: Shaker, Sherif Welsen
Cho, Sui-Yeung
Keywords: wireless multimedia sensor network, video processing
Subjects: T Technology > TK Electrical engineering. Electronics Nuclear engineering
Faculties/Schools: UNNC Ningbo, China Campus > Faculty of Science and Engineering > Department of Electrical and Electronic Engineering
Item ID: 60558
Depositing User: YAN, Zhuge
Date Deposited: 14 May 2020 05:50
Last Modified: 27 May 2020 08:16
URI: https://eprints.nottingham.ac.uk/id/eprint/60558

Actions (Archive Staff Only)

Edit View Edit View