基于感興趣區(qū)域的高性能視頻編碼幀內(nèi)預測優(yōu)化算法

宋人杰; 張元東

doi:10.11999/JEIT190330

基于感興趣區(qū)域的高性能視頻編碼幀內(nèi)預測優(yōu)化算法

doi: 10.11999/JEIT190330

宋人杰,
張元東^,

東北電力大學計算機學院吉林 132012

詳細信息

作者簡介:
宋人杰：女，1966年生，教授，研究方向為數(shù)字圖像處理與可視化應用、計算機視覺與電力應用

張元東：男，1993年生，碩士生，研究方向為感興趣區(qū)域HEVC算法

通訊作者:
張元東　1406632033@qq.com

中圖分類號: TN919.81
計量
- 文章訪問數(shù): 2081
- HTML全文瀏覽量: 927
- PDF下載量: 58
- 被引次數(shù): 0
出版歷程
- 收稿日期: 2019-05-13
- 修回日期: 2020-05-24
- 網(wǎng)絡出版日期: 2020-07-01
- 刊出日期: 2020-11-16

High Efficiency Video Coding Intra Prediction Optimization Algorithm Based on Region of Interest

Renjie SONG,
Yuandong ZHANG^,

School of Computer, Northeast Electric Power University, Jilin 132012, China

摘要

摘要: 針對高性能視頻編碼(HEVC)幀內(nèi)預測編碼算法復雜度較高的問題，該文提出一種基于感興趣區(qū)域的高性能視頻編碼幀內(nèi)預測優(yōu)化算法。首先，根據(jù)圖像顯著性劃分當前幀的感興趣區(qū)域(ROI)和非感興趣區(qū)域(NROI)；然后，對ROI基于空域相關性采用提出的快速編碼單元(CU)劃分算法決定當前編碼單元的最終劃分深度，跳過不必要的CU劃分過程；最后，基于ROI采用提出的預測單元(PU)模式快速選擇算法計算當前PU的能量和方向，根據(jù)能量和方向確定當前PU的預測模式，減少率失真代價的相關計算，達到降低編碼復雜度和節(jié)省編碼時間的目的。實驗結果表明，在峰值信噪比(PSNR)損失僅為0.0390 dB的情況下，所提算法可以平均降低47.37%的編碼時間。
- 高性能視頻編碼 /
- 感興趣區(qū)域 /
- 編碼單元劃分 /
- 預測單元模式選擇
Abstract: For the high complexity of High Efficiency Video Coding (HEVC) intra prediction coding algorithm, an HEVC intra prediction optimization algorithm based on Region Of Interest (ROI) is proposed. Firstly, the algorithm divides the Region Of Interest and Non-Region Of Interest (NROI) of the current frame according to image saliency; Then, the final grading depth of the current coding unit is determined by the proposed fast Coding Unit (CU) partitioning algorithm based on spatial correlation in the ROI, and the unnecessary CU partitioning process is skipped. Finally, the proposed Prediction Unit (PU) mode fast selection algorithm is used to calculate the energy and direction of the current PU based on the ROI, and the current PU prediction mode is determined according to the energy and direction, and the correlation calculation of the rate distortion cost is reduced, Achieving the purposes of reducing coding complexity and saving coding time. The experimental results show that the proposed algorithm can reduce the coding time by 47.37% on average when the Peak Signal-to-Noise Ratio (PSNR) loss is only 0.0390 dB.
- High Efficiency Video Coding(HEVC) /
- Region Of Interest(ROI) /
- Coding Unit(CU) division /
- Prediction Unit(PU) mode selection

HTML全文

圖 1 本文算法與文獻[8]、文獻[9]算法的檢測結果

下載: 全尺寸圖片幻燈片

圖 2 本文算法和HM13.0算法的RD性能比較

下載: 全尺寸圖片幻燈片

表 1 快速CU劃分算法正確率和PU預測模式快速選擇算法命中率(%)

序列	QP=22	QP=27	QP=32	QP=37	平均
Traffic	93.7/91.4	95.6/92.3	96.1/95.6	96.8/96.1	95.6/93.9
BQTerrace	93.1/89.7	94.8/91.4	95.8/93.5	96.4/94.7	95.0/92.3
Partyscene	92.4/90.2	94.7/93.1	95.6/93.9	96.2/94.8	94.7/93.0
Blowing Bubbles	91.1/88.6	93.4/90.3	94.7/92.5	95.8/93.7	93.8/91.3
Johnny	92.3/89.8	94.6/92.7	95.3/94.5	96.1/95.3	94.6/93.1
平均	92.5/89.9	94.6/91.9	95.5/94.0	96.3/94.9	94.7/92.7

下載: 導出CSV

表 2 本文算法與文獻[3]算法及文獻[6]算法實驗結果對比

分辨率	序列	BDBR(%)	BDPSNR(dB)	$T$(%)
$2560 \times 1600$	Traffic	0.7054/0.6874/0.6013	–0.0406/–0.0396/–0.0327	42.19/43.62/46.89
$2560 \times 1600$	PeopleOnStreet	1.2017/1.1047/0.7161	–0.0593/–0.0617/–0.0410	43.94/45.05/50.14
$1920 \times 1080$	Kimono	0.6725/0.6435/0.6314	–0.0351/–0.0309/–0.0293	42.76/43.93/47.93
	Basketball Drive	1.3316/1.2704/1.0341	–0.0296/–0.0311/–0.0274	43.35/44.86/48.19
	Cactus	1.2073/1.3160/0.9758	–0.0314/–0.0348/–0.0317	41.87/45.16/48.34
$832 \times 480$	BQMall	1.1986/1.1476/0.7692	–0.0724/–0.0769/–0.0405	40.01/42.93/45.54
	Basketball Drill	1.3843/1.2543/0.6963	–0.0716/–0.0683/–0.0317	39.16/43.47/46.74
	RaceHorsesC	1.2196/1.1702/0.7163	–0.0631/–0.0574/–0.0385	40.54/43.24/45.83
$416 \times 240$	Keiba	1.4055/1.1394/0.5631	–0.0965/–0.0846/–0.0417	41.96/43.56/46.14
	BQSquare	1.3423/1.2761/0.6176	–0.0913/–0.0877/–0.0475	41.64/44.87/46.86
	BasketballPass	1.4063/1.4322/0.7568	–0.0714/–0.0793/–0.0513	43.45/44.14/47.43
$1280 \times 720$	FourPeople	0.9704/0.9417/0.6975	–0.0542/–0.0523/–0.0372	42.64/43.17/47.39
	Vidy01	0.6725/0.6524/0.7351	–0.0403/–0.0443/–0.0462	41.47/41.83/46.87
	Vidyo3	1.0457/0.9125/0.8143	–0.0562/–0.0549/–0.0496	42.09/42.54/46.13
	平均	1.1260/1.0677/0.7375	–0.0581/–0.0574/–0.0390	41.93/43.74/47.17

下載: 導出CSV

表 3 本文算法與文獻[13]算法實驗結果對比

Class	文獻[13]算法			本文算法
Class	BDBR(%)	BDPSNR(dB)	$T$(%)	BDBR(%)	BDPSNR(dB)	$T$(%)
ClassA	0.9236	–0.0742	44.19	0.6697	–0.0392	48.62
ClassB	1.1747	–0.0557	45.77	0.8926	–0.0327	48.74
ClassC	1.3532	–0.0823	41.89	0.7369	–0.0354	45.86
ClassD	1.3479	–0.1022	43.94	0.6461	–0.0473	46.69
ClassE	1.0754	–0.0837	43.76	0.7493	–0.0441	46.93
平均	1.1750	–0.0796	43.91	0.7389	–0.0397	47.37

下載: 導出CSV

參考文獻(13)

王莉, 曹一凡, 杜高明, 等. 一種低延遲的3維高效視頻編碼中深度建模模式編碼器[J]. 電子與信息學報, 2019, 41(7): 1625–1632. doi: 10.11999/JEIT180798

WANG Li, CAO Yifan, DU Gaoming, et al. A Low-latency depth modelling mode-1 encoder in 3d-high efficiency video coding standard[J]. Journal of Electronics &Information Technology, 2019, 41(7): 1625–1632. doi: 10.11999/JEIT180798

TAI Kuanghan, HSIEH M Y, CHEN Meijuan, et al. A fast HEVC encoding method using depth information of collocated CUs and RD Cost characteristics of PU modes[J]. IEEE Transactions on Broadcasting, 2017, 63(4): 680–692. doi: 10.1109/TBC.2017.2722239

LI Yue, YANG Gaobo, ZHU Yapei, et al. Unimodal stopping model -based early SKIP mode decision for high -efficiency video coding[J]. IEEE Transactions on Multimedia, 2017, 19(7): 1431–1441. doi: 10.1109/TMM.2017.2669863

TAI Kuanghan, CHEN Meijuan, LIN Jieru, et al. Acceleration for HEVC encoder by bimodal segmentation of Rate-Distortion cost and accurate determination of early termination and early split[J]. IEEE Access, 2019, 7: 45259–45273. doi: 10.1109/ACCESS.2019.2900517

HUANG Chao, PENG Zongju, CHEN Fen, et al. Efficient CU and PU decision based on neural network and gray level co-occurrence matrix for intra prediction of screen content coding[J]. IEEE Access, 2018, 6: 46643–46655. doi: 10.1109/ACCESS.2018.2866081

CHEN Meijuan, WU Yude, YEH C H, et al. Efficient CU and PU decision based on motion information for interprediction of HEVC[J]. IEEE Transactions on Industrial Informatics, 2018, 14(11): 4735–4745. doi: 10.1109/TII.2018.2801852

LIU Xingang, LIU Yinbo, WANG Peicheng, et al. An adaptive mode decision algorithm based on video texture characteristics for HEVC intra prediction[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2017, 27(8): 1737–1748. doi: 10.1109/TCSVT.2016.2556278

WANG Xinzhi, FANG Yifan, LI Changdi, et al. Static gesture segmentation technique based on improved Sobel operator[J]. The Journal of Engineering, 2019, 2019(22): 8339–8342. doi: 10.1049/joe.2019.1075

GONG Shenjian, LI Guangqiang, ZHANG Yongju, et al. Application of static gesture segmentation based on an improved canny operator[J]. The Journal of Engineering, 2019, 2019(15): 543–546. doi: 10.1049/joe.2018.9377

余映, 吳青龍, 邵凱旋, 等. 超復數(shù)域小波變換的顯著性檢測[J]. 電子與信息學報, 2019, 41(9): 2231–2238. doi: 10.11999/JEIT180738

YU Ying, WU Qinglong, SHAO Kaixuan, et al. Saliency detection of wavelet transform in hypercomplexdomain[J]. Journal of Electronics &Information Technology, 2019, 41(9): 2231–2238. doi: 10.11999/JEIT180738

ZHANG Tao, SUN Mingting, ZHAO Debin, et al. Fast intra-mode and CU size decision for HEVC[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2017, 27(8): 1714–1726. doi: 10.1109/TCSVT.2016.2556518

PAN Zhaoqing, LEI Jianjun, ZHANG Yun, et al. Fast motion estimation based on content property for low-complexity H.265/HEVC encoder[J]. IEEE Transactions on Broadcasting, 2016, 62(3): 675–684. doi: 10.1109/TBC.2016.2580920

GU Jiawen, TANG Minhao, WEN Jiangtao, et al. Adaptive intra candidate selection with early depth decision for fast intra prediction in HEVC[J]. IEEE Signal Processing Letters, 2018, 25(2): 159–163. doi: 10.1109/LSP.2017.2766766

施引文獻

資源附件(0)

訪問統(tǒng)計

圖(2) / 表(3)

計量

文章訪問數(shù): 2081
HTML全文瀏覽量: 927
PDF下載量: 58
被引次數(shù): 0

姓名
郵箱
手機號碼
標題
留言內(nèi)容
驗證碼

一级黄色片免费播放|中国黄色视频播放片|日本三级a|可以直接考播黄片影视免费一级毛片

留言板

基于感興趣區(qū)域的高性能視頻編碼幀內(nèi)預測優(yōu)化算法

doi: 10.11999/JEIT190330

作者簡介:
宋人杰：女，1966年生，教授，研究方向為數(shù)字圖像處理與可視化應用、計算機視覺與電力應用

張元東：男，1993年生，碩士生，研究方向為感興趣區(qū)域HEVC算法

通訊作者:
張元東　1406632033@qq.com

計量

High Efficiency Video Coding Intra Prediction Optimization Algorithm Based on Region of Interest

計量

目錄

一级黄色片免费播放|中国黄色视频播放片|日本三级a|可以直接考播黄片影视免费一级毛片

留言板

基于感興趣區(qū)域的高性能視頻編碼幀內(nèi)預測優(yōu)化算法

doi: 10.11999/JEIT190330

作者簡介: 宋人杰：女，1966年生，教授，研究方向為數(shù)字圖像處理與可視化應用、計算機視覺與電力應用 張元東：男，1993年生，碩士生，研究方向為感興趣區(qū)域HEVC算法

通訊作者: 張元東 1406632033@qq.com

計量

出版歷程

High Efficiency Video Coding Intra Prediction Optimization Algorithm Based on Region of Interest

計量

出版歷程

目錄

作者簡介:
宋人杰：女，1966年生，教授，研究方向為數(shù)字圖像處理與可視化應用、計算機視覺與電力應用

張元東：男，1993年生，碩士生，研究方向為感興趣區(qū)域HEVC算法

通訊作者:
張元東　1406632033@qq.com