当前位置:网站首页>Loop filtering using SSIM based CNN
Loop filtering using SSIM based CNN
2022-07-21 05:13:00 【Dillon2015】
This article comes from the proposal JVET-T0069《SSIM based CNN model for in-loop filtering》, At present, most of the video coding research based on deep learning is to optimize objective indicators , However, for human vision, a higher objective index sometimes does not mean a higher subjective effect . therefore , The proposal proposes a neural network model CNNLF(convolutional neural network based in-loop filter) For subjective optimization .
brief introduction
CNNLF Used in loop filtering stage , be in deblocking after ,SAO Before , Such as Fig.1 Shown .
The reconstruction frame will first use DF Filter processing , And then pass by CNNLF Model processing , At the end of the day SAO Further treatment . This method works at the block level , adopt RDO Decide whether to use CNNLF Handle , Therefore, it is necessary to pass a flag bit in the code stream to indicate whether CNNLF.
Network structure
The whole network structure is shown in the figure , from 8 A residual unit (3x3 Convolution ) constitute , Except for the last convolution layer generation 3 Dimension output , Other convolution layers are generated 64 Whitman's sign . Finally, there are two SE Block and one 3x3 Convolution connection is used after the network for adaptive mixed features to generate the final reconstructed image . Be careful , The first residual element is not used 1x1 Convolution . Shorting in the network can promote the flow of shallow features .
Residual unit
Pictured above , The residual element consists of a 1x1 Convolution layer and two 3x3 The convolution layer constitutes , The activation function is ReLU, There are two short circuits and one in the unit SE block (squeeze-and-excitation block).
Short circuit will 1x1 The output sum of convolution 3x3 The output of convolution can be added to learn the residual . In the following formula x It's input ,F(x) Is the entire output ,f() Express 1x1 Convolution operation ,g() It means residual learning .
SE The block structure is shown in the figure below , It processes features into multi-channel forms . It adapts the calibration feature channels by showing the dependencies between the modeled feature channels , It is conducive to model learning and compressed information .
Training
Model USES DIV2K Dataset training , contain 800 Training images ,100 Verification images , All images use VTM10.0 stay AI Configure the next encoding ,QP={22,27,32,37}. For different QP Train only one model . And the training indicators are SSIM, Test indicators use VMAF, because VMAF Discontinuous, so it is not suitable for use in training . The training configuration is shown in the table 1, among loss The function uses SSIM and L1 loss A weighted , Two sets of weight parameters are used here (0.2,0.8) and (0.8,0.2)
experimental result
The experimental test configuration is AI、RA and LDB,QP={22,27,32,37}. Due to the high computational complexity ,RA Configuration only runs the first GOP.YUV Do not use GPU, Decoding will use GPU and CPU Decode each time .
Loss = 0.8 x SSIM + 0.2 x L1
Loss = 0.2 x SSIM + 0.8 x L1
Interested parties, please pay attention to WeChat official account Video Coding
边栏推荐
- 初次见面,MySQL
- kubernetes部署单节点Redis服务
- [translation] thinking about machine learning engineering after a year of doctoral study
- MySQL索引与事务
- 国内的边缘计算组织和产品调研
- Comparison table of English abbreviations of network security terms
- 那些做3D、数字人、元宇宙的年轻人,究竟有多赚钱?
- Li Kou sword finger offer II 101 Divide equal sum subsets
- RGB+深度图像 语义分割paper阅读笔记(ICRA2021)
- The direction of this
猜你喜欢
2022年软件测试最新面试题及解析
Kotlin学习之json数据解析
Master these plug-ins and improve your office efficiency by 90% every minute!
如果在加密领域有段位,你是“青铜”还是“王者”?
关于晶圆介绍以及IGBT晶圆的应用
Jenkins使用上传的文件(File Parameter)作为参数
2022-07-19:f(i) : i的所有因子,每个因子都平方之后,累加起来。 比如f(10) = 1平方 + 2平方 + 5平方 + 10平方 = 1 + 4 + 25 + 100 = 130。
03 beautifulsoup parsing library
HCIP-7. LSA header of OSPF
01 课程简介&爬虫概述
随机推荐
【递归】7段码(回溯 + 结束条件通过并查集进行相邻节点约束判断)
[learning notes] insert addition
[learning notes] number theory thinking problem
C # use the merging method to judge whether the two lists are equal and whether they contain
Jmeter 接口必加元素
470-82(200、1049、416、50)
AutoJs学习-实现透明状态栏
What are the advantages of low code platforms over traditional IT development?
【二叉树】重建二叉树(先序找根,中序分治)
【学习笔记】整除分块、线性筛、莫比乌斯反演
Daily question 1: specified interval reversal in the linked list
OTA online upgrade practice record (still need to be updated)
shell脚本学习
[translation] thinking about machine learning engineering after a year of doctoral study
How to understand software testing with the 28 principle? See the following
Arduino IDE for Raspberry Pi Pico开发固件本地化安装教程
Leaflet 加载超图发布的投影图层报错 Uncaught No projection definition for code XXXX
Learn how to choose chart types, and Xiaobai can also play with data analysis
AutoJs学习-投币小游戏
IP command usage guide