当前位置:网站首页>Vcip2021: super resolution using decoded information
Vcip2021: super resolution using decoded information
2022-06-22 14:10:00 【Dillon2015】
This article is from VCIP2021《CNN-based Super Resolution for Video Coding Using Decoded Information》

brief introduction
With the increase of high-resolution video , It brings great challenges to video transmission in the case of limited bandwidth . To solve this problem , The encoding method of resampling can be adopted , Such as Fig.1, The video is down sampled before encoding , Then encode low resolution video , After decoding, the decoder performs up sampling to restore the original resolution .AV1 There is a mode of encoding the down sampled frames and then up sampling at the decoder .VVC China also supports RPR.

With based on CNN Superresolution of (SR) The development of , It has great potential in video coding . In this paper, we propose a method to combine the coding information in video coding SR. In the existing research SR And encoder are usually regarded as independent parts , And the paper put forward in SR Not only reconstruction information but also prediction information is used in 、QP And so on .
Model design
Due to the different characteristics of brightness and chroma , Designed separately for brightness and chroma SR Model .

Fig.2 Is the structure of the brightness model , Input includes reconstruction information 、 Forecast information and QP map, Benchmark model selection single-scale EDSR, Since each convolution layer has only 64 Feature channels, so the model has no residual scaling layer . In the figure RB Is the residual block structure , share 16 individual . The last accretion layer is 4 Channels , And then through shuffle Layers produce high-resolution reconstructed images .
Fig.3 Is the structure of the chromaticity model , The main difference from the brightness model is the input , In order to make full use of texture information, brightness reconstruction information is also used as input . The brightness reconstruction information should pass through the step of 2 The same resolution as chromaticity is sampled under the convolution layer of . The model input also includes chromaticity U Reconstruction information 、 chroma V Reconstruction information and QP map, The input does not contain forecast information .
experimental result
Data sets
Use DIV2K Dataset training model , Image to YUV420 Format , Use VTM11.0 stay RPR Configure the next encoding ,QP={22,27,32,37,42}, The training image is encoded first 2 Double down sampling , Then the decoded low resolution image and the corresponding original resolution image are used for training .
Experimental configuration
Use PyTorch Frame training model , use Tesla V-100 GPU Training ,mini-batch size Set to 16, Use Adam The optimizer learning rate is le-4, Every time 200 Wheel press 0.5 The decay factor reduces the learning rate .
experimental result


The encoder adopts All Intra To configure ,QP={22,27,32,37,42}, Results such as table 1 Shown ,Fig.4 Is the of each sequence RD curve . The gain mainly comes from the low bit rate part , This shows that this method is better for low bandwidth scenarios .
Interested parties, please pay attention to WeChat official account Video Coding

边栏推荐
- 别再用 System.currentTimeMillis() 统计耗时了,太 Low,StopWatch 好用到爆!
- Record of problems encountered in dual network card binding
- [Nacos cloud native] the first step of reading the source code is to start Nacos locally
- History of hash index design
- 机器人方向的刚性需求→个人思考←
- 安装和使用protobuf-c
- Performance related indicators
- CVE-2022-22965複現
- 快速了解常用的对称加密算法,再也不用担心面试官的刨根问底
- Flink状态管理
猜你喜欢

Should programmers choose software talent outsourcing companies?

聊一聊数据库的行存与列存
MySQL如何让一个表中可以有多个自增列

Offline physical stores combined with VR panorama make virtual shopping more realistic

Tables converting to latex format

VR panoramic shooting breaks the deadlock of traditional propaganda that wild geese pass without leaving traces

一文搞懂开放源码软件(OSS)质量保证

openGauss数据库源码解析系列文章—— 密态等值查询技术详解

Stop using system Currenttimemillis() takes too long to count. It's too low. Stopwatch is easy to use!

leetcode-数学题
随机推荐
《Kubernetes监控篇:Grafana通过自动化方式添加datasource和dashboard》
Getting started with shell Basics
Locks in MySQL
polardbx是pg还是mysql?
30岁之前一定要明白的道理
Seven cattle cloud upload picture
Oracle cursor
在CSDN写文几年,我出了“第一本书“,感恩!
Detailed explanation of rules and ideas for advance sale of deposit
论文专利博客写作总结
Which securities company is good for retail investors to open an account? Is it safe to open a mobile account?
Views in MySQL
如何实现接口异常场景测试?测试方法探索与测试工具实现
"N'osez pas douter du Code, vous devez douter du Code" notez une analyse de délai de demande réseau
A simple scientific research secret
一文搞懂开放源码软件(OSS)质量保证
JS advanced programming version 4: learning iterators
天润云上市在即:VC大佬田溯宁大幅减持,预计将套现2.6亿港元
Microservice test efficiency governance
防火墙基础之策略部署