当前位置:网站首页>Target detection | target size prediction based on statistical adaptive linear regression
Target detection | target size prediction based on statistical adaptive linear regression
2022-07-21 18:19:00 【Computer Vision Research Institute】
Computer Vision Institute column
author :Edison_G
YOLOv2 and YOLOv3 It is a typical target detection algorithm based on deep learning , They use statistical adaptive exponential regression model to design the last layer of the network to predict the size of the target .
Long press scan QR code, follow us
One 、 Briefly
What we share today is that researchers have proposed a target size prediction method based on statistical adaptive linear regression .YOLOv2 and YOLOv3 It is a typical object detection algorithm based on deep learning , They use statistical adaptive exponential regression model to design the last layer of the network to predict the size of the object .
However , Because of the properties of exponential function , Exponential regression model can spread the derivative of loss function to all parameters in the network . Researchers propose a statistical adaptive linear regression layer to alleviate the gradient explosion problem of exponential regression model . The proposed statistical adaptive linear regression model is used at the last layer of the network to predict the size of the target estimated from the statistical data of the training data set . Researchers have newly designed YOLOv3tiny The Internet , It's in UFPR-ALPR Data set is better than YOLOv3 It has higher performance .
Two 、 background
This is mainly a Korean paper , It makes my reading process more difficult , So today I will briefly introduce some overall framework ideas , Interested students can further read the paper , Deepen understanding !
Today we won't introduce the traditional detection network , Because we ” Institute of computer vision “ I shared too many dry goods and practices of target detection before , If you want to get more familiar with the entry level, please check the history sharing . See the following link for some sharing :
Previous recommendation
- Target detection dry goods | Multi level feature reuse greatly improves the detection accuracy ( At the end of the paper is attached the thesis download )
- SSD7-FFAM | Embedded friendly target detection network , For the safety of kindergarten children
- A new way of target detection | class-agnostic The detector is used for target detection ( A link to download the paper is attached )
- dried food | Using hand-held camera image, real-time rice detection is carried out through convolution neural network ( Pay homage to Mr. Yuan )
- CVPR 2021 | No need to label ? See how self supervised learning framework can help target detection
- object detection | Rich feature orientation Refinement Network For target detection ( attach github Source code )
3、 ... and 、 Frame analysis
Above, YOLOv2 and YOLOv3 The post-processing process .
The proposed method uses the estimable statistical data in the learning data set to predict the width and height of the target , This is related to YOLOv2 and YOLOv3 identical . The process of estimating the statistical value of the learning data set is as follows : According to the width and height of the target in the learning data set , Classify targets as K A cluster of , Then estimate the arithmetic mean of the width and height of the target in each cluster .
The statistical adaptive linear regression model proposed by the researchers further estimates the standard deviation of the width and height of the target in each cluster . then , The mean and standard deviation of the width and height of the target predicted by the network are designed to follow the mean and standard deviation of the width and height of the target in the learning data set . Use the estimable statistical values in the learning data set to constrain the statistical values of the predicted values , It can make the network more stable in the learning stage , Improve detection performance .
The existing YOLOv2, As for YOLOv3 Modification of statistical adaptive exponential regression model for target size prediction in , The researchers redefined the loss function of the statistical adaptive linear regression model for learning the proposed target size prediction , The proposed loss function is as above .
Four 、 experiment
UFPR-ALPR dataset
The network architecture of newly designed YOLOv3 tiny for experiments
The comparison on UFPR-ALPR test dataset
THE END
Please contact the official account for authorization.
The learning group of computer vision research institute is waiting for you to join !
The Institute of computer vision is mainly involved in the field of deep learning , Mainly devoted to face detection 、 Face recognition , Multi target detection 、 Target tracking 、 Image segmentation and other research directions . The Research Institute will continue to share the latest paper algorithm new framework , The difference of our reform this time is , We need to focus on ” Research “. After that, we will share the practice process for the corresponding fields , Let us really experience the real scene of getting rid of the theory , Develop the habit of hands-on programming and brain thinking !
Sweep yards attention
Institute of computer vision
official account ID|ComputerVisionGzq
Study Group | Scan the code to get the join mode on the homepage
Source download | reply “OSP” Get Downloads
边栏推荐
- MySQL transaction log
- 目标检测 | 基于统计自适应线性回归的目标尺寸预测
- 看懂RTK定位,这一篇就够啦!
- 网络安全学习(千锋网络安全笔记)4--用户与组管理
- [QT notes] use the timer qtimer in the run() function of qthread
- Resume: intelligent cockpit series article 5 - implicit interaction of its three interaction modes
- [wechat applet] page navigation details
- 不知道 MySQL 咋学?刷完牛客这 50 道题就够了!(第八篇)
- MFC计算器
- 【微信小程序】页面导航详解
猜你喜欢
Resume: smart cockpit series article 3 - the touch of five interactive technologies behind it
2022-07-20: the following go language code is about JSON and context. What is the output? A:{}; B: {"a": "B"}; c:{"context": 0}; D: uncertain. Package main imp
js发送消息框动画js特效
Understand RTK Positioning, this one is enough!
Use the esp32 download tool to burn firmware for esp32
2163. Minimum difference of sum after deleting elements [DP + heapq]
leetcode:1125. The smallest necessary team [shape pressing DP board + set covering board]
TCP/IP基础
Raspberry pie 3B builds Flink cluster
Crazy God redis notes 05
随机推荐
2022-07-20: the following go language code is about JSON and context. What is the output? A:{}; B: {"a": "B"}; c:{"context": 0}; D: uncertain. Package main imp
Vi Practice -- import and export of Excel tables
pytorch-California House Prices
Baize_foc esp32无刷电机驱动错误问题记录贴
Rxjs see the moon JS special effect code through the window
Vulnerability statements of the sorted burp official website
[QT notes] use the timer qtimer in the run() function of qthread
二维码制作完成后,还能修改二维码的内容吗?
Explain cache penetration, cache avalanche, cache breakdown
解析TCP之滑动窗口
不知道 MySQL 咋学?刷完牛客这 50 道题就够了(第十篇)
Browsersync web server setup
掌握 Dowanward API 的妙用,轻松拿捏 kubernetes 环境变量
ASCII码与16进制转换表
复盘:智能座舱系列文六- 它的3种交互方式之显式交互(语音以及显示)
Tiktok system | huge arithmetic interface signature analysis and decryption
iNFTnews | 元宇宙浪潮下,企业正通过AR和VR技术改善客户体验
网络模拟器 eNSP、EVE-NG、GNS3、Packet Tracert
Blocking and wakeup of a shell command
复盘:智能座舱系列文一,他到底是什么