Instra [anomaly detection: reconstruction_based]
2022-07-20 17:52:00 【It's too simple】
April 2021. Performance ranks No. 23 (Papers with Code).
Model principle
Idea: bring the transformer into the field of anomaly detection by recasting the generation-based reconstruction problem as an inpainting problem (mask some regions of the image and restore them, which can be regarded as a self-supervised method), and use the transformer to capture information over a large receptive field.
Patch Embeddings and Multihead Feature Self-attention
Patch embeddings
As shown in the figure, the input image is divided into a grid of 16*16-pixel patches, and a 7*7 window of patches is then selected at random from the grid, forming a tensor [252,49,768] (batch size: 252; 7*7 patches: 49; 16*16*3 (RGB channels): 768). This is projected to [252,49,512]. The network receives [252,49,512] (the unmasked patches together with the layers added to supply positional information) and [252,768] (the masked patch).
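The patch-embedding step above can be sketched in NumPy as follows. This is only an illustration of the shapes, not the paper's code: the 256*256 image size, the single masked patch per window, the `mask_token` vector, the additive `pos_embed`, and the random weights are all assumptions, and a batch of 4 stands in for 252.

```python
import numpy as np

PATCH = 16       # patch side length in pixels
WINDOW = 7       # window side length in patches
EMBED_DIM = 512  # embedding width used in the notes

def extract_window(image, top, left):
    """Return the WINDOW x WINDOW patches whose top-left grid cell is (top, left).

    image: [H, W, 3] with H and W multiples of PATCH.
    Result: [WINDOW*WINDOW, PATCH*PATCH*3] = [49, 768].
    """
    h, w, c = image.shape
    gh, gw = h // PATCH, w // PATCH
    # [gh, PATCH, gw, PATCH, c] -> [gh, gw, PATCH, PATCH, c]
    grid = image.reshape(gh, PATCH, gw, PATCH, c).transpose(0, 2, 1, 3, 4)
    window = grid[top:top + WINDOW, left:left + WINDOW]
    return window.reshape(WINDOW * WINDOW, PATCH * PATCH * c)

def make_batch(image, batch_size, rng, w_embed, pos_embed, mask_token):
    """Sample random windows, mask one patch in each, and embed the rest."""
    gh, gw = image.shape[0] // PATCH, image.shape[1] // PATCH
    tokens, targets = [], []
    for _ in range(batch_size):
        top = rng.integers(0, gh - WINDOW + 1)
        left = rng.integers(0, gw - WINDOW + 1)
        patches = extract_window(image, top, left)   # [49, 768]
        masked = rng.integers(0, WINDOW * WINDOW)    # index of the covered patch
        targets.append(patches[masked].copy())       # [768], the inpainting target
        emb = patches @ w_embed                      # [49, 512]
        emb[masked] = mask_token                     # hide the covered patch (assumed scheme)
        tokens.append(emb + pos_embed)               # add positional information
    return np.stack(tokens), np.stack(targets)

rng = np.random.default_rng(0)
image = rng.random((256, 256, 3))                    # hypothetical image: 16 x 16 grid of patches
w_embed = rng.normal(0.0, 0.02, (PATCH * PATCH * 3, EMBED_DIM))
pos_embed = rng.normal(0.0, 0.02, (WINDOW * WINDOW, EMBED_DIM))
mask_token = np.zeros(EMBED_DIM)

tokens, targets = make_batch(image, 4, rng, w_embed, pos_embed, mask_token)
print(tokens.shape, targets.shape)
```

With a batch size of 252 the two outputs would have the shapes [252,49,512] and [252,768] quoted above.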
Multihead Feature Self-attention (MFSA): the multi-head attention mechanism
The query, key, and value tensors all start from the input [252,49,512]. Query and key each pass through an MLP (two fully connected layers with an activation function between them) that outputs [252,49,256], which is then rearranged to [252,8,49,32]. Value passes through a single fully connected layer (no activation function) that outputs [252,49,512], which is then rearranged to [252,8,49,64].
The key tensor is transposed to [252,8,32,49], multiplied with the query tensor, and divided by a scaling factor, giving a tensor [252,8,49,49]. A softmax over it returns p_attn [252,8,49,49], and multiplying p_attn by value returns value_pro [252,8,49,64].
value_pro is then rearranged and passed through a fully connected layer, which outputs the tensor [252,49,512].
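The MFSA shapes described above can be traced with a small NumPy sketch. Several details are assumptions rather than facts from the notes: GELU as the activation inside the query/key MLPs, a hidden width of 512 in those MLPs, a value projection of width 512 (so that it splits into 8 heads of 64), a scaling factor of sqrt(32), omitted biases, and random weights. A batch of 2 stands in for 252.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x ** 3)))

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract the max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def split_heads(x, heads):
    # [B, N, D] -> [B, heads, N, D // heads]
    b, n, d = x.shape
    return x.reshape(b, n, heads, d // heads).transpose(0, 2, 1, 3)

def mfsa(x, p, heads=8):
    """Multihead feature self-attention on x of shape [B, 49, 512]."""
    q = gelu(x @ p["q1"]) @ p["q2"]          # two-layer MLP -> [B, 49, 256]
    k = gelu(x @ p["k1"]) @ p["k2"]          # two-layer MLP -> [B, 49, 256]
    v = x @ p["v"]                           # single linear layer -> [B, 49, 512]
    q = split_heads(q, heads)                # [B, 8, 49, 32]
    k = split_heads(k, heads)                # [B, 8, 49, 32]
    v = split_heads(v, heads)                # [B, 8, 49, 64]
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(q.shape[-1])  # [B, 8, 49, 49]
    p_attn = softmax(scores, axis=-1)        # [B, 8, 49, 49]
    value_pro = p_attn @ v                   # [B, 8, 49, 64]
    b, h, n, d = value_pro.shape
    merged = value_pro.transpose(0, 2, 1, 3).reshape(b, n, h * d)  # [B, 49, 512]
    return merged @ p["out"], p_attn         # final linear layer -> [B, 49, 512]

rng = np.random.default_rng(0)
init = lambda *shape: rng.normal(0.0, 0.02, shape)
params = {
    "q1": init(512, 512), "q2": init(512, 256),
    "k1": init(512, 512), "k2": init(512, 256),
    "v": init(512, 512), "out": init(512, 512),
}
x = rng.normal(size=(2, 49, 512))
out, p_attn = mfsa(x, params)
print(out.shape, p_attn.shape)
```

The only difference from standard multi-head self-attention in this sketch is that query and key come from small MLPs instead of single linear projections.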
Network Architecture and Training
The MLP block passes the input tensor through a fully connected layer, a GELU activation, and then a second fully connected layer.
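A minimal NumPy sketch of such an MLP block is given here. The hidden width of 2048 and the random weights are assumptions for illustration; the notes do not state the hidden size.

```python
import numpy as np

def gelu(x):
    # tanh approximation of GELU
    return 0.5 * x * (1.0 + np.tanh(np.sqrt(2.0 / np.pi) * (x + 0.044715 * x ** 3)))

def mlp(x, w1, b1, w2, b2):
    """Fully connected layer -> GELU -> fully connected layer."""
    return gelu(x @ w1 + b1) @ w2 + b2

rng = np.random.default_rng(0)
dim, hidden = 512, 2048                      # hidden width is an assumed value
w1, b1 = rng.normal(0.0, 0.02, (dim, hidden)), np.zeros(hidden)
w2, b2 = rng.normal(0.0, 0.02, (hidden, dim)), np.zeros(dim)

x = rng.normal(size=(2, 49, dim))            # batch of 2 instead of 252
y = mlp(x, w1, b1, w2, b2)
print(y.shape)
```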
--------------------------------------------
The transformer is built from MFSA and MLP blocks. It outputs a tensor [252,49,512], which is averaged over the 49 patches and passed through a fully connected layer to output [252,768].
This is mapped back to an image patch [252,3,16,16] and the loss is computed. The loss function combines an L2 loss, a structural similarity (SSIM) loss, and a gradient magnitude similarity (GMS) loss.
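The output head and the L2 term can be sketched as below. The sketch assumes that the averaging runs over the 49 patch tokens, that the 768 values are laid out channel-first when reshaped to [3,16,16], and that weights are random; the SSIM and GMS terms are left out to keep it short.

```python
import numpy as np

def reconstruct_patch(tokens, w_out, b_out):
    """Average the tokens, project to 768 values, reshape to an RGB patch."""
    pooled = tokens.mean(axis=1)             # [B, 49, 512] -> [B, 512]
    flat = pooled @ w_out + b_out            # [B, 768]
    return flat.reshape(-1, 3, 16, 16)       # assumed channel-first layout

def l2_loss(pred, target):
    """Mean squared error between the predicted and the original patch."""
    return float(np.mean((pred - target) ** 2))

rng = np.random.default_rng(0)
tokens = rng.normal(size=(2, 49, 512))       # transformer output, batch of 2 instead of 252
w_out, b_out = rng.normal(0.0, 0.02, (512, 768)), np.zeros(768)
target = rng.random((2, 3, 16, 16))          # stand-in for the masked patches

pred = reconstruct_patch(tokens, w_out, b_out)
print(pred.shape, l2_loss(pred, target))
```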
Experiments
Details: training the transformer takes relatively long, sometimes more than 500 epochs. Training stops once the validation loss has shown no obvious decrease for more than 50 epochs, and the best model is then selected for evaluation.
Ablation experiments: long connections, MSA versus MFSA, and the window length (7 in the original).
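The stopping rule can be written as a small helper. This is a sketch of the rule as described, not the authors' training loop: the function name, the `min_delta` threshold for what counts as a decrease, and the toy loss sequence are hypothetical.

```python
def early_stopping_epoch(val_losses, patience=50, min_delta=0.0):
    """Scan per-epoch validation losses and apply the patience rule.

    Returns (best_epoch, best_loss, stopped_epoch). Training stops at the first
    epoch that lies more than `patience` epochs after the best one.
    """
    best_loss = float("inf")
    best_epoch = -1
    stopped_epoch = -1
    for epoch, loss in enumerate(val_losses):
        stopped_epoch = epoch
        if loss < best_loss - min_delta:
            best_loss, best_epoch = loss, epoch   # new best model: checkpoint it here
        elif epoch - best_epoch > patience:
            break                                 # no improvement for more than `patience` epochs
    return best_epoch, best_loss, stopped_epoch

# Toy sequence: the loss improves for two epochs, then stays flat.
losses = [1.0, 0.5] + [0.6] * 100
result = early_stopping_epoch(losses)
print(result)
```

In a real training loop the checkpoint saved at `best_epoch` is the model that would be evaluated.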