MaskTextSpotterV3 testing and training
2022-07-20 17:30:00 【Between mountains and rivers 2018】
Mask TextSpotter v3
It is billed as the strongest end-to-end text recognition model.
In my own tests the detection quality really is excellent, although it is slightly slower than ABCNet. Let's look at the results first.
There are still some missed detections. But the text in the first image was actually detected, which is impressive: ABCNet did very poorly on that same image.
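The speed of the two models is as follows:
[Figure: timing output for ABCNet (ABC)]
[Figure: timing output for Mask TextSpotter v3 (mst)]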
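To make a speed comparison like this repeatable, a small timing harness helps. The following is only a sketch: `mean_latency` and the `infer` callable are hypothetical names, not part of ABCNet or MaskTextSpotterV3, and `infer` stands for whatever function runs one image through the model.

```python
import time


def mean_latency(infer, inputs, warmup=1):
    """Return the average wall-clock seconds per call of infer(x).

    `infer` is a hypothetical stand-in for a model's single-image
    inference function; nothing here depends on a specific library.
    """
    inputs = list(inputs)
    if not inputs:
        raise ValueError("need at least one input to time")

    # Warm-up calls are run but not timed (first calls are often slower).
    for x in inputs[:warmup]:
        infer(x)

    start = time.perf_counter()
    for x in inputs:
        infer(x)
    # For GPU models, CUDA work is asynchronous: call
    # torch.cuda.synchronize() inside `infer` (or here) before
    # reading the clock, otherwise the number is too optimistic.
    elapsed = time.perf_counter() - start
    return elapsed / len(inputs)


if __name__ == "__main__":
    # Dummy workload so the sketch runs without any model installed.
    seconds = mean_latency(lambda x: sum(range(x)), [10000] * 20)
    print("mean latency: %.6f s per item" % seconds)
```

Running both models through the same harness on the same images keeps the comparison fair.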
The environment and scripts are as follows:
Python3 (Python3.7 is recommended)
PyTorch >= 1.4 (1.4 is recommended)
cocoapi
yacs
matplotlib
GCC >= 4.9 (This is very important!)
OpenCV
CUDA >= 9.0 (10.0.130 is recommended)
My environment:
torch 1.4.0
torchvision 0.5.0
py362
python 3.6.2
cuda 10.1
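Before building, it can be worth confirming that the installed versions meet the minimums listed above. The sketch below is an assumption-laden helper, not part of the repository: `version_tuple`, `meets_minimum` and `report` are hypothetical names, and only `torch.__version__`, `torch.version.cuda` and `torch.cuda.is_available()` are real PyTorch attributes.

```python
import importlib
import re
import sys


def version_tuple(version):
    """Turn '1.5.1+cu101' into (1, 5, 1); build suffixes are dropped."""
    parts = []
    for piece in version.split("+")[0].split("."):
        match = re.match(r"\d+", piece)
        if match is None:
            break
        parts.append(int(match.group()))
    return tuple(parts)


def meets_minimum(found, minimum):
    """True if version string `found` is at least `minimum`."""
    a, b = version_tuple(found), version_tuple(minimum)
    width = max(len(a), len(b))
    # Pad with zeros so that '1.4' and '1.4.0' compare as equal.
    a = a + (0,) * (width - len(a))
    b = b + (0,) * (width - len(b))
    return a >= b


def report():
    """Print the Python, PyTorch and CUDA versions found locally."""
    print("python", sys.version.split()[0])
    try:
        torch = importlib.import_module("torch")
    except ImportError:
        print("torch is not installed")
        return
    ok = meets_minimum(torch.__version__, "1.4")
    print("torch", torch.__version__, "(ok)" if ok else "(older than 1.4)")
    print("cuda build:", torch.version.cuda,
          "| gpu available:", torch.cuda.is_available())


if __name__ == "__main__":
    report()
```

The GCC version is not checked here; `gcc --version` on the command line covers that requirement.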
git clone https://github.com/NVIDIA/apex.git
cd apex
python setup.py install --cuda_ext --cpp_ext
# Note: apex was compiled under torch==1.5.1 and torchvision==0.6.1
cd MaskTextSpotterV3
# build
python setup.py build develop
# demo: run inference with tools/demo.py
# 1. single image:
python tools/demo.py --image_path ./demo_images/img_77.jpg --visu_path ./demo_images/img_77_res.jpg
# 2. image folder:
python tools/demo.py --input ./demo_images/ --output ./out/demo_out
# test
python tools/test_net.py --config-file configs/mixtrain/seg_rec_poly_fuse_feature.yaml
# train
# 1. Train with SynthText
python3 -m torch.distributed.launch --nproc_per_node=8 tools/train_net.py --config-file configs/pretrain/seg_rec_poly_fuse_feature.yaml
I have not tried the training part yet.
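The training command above starts 8 processes, one per GPU. On a machine with a different number of GPUs, the `--nproc_per_node` value of `torch.distributed.launch` has to change. The sketch below only assembles the command line; `build_train_command` is a hypothetical helper, and whether the batch size or learning rate in the config also needs adjusting for fewer GPUs is not covered by the source.

```python
def build_train_command(
    num_gpus,
    config_file="configs/pretrain/seg_rec_poly_fuse_feature.yaml",
):
    """Assemble the distributed training command as a list of arguments.

    Hypothetical helper: it mirrors the command from the post and only
    varies the number of processes (one per GPU).
    """
    if num_gpus < 1:
        raise ValueError("num_gpus must be at least 1")
    return [
        "python3", "-m", "torch.distributed.launch",
        "--nproc_per_node=%d" % num_gpus,
        "tools/train_net.py",
        "--config-file", config_file,
    ]


if __name__ == "__main__":
    # Print the command for a 2-GPU machine; pass the list to
    # subprocess.run(...) from the repository root to actually start it.
    print(" ".join(build_train_command(2)))
```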