当前位置:网站首页>Multimodal model clip4clip takes you to realize mutual search between text and video
Multimodal model clip4clip takes you to realize mutual search between text and video
2022-07-21 01:49:00 【Zilliz Planet】
publisher :Towhee Technical team
CLIP4Clip Cross modal graphic model CLIP Based on , Successfully realized the text / Video retrieval task . Whether it's by text looking for relevant content videos , Or automatically match the most appropriate description for the video ,CLIP4Clip Can help you do . Through a large number of ablation experiments ,CLIP4Clip Proved its effectiveness , And in MSR-VTT、MSVC、LSMDC、ActivityNet and DiDeMo Wait for the text - On the video data set SoTA result .
CLIP4Clip: Main Structure
CLIP4Clip Based on the pre trained graphic model , Complete the task of video retrieval through migration learning or fine-tuning . It uses pre trained CLIP Model as backbone network , It solves the task of video clip retrieval from frame level input , And uses the parameterless type 、 Sequence type and close type similarity calculator to get the final result .
Related information :
Model use case : video-text-embedding/clip4clip
The paper : CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval
More information : CLIP4Clip: CLIP Next city , utilize CLIP Realize video retrieval
For more project updates and details, please pay attention to our project ( https://github.com/towhee-io/towhee/blob/main/towhee/models/README_CN.md) , Your attention is a powerful driving force for us to generate electricity with love , welcome star, fork, slack Three even :)
zilliz User communication
边栏推荐
- [03] let's talk about "performance" through your CPU frequency?
- Go 每日一库之 gore
- 10 个用于网络管理员进行高级扫描的端口扫描工具
- JSP custom tags (an article learned that each line of code has comments)
- Network security comprehensive penetration test cve-2010-2883-pdf vulnerability analysis
- 力扣刷题14. 最长公共前缀
- cannot import name ‘import_ String 'from' Werkzeug '[bug resolution]
- Network security in Secondary Vocational Schools - the thinking of reverse PE reverse problem solving in 2022 National Games
- Monitor Tool 2019(2)Monit on Multiple Instances and Email Alerts
- Flink1.15源码阅读——flink-annotations
猜你喜欢
At32 MCU f415 OTG new function use
matlab-微分方程求解方法汇总
Distribution rules of weights of binary neural networks
20元一支的洗面奶,7天卖了上万,他们是如何做到的?
【服务器数据恢复】某品牌ProLiant服务器raid瘫痪数据库文件损坏的数据恢复
PHP(1)
【R语言文本挖掘】:情感分析与词云图绘制
Network security comprehensive penetration test cve-2010-2883-pdf vulnerability analysis
cannot import name ‘import_ String 'from' Werkzeug '[bug resolution]
C语言_定义结构体和使用变量
随机推荐
C语言入门实战(9):完数判断
Force deduction and question brushing record 4---69 Square root of X
Series operations of map/multimap container
371页20万字2021版智慧城市信息化综合建设方案
acwing 871. Sum of divisors
【琐琐碎碎小知识】 关于部分Unity编辑器在创建瓦片地图时缺乏Tiles选项
JVM调优方法
NETFLOW 与 SNMP两种不同的网络监控方法
C语言_定义结构体和使用变量
STM32 唯一ID读取方法
机器学习笔记 - 构建推荐系统(4) 用于协同过滤的矩阵分解
STM32 unique ID reading method
剑指 Offer II 041. 滑动窗口的平均值_____使用队列 / 循环数组实现
Go 每日一库之 gore
clion创建第一个C项目
LeetCode.208. 实现 Trie (前缀树)____字典树
移动端测试必备技能: adb命令和抓包
scala基础【高阶函数编程】
使用OpenCV调整图像的亮度和对比度
LeetCode.302 场周赛___01_6120. 数组能形成多少数对__简单哈希