当前位置:网站首页>Daily challenges of search engines_ 4_ External heterogeneous resources - Zhihu
Daily challenges of search engines_ 4_ External heterogeneous resources - Zhihu
2020-11-08 07:14:00 【I don't know.】
Write it at the front :
Search engine is an extremely complex system engineering , Search engines don't work wonders , It needs a little bit of polishing . This series records daily problems , In a way that looks at leopards , A little bit to show the charm of search engines .
To the body :
The island effect of mobile ecology is becoming more and more obvious , But they have a certain relationship with each other . For general search engines , Not all the resources 、 Ecology is satisfied one by one , External resources will certainly be introduced .
Compared with Jingdong 、 Ctrip 、 Meituan and others have a large number of searches every day , But unlike general search , They search for their own ecological output , Or structured content . It doesn't have to be like a general search engine at this point , Bear this kind of " Pain ".

The main way to introduce and retrieve external resources is to provide services by exposing interfaces and cards . There are also apps that jump to provide services .

( So now every big factory is building its own ecological content , Standard formatted data , It's also easy to manage . Like the headline 、 There was no. 、 Penguin 、 Even Zhihu column .)
But when resources need to be integrated into the search engine integrated results display page , It will bring A lot of questions to think about :
1 External ways of providing , It's database building , Or request api The way . The magnitude of the database ? The magnitude of the diversion ? Can you resist . Each has its own advantages and disadvantages , Think about it first .
2 How to build a database ? It's built with its own big library ? Or build a separate library ? Both ways have their own advantages and disadvantages .
3 The fields that create the library 、 Recall 、 How to align sorted fields ? How to deal with missing fields ?
4 The way of sorting side fusion , And ecological considerations .
5 Scalability considerations , How to put the standard 、 Put in storage 、 Sorting and other levels of work can be reused as much as possible , Unify management as much as possible .
6 api How to introduce resources , In terms of its content understanding , It's almost hard to do .
6 Audit operational controls . There is no way to audit , Content is not controlled , If there is sensitivity 、 Vulgar content can have a big impact . If the way of warehousing is better ,api The way is a problem .
版权声明
本文为[I don't know.]所创,转载请带上原文链接,感谢
边栏推荐
- Codeforce算法题 | 你能想出解法,让你的基友少氪金吗?
- Astra: Apache Cassandra的未来是云原生
- Adobe Prelude /Pl 2020软件安装包(附安装教程)
- ts流中的pcr与pts计算与逆运算
- 面部识别:攻击类型和反欺骗技术
- What? Your computer is too bad? You can handle these moves! (win10 optimization tutorial)
- Supervisor process management installation and use
- Bili Bili common API
- Wanxin Finance
- 麦格理银行借助DataStax Enterprise (DSE) 驱动数字化转型
猜你喜欢
Macquarie Bank drives digital transformation with datastex enterprise (DSE)
16. File transfer protocol, vsftpd service
解决RabbitMQ消息丢失与重复消费问题
C / C + + Programming Notes: what are the advantages of C compared with other programming languages?
Download, installation and configuration of Sogou input method in Ubuntu
Android 9.0/P WebView 多进程使用的问题
学习Scala IF…ELSE 语句
Ulab 1.0.0 release
分布式共识机制
NOIP 2012 提高组 复赛 第一天 第二题 国王游戏 game 数学推导 AC代码(高精度 低精度 乘 除 比较)+60代码(long long)+20分代码(全排列+深搜dfs)
随机推荐
iOS上传App Store报错:this action cannot be completed -22421 解决方案
Interface
laravel8更新之速率限制改进
Brief history of computer
NOIP 2012 提高组 复赛 第一天 第二题 国王游戏 game 数学推导 AC代码(高精度 低精度 乘 除 比较)+60代码(long long)+20分代码(全排列+深搜dfs)
Go sending pin and email
Idea - the. IML file was not automatically generated by the project
leetcode之判断路径是否相交
CPP (2) creating CPP project
16. File transfer protocol, vsftpd service
Swiper window width changes, page width height changes lead to automatic sliding solution
Fortify漏洞之 Privacy Violation(隐私泄露)和 Null Dereference(空指针异常)
个人短网址生成平台 自定义域名、开启防红、统计访问量
Sentry installation
Awk implements SQL like join operation
QT hybrid Python development technology: Python introduction, hybrid process and demo
模板链表类学习
你的主机中的软件中止了一个已建立的连接。解决方法
Got timeout reading communication packets解决方法
Search and replace of sed