Build Your Own Web Search Engine from Scratch
2020-11-06 20:53:00 【Python advanced】
Preface
Most of us are already familiar with search engines: the user types text into an input box in the browser, the engine displays the results it considers a match for the query, and you simply click the one you want.
【I. Project preparation】
Browser: 360 Browser
Editor: Sublime Text 3
Plugin: jquery-3.2.1.min.js
【II. Project implementation】
Because the goal is a web search engine, we rely on the "three swordsmen" of the web (HTML + CSS + JavaScript) to implement it.
1. Open Baidu and analyze the page structure
First, take a look at Baidu's search page:
You can see that some attributes are set on the search box, such as turning off autocomplete. Next, run a search and see what changes:
You can see the keywords we searched for, which reveals the pattern of the request:
https://www.baidu.com/s? + query string parameters
This makes up a complete GET request. Many of the parameters can be omitted, so we keep only the important part. After testing, we arrive at the following:
https://www.baidu.com/s?wd=keyword
This is the address of the request interface. Replace keyword with any search term to run the query and jump to the corresponding results page.
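As a minimal sketch of the pattern above, the function below builds a search address from a keyword. The name buildSearchUrl is hypothetical (it is not in the original code), and encodeURIComponent is used on the assumption that keywords may contain spaces or non-ASCII characters that must be escaped in a query string.

```javascript
// Hypothetical helper, not part of the original project.
// Builds the Baidu results-page address for a keyword.
function buildSearchUrl(keyword) {
  // encodeURIComponent escapes spaces, "&", "?", and non-ASCII text
  // so the keyword stays a single wd parameter.
  return 'https://www.baidu.com/s?wd=' + encodeURIComponent(keyword);
}

console.log(buildSearchUrl('web search'));
// prints https://www.baidu.com/s?wd=web%20search
```

Without the encoding step, a keyword such as a&b would be split into two parameters and the search would run on a alone.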
2. Write the HTML input box and search button
If you have read my earlier series of HTML articles, none of this will be confusing.
<html>
<head>
<title></title>
<style type="text/css">
*{ /* reset inner and outer margins to 0 */
margin:0;
padding:0
}
input{
width:300px;
height:30px
}
span{
position:absolute; /* absolute positioning */
background-color:red; /* background color */
border:1px solid gray; /* border */
width:60px;
height:32px;
text-align:center /* horizontal text alignment */
}
span:hover{ /* hover style */
background-color:blue
}
</style>
</head>
<body>
<input type="text" name="" placeholder="Please enter what you want to search for"> <!-- text box -->
<span>search</span> <!-- search button -->
<ol></ol> <!-- empty ordered list; the script in step 4 fills it with suggestions -->
</body>
</html>
After writing it, open the page in the browser:
It already looks a bit like a browser search box.
3. Import the jQuery plugin
<script src='jquery-3.2.1.min.js'></script>
4. Write the JS script
This is the most important part. Open the Network panel in the browser's developer tools and continue the analysis:
You can see that the search results are in the response. Then open the request URL. After many experiments, it turns out that only the parameters marked in the screenshot change:
So we can conclude that we only need to change these two values.
1). Create and delete the script tag
So I first create a script tag that can be removed as soon as it is no longer needed. This avoids tags piling up and using memory, which would make the page slow to open and degrade performance:
var script=document.createElement('script'); // create the script tag
script.id='jsonp'; // set its id to jsonp
script.src='https://www.baidu.com/sugrec?prod=pc&cb=getData&wd='+encodeURIComponent(wd); // set its address; wd is the text typed by the user
document.body.appendChild(script); // append the script element to body
Then, once it is no longer needed, delete it:
var script=document.querySelector('#jsonp'); // select the element whose id is jsonp
script.parentNode.removeChild(script); // remove it from its parent element
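The create-then-delete steps can also be wrapped in one function. This is only a sketch: loadJsonp and its parameters are hypothetical names, and the document object is passed in as doc so the function can be tried with a stand-in object outside a browser. Removing the tag when loading finishes or fails is an assumption about when the tag stops being needed.

```javascript
// Hypothetical helper (not from the original code): inject a script tag
// for a JSONP request and remove it once it has loaded or failed.
function loadJsonp(doc, url, id) {
  var script = doc.createElement('script');
  script.id = id;
  script.src = url;

  function cleanup() {
    // The tag may already have been removed by the callback.
    if (script.parentNode) {
      script.parentNode.removeChild(script);
    }
  }
  script.onload = cleanup;   // the response script has run
  script.onerror = cleanup;  // the request failed

  doc.body.appendChild(script);
  return script;
}

// In a browser:
// loadJsonp(document, 'https://www.baidu.com/sugrec?prod=pc&cb=getData&wd=test', 'jsonp');
```

Handling onerror as well means a failed request does not leave a stray tag behind, which the manual version would do because its callback never runs.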
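2). Generate the options drop-down menu
In the browser, as soon as you type some text, matching options pop up for you to choose from. So how does this happen? The cb=getData parameter in the address names a callback, so the script that comes back calls getData with the data.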
<script>
function getlist(wd){ /* request the drop-down list for the keyword wd */
var script=document.createElement('script'); /* create the script tag */
script.id='jsonp'; /* set its id to jsonp */
script.src='https://www.baidu.com/sugrec?prod=pc&cb=getData&wd='+encodeURIComponent(wd); /* set its address, with the keyword encoded */
document.body.appendChild(script); /* append the script element to body */
}
function getData(data){ /* callback named by cb=getData; receives the data */
var script=document.querySelector('#jsonp'); /* select the element whose id is jsonp */
if(script){
script.parentNode.removeChild(script); /* remove it from its parent element */
}
$('ol').html(''); /* empty the ordered list */
var da=data.g; /* get the search results */
if(da){ /* if there are results, put each one in an li tag */
da.forEach(function(item){
var link=$('<a target="_blank"></a>')
.attr('href','https://www.baidu.com/s?wd='+encodeURIComponent(item.q))
.text(item.q); /* text() inserts the keyword as plain text, not as HTML */
$('<li></li>').append(link).appendTo('ol');
})
}
}
/* runs each time a key is released in the text box */
$('input:text').keyup(function(){
var wd=$(this).val(); /* the value of the input box */
if(wd==''){ /* if the value is empty, hide the list and skip the request */
$('ol').css('display','none');
$('ol').css('zIndex',-10);
return;
}
$('ol').css('display','block'); /* otherwise show the list */
$('ol').css('zIndex',20);
getlist(wd);
});
</script>
You can see that the search results appear, and the li tags under the ordered list are generated accordingly.
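To make the data handling easier to follow, here is a sketch that turns a response into list markup as a plain string, without jQuery. It assumes the response shape used by getData above (a g array whose items carry the keyword in q). The names escapeHtml and renderSuggestions are hypothetical, and the sample data is made up.

```javascript
// Hypothetical helpers, shown only to illustrate the data flow.
function escapeHtml(text) {
  // Replace characters that would otherwise be parsed as markup.
  return String(text)
    .replace(/&/g, '&amp;')
    .replace(/</g, '&lt;')
    .replace(/>/g, '&gt;')
    .replace(/"/g, '&quot;')
    .replace(/'/g, '&#39;');
}

function renderSuggestions(data) {
  var items = (data && data.g) || [];   // no g field means no suggestions
  return items.map(function (item) {
    var href = 'https://www.baidu.com/s?wd=' + encodeURIComponent(item.q);
    return '<li><a target="_blank" href="' + href + '">' +
      escapeHtml(item.q) + '</a></li>';
  }).join('');
}

// Sample data made up for illustration, not a real response.
console.log(renderSuggestions({ g: [{ q: 'web search' }, { q: 'a<b' }] }));
```

The keyword is encoded for the address and escaped for the visible text separately, since the two contexts have different rules.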
3). Number the options
The options now appear, but I want to give each one a sequence number so you can tell how many search results there are. There are many marker styles to choose from: numbers, uppercase or lowercase letters, or Roman numerals. Here I choose numbers because it is simple.
With that, the feature works as intended. Isn't it amazing? Go and give it a try.
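The code for the numbering step is not shown, so the following is only a guess at one way to do it. It sets the CSS list-style-type property on the list. The values decimal, lower-alpha, upper-alpha, and upper-roman are standard CSS, while the function name setListMarker and the kind labels are hypothetical.

```javascript
// Hypothetical helper: choose the marker style of an ordered list.
var MARKER_STYLES = {
  number: 'decimal',       // 1, 2, 3
  lower: 'lower-alpha',    // a, b, c
  upper: 'upper-alpha',    // A, B, C
  roman: 'upper-roman'     // I, II, III
};

function setListMarker(listElement, kind) {
  // Fall back to numbers when the kind is not one of the known labels.
  var known = Object.prototype.hasOwnProperty.call(MARKER_STYLES, kind);
  listElement.style.listStyleType = known ? MARKER_STYLES[kind] : 'decimal';
  return listElement.style.listStyleType;
}

// In a browser: setListMarker(document.querySelector('ol'), 'number');
```

Because the page's style reset sets padding to 0, the markers may end up outside the visible area. Setting list-style-position to inside on the list is one way to keep them in view.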
4). Search refresh
By this point the feature is essentially complete: clicking any li entry opens the corresponding page. So I decided to add a refresh feature as well, one that reloads the page from the server:
<span onclick='window.location.reload()'>search</span> <!-- click to refresh immediately -->
【III. Project summary】
Overall, this is a good practice project for beginners, and I hope you get something out of it.
If you need the source code, reply with the four-character keyword "Search engine" in the backend to get it. To learn more about Python web crawlers and data mining, visit http://pdcfighting.com/
Copyright notice
This article was written by [Python advanced]. Please include a link to the original when reposting. Thank you.