xxxx18一60岁hd中国/日韩女同互慰一区二区/西西人体扒开双腿无遮挡/日韩欧美黄色一级片 - 色护士精品影院www

  • 大小: 3KB
    文件類型: .py
    金幣: 1
    下載: 0 次
    發布日期: 2021-01-09
  • 語言: Python
  • 標簽: python??搜索引擎??

資源簡介

了解google類似的搜索引擎是怎么實現的;攫取搜索真相。 原理描述請見:http://gaolizhong666.blog.163.com/blog/static/11561504220136242819683/

資源截圖

代碼片段和文件信息

‘‘‘
Created?on?2013-7-2

@author:?glz.shinow
‘‘‘
#the?search?engine?is?divided?into?3?modules:web?crawlbuild?and?use?of?indexpage?rank

#----------------------------web_crawl--------------------------------
def?get_page(url):
????try:
????????import?urllib
????????return?urllib.urlopen(url).read()
????except:
????????return?““
????
def?get_next_target(page):
????start_link?=?page.find(‘????if?start_link?==?-1:
????????return?None?0
????start_quote?=?page.find(‘“‘?start_link)
????end_quote?=?page.find(‘“‘?start_quote?+?1)
????url?=?page[start_quote?+?1:end_quote]
????return?url?end_quote

def?get_all_links(page):
????links?=?[]
????while?True:
????????url?endpos?=?get_next_target(page)
????????if?url:
????????????links.append(url)
????????????

評論

共有 條評論