xxxx18一60岁hd中国/日韩女同互慰一区二区/西西人体扒开双腿无遮挡/日韩欧美黄色一级片 - 色护士精品影院www

  • 大小: 4.31MB
    文件類型: .zip
    金幣: 2
    下載: 0 次
    發(fā)布日期: 2023-08-10
  • 語言: 其他
  • 標(biāo)簽: 新浪微博??爬蟲??

資源簡介

本資源為新浪微博爬蟲,目前支持針對(duì)用戶爬取、針對(duì)超話爬取、針對(duì)地點(diǎn)爬取三種模式。
爬取的信息有:rid、用戶名稱、微博等級(jí)、微博內(nèi)容、微博轉(zhuǎn)發(fā)量、微博評(píng)論量、微博點(diǎn)贊、發(fā)布時(shí)間 發(fā)布設(shè)備、話題名稱、@用戶、搜索地點(diǎn)以及用戶發(fā)過的照片等;詳情使用請(qǐng)看文檔里的ReadME說明。

資源截圖

代碼片段和文件信息

#!/usr/bin/env?python3
#?-*-?coding:?utf-8?-*-
“““
Created?on?Mon?Apr??8?10:44:58?2019

@author:?chenjianyao
“““
import?xlrd
import?xlwt
from?xlutils.copy?import?copy

def?write_excel_xls(path?sheet_name?value):
????index?=?len(value)??#?獲取需要寫入數(shù)據(jù)的行數(shù)
????workbook?=?xlwt.Workbook()??#?新建一個(gè)工作簿
????sheet?=?workbook.add_sheet(sheet_name)??#?在工作簿中新建一個(gè)表格
????for?i?in?range(0?index):
????????for?j?in?range(0?len(value[i])):
????????????sheet.write(i?j?value[i][j])??#?像表格中寫入數(shù)據(jù)(對(duì)應(yīng)的行和列)
????workbook.save(path)??#?保存工作簿
????print(“xls格式表格寫入數(shù)據(jù)成功!“)

def?read_excel_xls(path):
????data?=?[]
????workbook?=?xlrd.open_workbook(path)??#?打開工作簿
????sheets?=?workbook.sheet_names()??#?獲取工作簿中的所有表格
????worksheet?=?workbook.sheet_by_name(sheets[0])??#?獲取工作簿中所有表格中的的第一個(gè)表格
????if?worksheet.nrows?==?1:
????????print(“目前是第一行“)
????else:
????????for?i?in?range(1?worksheet.nrows):?#從第二行取值
????????????dataTemp?=?[]
????????????for?j?in?range(0?worksheet.ncols):
????????????????#print(worksheet.cell_value(i?j)?“\t“?end=““)??#?逐行逐列讀取數(shù)據(jù)
????????????????dataTemp.append(worksheet.cell_value(i?j))
????????????data.append(dataTemp)
????return?data
?????
def?write_excel_xls_append_norepeat(path?value):
????workbook?=?xlrd.open_workbook(path)??#?打開工作簿
????sheets?=?workbook.sheet_names()??#?獲取工作簿中的所有表格
????worksheet?=?workbook.sheet_by_name(sheets[0])??#?獲取工作簿中所有表格中的的第一個(gè)表格
????rows_old?=?worksheet.nrows??#?獲取表格中已存在的數(shù)據(jù)的行數(shù)
????new_workbook?=?copy(workbook)??#?將xlrd對(duì)象拷貝轉(zhuǎn)化為xlwt對(duì)象
????new_worksheet?=?new_workbook.get_sheet(0)??#?獲取轉(zhuǎn)化后工作簿中的第一個(gè)表格
????rid?=?0
????for?i?in?range(0?len(value)):
????????data?=?read_excel_xls(path)
????????data_temp?=?[]
????????for?m?in?range(0len(data)):
????????????data_temp.append(data[m][1:len(data[m])])
????????value_temp?=?[]
????????for?m?in?range(0len(value)):
????????????value_temp.append(value[m][1:len(value[m])])
????????
????????if?value_temp[i]?not?in?data_temp:
????????????for?j?in?range(0?len(value[i])):
????????????????new_worksheet.write(rid+rows_old?j?value[i][j])??#?追加寫入數(shù)據(jù),注意是從i+rows_old行開始寫入
????????????rid?=?rid?+?1
????????????new_workbook.save(path)??#?保存工作簿
????????????print(“xls格式表格【追加】寫入數(shù)據(jù)成功!“)
????????else:
????????????print(“數(shù)據(jù)重復(fù)“)
????
?????
????
???

?屬性????????????大小?????日期????時(shí)間???名稱
-----------?---------??----------?-----??----
?????目錄???????????0??2019-12-21?14:08??weiboSpider\
?????文件???????14412??2019-07-01?18:43??weiboSpider\README.md
?????目錄???????????0??2019-12-21?14:07??weiboSpider\driver\
?????文件?????8393728??2019-07-12?09:30??weiboSpider\driver\chromedriver.exe
?????文件????????2671??2019-07-23?17:19??weiboSpider\driver\excelSave.py
?????文件???????10235??2019-07-23?17:19??weiboSpider\driver\weiboTest.py
?????文件????????2671??2019-07-23?17:19??weiboSpider\excelSave.py
?????目錄???????????0??2019-12-21?14:08??weiboSpider\locationPic\
?????文件??????????42??2019-07-01?18:43??weiboSpider\requirements.txt
?????文件????????8012??2019-12-15?09:52??weiboSpider\searchKeyword.py
?????文件?????????724??2019-07-25?15:22??weiboSpider\test.py
?????文件???????18446??2019-07-30?16:32??weiboSpider\updateWeiboUser.py
?????目錄???????????0??2019-12-21?14:08??weiboSpider\weibo\
?????文件???????12303??2019-12-15?09:54??weiboSpider\weiboLocation.py
?????文件??????108544??2019-12-15?09:59??weiboSpider\weiboLocation.xls
?????文件???????10184??2019-12-15?09:53??weiboSpider\weiboSuperWords.py
?????文件???????19088??2019-10-04?19:12??weiboSpider\weiboUser.py
?????文件??????????27??2019-12-15?09:59??weiboSpider\weiboUsers.csv

評(píng)論

共有 條評(píng)論