能帮忙写一个爬取(国家统计局)福建省人口数据的Python编程吗,谢谢各位了
网页是这个
https://data.stats.gov.cn/search.htm?s=%E7%A6%8F%E5%BB%BA%E7%9C%81%E4%BA%BA%E5%8F%A3%E8%80%81%E9%BE%84%E5%8C%96
拜托拜托
爬取的是福建省最近20年(2001-2020)的“年末常住人口(单位:万人)”的数据
网站:https://data.stats.gov.cn/easyquery.htm?cn=E0103
import requests,json
url='https://data.stats.gov.cn/easyquery.htm?m=QueryData&dbcode=fsnd&rowcode=zb&colcode=sj&wds=%5B%7B%22wdcode%22%3A%22reg%22%2C%22valuecode%22%3A%22350000%22%7D%5D&dfwds=%5B%7B%22wdcode%22%3A%22sj%22%2C%22valuecode%22%3A%22LAST20%22%7D%5D&k1=1637372561679'
headers = {'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36 Edg/95.0.1020.53',
'Cookie':'_trs_uv=kw75228h_6_gn2o; JSESSIONID=MzQ6-mYMJtm33JCeBw7O3B7YLB_JRT2jiJUTfWTltvFE40CpDtdp!-958151929; u=5; experience=show',
'Accept':'application/json, text/javascript, */*; q=0.01'}
requests.packages.urllib3.disable_warnings()
response = requests.get(url,headers=headers,verify=False)
response.encoding='utf-8'
dic = json.loads(response.text)
lst = dic['returndata']['datanodes'][:20]
#print(lst)
result={}
for item in lst:
result[item['code'][-4:]]=item['data']['strdata']
print(result)
我或许可以帮帮你
国家统计局的数据用selenium爬取是最简单的