爬虫小bai,遇见问题
浏览器为edge
pycharm用的最新版本
写出代码,但是运行不了,又没找出错误😢
望big old指点迷津🤤
import requests
import json
if name=="main":
get_url = " https://movie.douban.com/j/new_search_subjects?%22
start = input('请输入你想看的名次:')
params = {
' sort': 'U',
'range': '0, 1', #
'tags': ' ',
'start': 'start', #用start来动态的控制你想看的名次
'genres': '科幻',
}
#进行UA伪装
headers = {
'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.5005.124 Safari/537.36 Edg/102.0.1245.44'
}
response = requests.get(url=get_url,params=params,headers=headers)
dic_json = response.json()
wjm = start+'.json'
fp = open(wjm, 'w', encoding='utf-8')
json.dump(dic_json,fp=fp,ensure_ascii=False)
print('爬取成功')
' sort': 'U', sort前面多了个空格
'tags': '', 引号中不要有空格
'start': start, 冒号后的start不要放引号中
'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.5005.124 Safari/537.36 Edg/102.0.1245.44'
前面少了'User-Agent':
应该是
'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.5005.124 Safari/537.36 Edg/102.0.1245.44'
你题目的解答代码如下:
import requests
import json
if __name__ == "__main__":
get_url = "https://movie.douban.com/j/new_search_subjects?"
start = input('请输入你想看的名次:')
params = {
'sort': 'U',
'range': '0,10',
'tags': '',
'start': start,
'genres': '科幻',
}
#进行UA伪装
headers = {
'User-Agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.5005.124 Safari/537.36 Edg/102.0.1245.44'
}
response = requests.get(url=get_url,params=params,headers=headers)
dic_json = response.json()
wjm = start+'.json'
fp = open(wjm, 'w', encoding='utf-8')
json.dump(dic_json,fp=fp,ensure_ascii=False)
print(dic_json)
print('爬取成功')
如有帮助,请点击我的回答下方的【采纳该答案】按钮帮忙采纳下,谢谢!
程序有报错么,把报错贴一下