使用requests库进行网页爬取,运行时却一直没有显示,只有文件地址是怎么回事?
代码如下:
import requests
def getHTML(url):
try:
r = requests.get(url)
#print(r.status_code)
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except:
return r.status_code
if name == "main":
url = "https://www.nationalgeographic.com/"
s = getHTML(url)
print(s[100])
print(s[100])
改成
print(s)
你题目的解答代码如下:
import requests
def getHTML(url):
try:
r = requests.get(url)
#print(r.status_code)
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except:
return r.status_code
if __name__ == '__main__':
url = "https://www.nationalgeographic.com/"
s = getHTML(url)
# print(s[100])
print(s)
如有帮助,请点击我的回答下方的【采纳该答案】按钮帮忙采纳下,谢谢!
你要想获取什么内容??
暂时只能将你的抓取的网页信息打印出来:
import requests
def getHTML(url):
try:
r = requests.get(url)
#print(r.status_code)
r.raise_for_status()
r.encoding = r.apparent_encoding
return r.text
except:
return r.status_code
if __name__ == '__main__':
url = "https://www.nationalgeographic.com/"
s = getHTML(url)
print(s)