爬取楼盘图片,名字和地址,都只能爬第一个就结束了
以下是代码
import urllib.request
from bs4 import BeautifulSoup
url='网页地址'
headers={'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/100.0.4896.127 Safari/537.36'}
request=urllib.request.Request(url=url,headers=headers)
res=urllib.request.urlopen(request)
soup=BeautifulSoup(res,'html.parser')
items=soup.find_all('ul',class_="resblock-list-wrapper")
for item in items:
mingzi=item.find('a')['title']
dizhi=item.find('a',class_='resblock-location')['title']
tp_mingzi=item.find('img')['alt']+'.jpg'
tp_url=item.find('img')['data-original']
tp_data=urllib.request.Request(url=tp_url,headers=headers)
tp_lujing='D:\\python\\项目\\4题1\\4题1图片'+tp_mingzi
response=urllib.request.urlopen(tp_data)
r=response.read()
with open(tp_lujing,'wb') as f:
f.write(r)
print(tp_mingzi,'爬取成功')
with open('D:/python/项目/4题1/数据.txt','a') as f:
f.write(f'{mingzi,dizhi}\n')
运行结果
D:\python\项目\hi\venv\Scripts\python.exe D:/python/项目/hi/venv/4题1.py
(第一个的)楼盘图片.jpg 爬取成功
进程已结束,退出代码0