这么耿直吗,换个字段正则不好吗
import requests
import re
from lxml import etree
a = "https://wx.lianjia.com/zufang/nanshao/"
u = requests.get(a)
p1 = re.findall('<span class="content__list--item--time oneline">(.*?)</span>',u.text)
p2 = re.findall('<i class="content__item__tag--is_key">(.*?)</i>',u.text)
print(p1,p2)
x = etree.HTML(u.text)
spen = x.xpath('//*[@id="content"]/div[1]/div[1]/div[1]/div/p[3]/span/text()')
print(spen)
要是用xpath的话,可以在网站上直接复制xpath路径,也可以自己看html结构写xpath
或者用re,bs4也可以