Problem with a Python regular expression


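This program prints nothing at the end, even though fetching the page on its own prints the HTML fine. How should I change the regular expression?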

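Remove the trailing r'</ul>' from the pattern. With it, nothing matches, because in the page the closing </p> is not immediately followed by </ul>.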

Example code:

import requests as pc
import re
def get_html(url):
    headers={'user-agent':'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/97.0.4692.71 Safari/537.36'}
    res=pc.get(url,headers=headers)
    res.encoding='utf-8'
    return res.text

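def get_movie(html):
    # re.S lets '.' match newlines, since each <ul> block spans several lines
    pattern=re.compile(
        r'<ul class="row" data-com="hrefTo.*?>'
        r'.*?first-line">(.*?)</p>'
        r'.*?second-line">(.*?)</p>',
        re.S)
    r=pattern.findall(html)
    # findall returns an empty list when nothing matches, so check before indexing
    if r:
        print(r[0])
    else:
        print('no match')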
def main():
    url="https://piaofang.maoyan.com/rankings/year"
    html=get_html(url)
    get_movie(html)

if __name__=='__main__':
    main()
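
To see why the trailing `</ul>` matters without hitting the network, here is a minimal offline sketch. The HTML fragment is made up for illustration and is only assumed to resemble the real page (other elements sit between the second `</p>` and the closing `</ul>`). It also assumes the original pattern had `r'</ul>'` appended directly after the second `</p>`. The names `SAMPLE_HTML`, `HEAD`, `with_ul` and `without_ul` are hypothetical.

```python
import re

# Hypothetical fragment, only assumed to resemble the ranking page:
# extra markup sits between the second-line </p> and the closing </ul>.
SAMPLE_HTML = '''
<ul class="row" data-com="hrefTo,href:'/movie/1'">
  <li class="col0">1</li>
  <li class="col1">
    <p class="first-line">Movie A</p>
    <p class="second-line">2021-01-01</p>
  </li>
  <li class="col2">100</li>
</ul>
'''

# Shared part of the pattern, taken from the answer's code.
HEAD = (r'<ul class="row" data-com="hrefTo.*?>'
        r'.*?first-line">(.*?)</p>'
        r'.*?second-line">(.*?)</p>')

# Assumed original: demands </ul> directly after the second </p>.
with_ul = re.compile(HEAD + r'</ul>', re.S)
# Fixed version: the trailing </ul> is dropped.
without_ul = re.compile(HEAD, re.S)

strict_matches = with_ul.findall(SAMPLE_HTML)
loose_matches = without_ul.findall(SAMPLE_HTML)

print(strict_matches)  # [] because "</p></ul>" never occurs in the fragment
print(loose_matches)   # [('Movie A', '2021-01-01')]
```

If the pattern still needs to end at the closing tag, writing `.*?</ul>` instead of a bare `</ul>` would tolerate the markup in between, as long as `re.S` is set.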
