I need to match the following URLs in a web page. The match should start at the URL itself, not at href:
href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com%2Fshare%2Flink%3Fuk%3D2803502175%26shareid%3D3310887851%26third%3D0"
href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com%2Fshare%2Fhome%3Fuk%3D981206555%26view%3Dshare"
href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com%2Fshare%2Flink%3Fuk%3D1075874930%26shareid%3D3128951413%26third%3D0"
How should the Python regular expression be written? Any help is appreciated. I have been stuck on this for a long time.
import re
reg = r'href="(.+?)"'  # non-greedy, so each href value is captured separately
pattern = re.compile(reg, re.S)
I believe the flag that makes the dot match every character, including newlines, is S (re.S). Give it a try.
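As a rough illustration of how re.S interacts with a greedy quantifier, here is a minimal sketch. The `html` string is a made-up page fragment built from two of the links in the question, and the variable names are only for illustration. With re.S a greedy `.+` can run across lines and swallow several links at once, while a `[^"]+` capture stops at the closing quote and does not need the flag.

```python
import re

# Hypothetical page fragment assembled from two links in the question.
html = (
    '<a href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com'
    '%2Fshare%2Fhome%3Fuk%3D981206555%26view%3Dshare">home</a>\n'
    '<a href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com'
    '%2Fshare%2Flink%3Fuk%3D1075874930%26shareid%3D3128951413%26third%3D0">link</a>\n'
)

# Greedy .+ combined with re.S: the dot also matches newlines, so the single
# match runs from the first href value up to the last double quote in the text.
greedy = re.findall(r'href="(.+)"', html, re.S)

# [^"]+ cannot cross a double quote, so every href value is captured on its own.
links = re.findall(r'href="([^"]+)"', html)

print(len(greedy), len(links))
for link in links:
    print(link)
```

Each captured string starts at http, so the match result begins at the URL rather than at href.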
http.+?pan\.baidu\.com[^"]+
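If the goal is the pan.baidu.com address carried in the url= parameter, one possible approach is to capture that part and percent-decode it. This is only a sketch under that assumption: the `html` fragment and the names `target_re`, `encoded` and `decoded` are invented for the example, while `urllib.parse.unquote` is the standard-library function for percent-decoding.

```python
import re
from urllib.parse import unquote

# Hypothetical page fragment assembled from two links in the question.
html = (
    '<a href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com'
    '%2Fshare%2Fhome%3Fuk%3D981206555%26view%3Dshare">home</a>\n'
    '<a href="http://redirect.wangpansou.cn/redirect.php?url=http%3A%2F%2Fpan.baidu.com'
    '%2Fshare%2Flink%3Fuk%3D2803502175%26shareid%3D3310887851%26third%3D0">link</a>\n'
)

# Capture everything after "url=" up to the closing quote, requiring that the
# captured text mentions pan.baidu.com (dots escaped so they match literally).
target_re = re.compile(r'url=(http[^"]*?pan\.baidu\.com[^"]+)')

encoded = target_re.findall(html)
# Turn %3A, %2F, %3F, %3D, %26 back into : / ? = &
decoded = [unquote(u) for u in encoded]

for url in decoded:
    print(url)
# http://pan.baidu.com/share/home?uk=981206555&view=share
# http://pan.baidu.com/share/link?uk=2803502175&shareid=3310887851&third=0
```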