请问爬取的标题和url地址,如何写入新建的excel文件?

已经提取到了知乎的标题和url,
请问如何将提取到的50个知乎的文字标题和网址,写入新建的excel文档里 ? 请写下最常用最简单的方式(2到3种方法,附上详细代码;)

不要发链接文章, 请贴相关的详细代码给我,并注释清楚;

希望得到的输出结果:

img

img

示例代码如下,使用openpyxl和file来生成Excel和csv


import requests
import re
 
# 1.headers请求头伪装 模拟浏览器
headers = {
   'cookie': '_zap=16257db5-03b1-453b-b96a-dc0d9e28430a; d_c0="AJARGipIqBSPTpravYTtrnrGti1R3KNWYzI=|1647670590"; _xsrf=nqqT8B0425ElxvkY72cUsiAgRUflT6SZ; __snaker__id=GHotV9A6SS36hVC1; _9755xjdesxxd_=32; YD00517437729195%3AWM_TID=E1SsyH02vqZERQUBFRc67t2OAp0JyovX; q_c1=553b1c2e054f4247a0cefdba1081f088|1647843124000|1647843124000; BAIDU_SSP_lcr=https://browser.360.cn/; gdxidpyhxdE=3EDyh8gaaxVDcW6JbPc8lOU0DlDdH6n5fy92Sm%2Bkdh3vdqjWxgXWeHH5n1CsUCg19BEIxWG%5CdavWVf2Hh2auNCAUpjb4V9qMEQ9hOK5wSR%2B76w8MUY0I%2FmCpk%2FpVC9duC3OUgdBGSgJpbUpt9IQSvbqgsAo%5C2Cp%2BRrrV9LQYO9U%2BVGOv%3A1652814040189; YD00517437729195%3AWM_NI=LNMX2N2idXvWXTV%2FfjVXOlvmOqwDV8XNRSnkxdcebmlQjHj%2FFb6pXMTxwgl1zzr3RLnCDr6kzr%2FBBgFwMe3EPR5USaMawb5F9SrBBxeS4WKabfIwa6gMQYPy%2BOEj%2Bo%2BpalE%3D; YD00517437729195%3AWM_NIKE=9ca17ae2e6ffcda170e2e6eeb6d76b92a79893f346b1b48ab7c14b829b9bb1c44ba286afaafc3391b097a5ee2af0fea7c3b92a909283b9b146f2b7acb9f55d9c8c9ad0e5629688818df5678dafb9a5d5349c929eb4ed61bcbcad86e167b08fa6a9e92586befa91db4798a8b988ef3bbaa6bed4d148ae9d8c93b561baaee1a3c16d86a98386b24f948efbb6d26b8fbfb983b84efb8dbea3c174a78dfcb8ca6ba2ba8798e43ea5939fbad373e9918b8baa65e9b0af8dc837e2a3; z_c0=2|1:0|10:1652813172|4:z_c0|92:Mi4xWlJIOUNBQUFBQUFBa0JFYUtraW9GQ1lBQUFCZ0FsVk5kRHR4WXdDMEFYNmcyOFlUNjAycGRkV0QweWdpSUxlVy13|5888ddf291231cfaa655955528a600b90af7df422dd1bdf1f06bf78f1edb6b75; Hm_lvt_98beee57fd2ef70ccdd5ca52b9740c49=1660163660; q_c1=553b1c2e054f4247a0cefdba1081f088|1661472035000|1647843124000; NOT_UNREGISTER_WAITING=1; tst=h; SESSIONID=B4tGCSftAo6Gl9YBcoLMJtXSsFRBFfaTyWjI0P3jCQn; JOID=U10RBkk3QYIOs2P5RDO2mjCZL1ZVcXT3TvM0njF1KrVI2FmkBzhdjWm0ZvlCckopgGyp9iIf2uZbCp900agsS5o=; osd=VFsVAkswR4YKsWT_QDe0nTadK1RSd3DzTPQymjV3LbNM3FujATxZj26yYv1AdUwthG6u8CYb2OFdDpt21q4oT5g=; Hm_lpvt_98beee57fd2ef70ccdd5ca52b9740c49=1661541389; ariaDefaultTheme=undefined; KLBRSID=cdfcc1d45d024a211bb7144f66bda2cf|1661541390|1661538859',
   'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/95.0.4638.69 Safari/537.36'
        }
# 2.确定发送请求网址
url='https://www.zhihu.com/hot'
 
response = requests.get(url=url, headers=headers)
 
title=re.findall('<h2 class="HotItem-title">(.*?)</h2>',response.text) 
link=re.findall('class="HotItem-content"><a href="(.*?)" title="',response.text,re.S)

##方法一,生成Excel文件
from openpyxl import Workbook#注意pip install openpyxl模块
wb=Workbook()
ws=wb.active
ws.append(['标题','链接url'])#表头
for i in range(len(title)):
    ws.append([title[i],link[i]]) 
wb.save('知乎-热门.xlsx')


#方法二,直接IO写csv文件
s='"标题","链接url"'
for i in range(len(title)):
    s+="\n"+'"%s","%s"'%(title[i],link[i])
f=open('知乎-热门.csv','w',encoding='utf-8-sig')
f.write(s)
f.close()


参考这个:https://blog.csdn.net/guoxuying/article/details/112470819


# 保存数据
def save_data_exel(all_info):
    book = xlwt.Workbook(encoding="utf-8")   # 打开新工作薄
    sheet = book.add_sheet('新发地', cell_overwrite_ok=True)  # cell_overwrite_ok=Tru时重复写入单元格不会报错
    col = list(all_info.keys())   # col获取all_info的键将作为标题头写入exel
    value = list(all_info.values())   # value则是讨论内容写入exel
    for i in range(10):           # 标题头写入操作
        sheet.write(0, i, col[i])
    # 根据all_info的格式特点采取按行储存数据
    for j in range(10):   # 范围可自己设定
        for k in range(10):  
            sheet.write(j, k, value[j][k])  
 
    book.save('文件名.xls')   # 文件名+保存Exel文件
您好,我是有问必答小助手,您的问题已经有小伙伴帮您解答,感谢您对有问必答的支持与关注!
PS:问答VIP年卡 【限时加赠:IT技术图书免费领】,了解详情>>> https://vip.csdn.net/askvip?utm_source=1146287632