以下str是某篇文章的拼音化,如果提取出现频率最高的5组字符串(贪婪模式)。
str=‘gongchandangxuanyanbaokuoyinyanhezhengwensizhang1872nian1893nianmakesiheengesixianhouweixuanyandedewenewenyingwenbolanwenyidaliwenbanzhuanxieqipianxuyanqipianxuyanjianyaoshuomingxuanyandejibensixiangjiqizaiguojigongchanzhuyiyundongzhongdelishidiweizhimingxuanyandelilunyuanlishilishiweiwuzhuyibinggenjuwuchanjiejigemingdejingyanhejiaoxunduixuanyanzuobuchonghexiugaigongchandangxuanyanzhongwenbanfengmiangongchandangxuanyanzhongwenbanfengmianxuanyandiyizhangzichanzhehewuchanzhelunshumakesizhuyidejiejidouzhengxueshuozhuyaoshiyimakesizhuyijiejidouzhengxueshuojieshizibenzhuyibiranmiewanggongchanzhuyibiranshenglidekeguanguilüzhichuwuchanjiejidelishishimingbenzhangzhichuziyuanshishizushehuijietiyilaizhijinyiqieshehuidelishidoushijiejidouzhengdelishijiejidouzhengxueshuoshimakesizhuyixueshuodejichuyeshiguanchuanxuanyandeyitiaohongxiantongguoduijiejihejiejidouzhengdefenximakesiheengesidechujielunzichanjiejidemiewanghewuchanjiejideshenglishitongyangbukebimiandewuchanjiejibixubatuifanzichanjiejitongzhijianlizijidetongzhijinerzaiquanshijiexiaomieyiqiebaoxuejiejihebaoxuezhidujiefangquanrenleizuizhongshixiangongchanzhuyizuoweizijidelishishimingxuanyandierzhangwuchanzhehegongchandangrenshuomingwuchanjiejizhengdangdexingzhitedianmudeherenwuyijigongchandangdelilunheganglingzhuyaoshijieshiwuchanjiejiyaoshixianzijidelishishimingbixuyouwuchanjiejizhengdanglailingdaojinerchanshugongchandangdexingzhitedianherenwuchanmingwuchanjiejishixianlishishimingdegenbendaoluzhichugongchandangshiwuchanjiejidezhengdangshiwuchanjiejidexianfengduidaibiaowuchanjiejidegenbenliyibumouqiurenhesiligongchandangrendezuijinmudeshishiwuchanjiejixingchengweijiejituifanzichanjiejidetongzhiyouwuchanjiejijianlizhengquanqizuizhongfendoumubiaoshixiaomiesiyouzhizuizhongshixiangongchanzhuyiweileshixianzheyilishishimingshouxianshiduoquzhengquanjianliwuchanjiejizhuanzhengranhouliyongzijidezhengquanduishengchanziliaosiyouzhijinhangshehuizhuyigaizaozaishehuizhuyigongyouzhijianlihoujishibagongzuozhongxinzhuanyidaojingjijiansheshanglaiweixiangshehuizhuyiguoduchuangzaotiaojiansuizheshehuidechongfenfazhandangshengchanziliaosiyouzhihejiejixiaomieyihouguojiaxiaowangdaonageshihoudaitinacunzaizhejiejihejiejiduilidezichanjiejijiushehuidejiangshizheyangyigelianhetizainalimeigerendeziyoufazhanshiyiqierendeziyoufazhandetiaojianjijinrugongchanzhuyishehuixuanyandisanzhangshehuizhuyidehegongchanzhuyidewenxianpipandangshiliuhangdegezhongjiashehuizhuyifenxigezhongjiashehuizhuyiliupaichanshengdeshehuilishitiaojianzhichuqililunyaohaishehuigenyuanjiejishizhihezuizhongbeitaotaidelishibiranxingcongerhuaqingkexueshehuizhuyiheqitashehuizhuyidejiexian[5]fenwei1fandongdeshehuizhuyi2baoshoudehuozichanjiejideshehuizhuyi3pipandekongxiangdeshehuizhuyihegongchanzhuyi
xuanyandisizhanggongchandangrenduigezhongfanduidangpaidetaidulunshugongchandangrengemingdouzhengdesixiangcelue’
from collections import Counter
def most_frequent(data):
cc =[i[0] for i in Counter(data).most_common()[0:5]]
return cc
print(most_frequent("sdjfhksjfhksjdh234234234234fksjdhfkjsdfhksdfhuw3rhjkser"))