How to extract the 5 most frequent substrings from this string (greedy)

The str below is the pinyin transliteration of an article. How can I extract the 5 most frequently occurring substrings (greedy mode)?

str=‘gongchandangxuanyanbaokuoyinyanhezhengwensizhang1872nian1893nianmakesiheengesixianhouweixuanyandedewenewenyingwenbolanwenyidaliwenbanzhuanxieqipianxuyanqipianxuyanjianyaoshuomingxuanyandejibensixiangjiqizaiguojigongchanzhuyiyundongzhongdelishidiweizhimingxuanyandelilunyuanlishilishiweiwuzhuyibinggenjuwuchanjiejigemingdejingyanhejiaoxunduixuanyanzuobuchonghexiugaigongchandangxuanyanzhongwenbanfengmiangongchandangxuanyanzhongwenbanfengmianxuanyandiyizhangzichanzhehewuchanzhelunshumakesizhuyidejiejidouzhengxueshuozhuyaoshiyimakesizhuyijiejidouzhengxueshuojieshizibenzhuyibiranmiewanggongchanzhuyibiranshenglidekeguanguilüzhichuwuchanjiejidelishishimingbenzhangzhichuziyuanshishizushehuijietiyilaizhijinyiqieshehuidelishidoushijiejidouzhengdelishijiejidouzhengxueshuoshimakesizhuyixueshuodejichuyeshiguanchuanxuanyandeyitiaohongxiantongguoduijiejihejiejidouzhengdefenximakesiheengesidechujielunzichanjiejidemiewanghewuchanjiejideshenglishitongyangbukebimiandewuchanjiejibixubatuifanzichanjiejitongzhijianlizijidetongzhijinerzaiquanshijiexiaomieyiqiebaoxuejiejihebaoxuezhidujiefangquanrenleizuizhongshixiangongchanzhuyizuoweizijidelishishimingxuanyandierzhangwuchanzhehegongchandangrenshuomingwuchanjiejizhengdangdexingzhitedianmudeherenwuyijigongchandangdelilunheganglingzhuyaoshijieshiwuchanjiejiyaoshixianzijidelishishimingbixuyouwuchanjiejizhengdanglailingdaojinerchanshugongchandangdexingzhitedianherenwuchanmingwuchanjiejishixianlishishimingdegenbendaoluzhichugongchandangshiwuchanjiejidezhengdangshiwuchanjiejidexianfengduidaibiaowuchanjiejidegenbenliyibumouqiurenhesiligongchandangrendezuijinmudeshishiwuchanjiejixingchengweijiejituifanzichanjiejidetongzhiyouwuchanjiejijianlizhengquanqizuizhongfendoumubiaoshixiaomiesiyouzhizuizhongshixiangongchanzhuyiweileshixianzheyilishishimingshouxianshiduoquzhengquanjianliwuchanjiejizhuanzhengranhouliyongzijidezhengquanduishengchanziliaosiyouzhijinhangshehuizhuyigaizaozaishehuizhuyigongyouzhijianlihoujishibagongzuozhongxinzhuanyidaojingjijian
sheshanglaiweixiangshehuizhuyiguoduchuangzaotiaojiansuizheshehuidechongfenfazhandangshengchanziliaosiyouzhihejiejixiaomieyihouguojiaxiaowangdaonageshihoudaitinacunzaizhejiejihejiejiduilidezichanjiejijiushehuidejiangshizheyangyigelianhetizainalimeigerendeziyoufazhanshiyiqierendeziyoufazhandetiaojianjijinrugongchanzhuyishehuixuanyandisanzhangshehuizhuyidehegongchanzhuyidewenxianpipandangshiliuhangdegezhongjiashehuizhuyifenxigezhongjiashehuizhuyiliupaichanshengdeshehuilishitiaojianzhichuqililunyaohaishehuigenyuanjiejishizhihezuizhongbeitaotaidelishibiranxingcongerhuaqingkexueshehuizhuyiheqitashehuizhuyidejiexian[5]fenwei1fandongdeshehuizhuyi2baoshoudehuozichanjiejideshehuizhuyi3pipandekongxiangdeshehuizhuyihegongchanzhuyi
xuanyandisizhanggongchandangrenduigezhongfanduidangpaidetaidulunshugongchandangrengemingdouzhengdesixiangcelue’


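from collections import Counter


def most_frequent(data):
    # Iterating over a string yields single characters, so Counter tallies
    # characters here and this returns the 5 most common characters,
    # not multi-character substrings.
    return [item for item, _ in Counter(data).most_common(5)]


print(most_frequent("sdjfhksjfhksjdh234234234234fksjdhfkjsdfhksdfhuw3rhjkser"))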
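
The question asks for substrings rather than single characters, so one possible extension is to count every window of a fixed length. The sketch below is only an assumption about what "greedy" is meant to cover: it counts overlapping windows of exactly `length` characters. The names `most_frequent_substrings`, `length`, `top`, and `sample` are made up for illustration and do not come from the original post.

```python
from collections import Counter


def most_frequent_substrings(text, length, top=5):
    """Return the `top` most common substrings of exactly `length` characters."""
    # Slide a window of `length` characters over the text, one step at a time.
    # Windows overlap, so "aaa" contains the window "aa" twice.
    windows = (text[i:i + length] for i in range(len(text) - length + 1))
    # most_common returns (substring, count) pairs, highest count first.
    return Counter(windows).most_common(top)


# Short hypothetical sample in the same pinyin style as the question.
sample = "xuanyanxuanyanjiejixuanyan"
print(most_frequent_substrings(sample, 7, top=1))  # [('xuanyan', 3)]
```

To apply it to the article, pass the full pinyin string as `text` and try several values of `length`. Picking the longest substrings that still repeat often would need an extra ranking step that this sketch does not attempt.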