3、 实验内容:在英语中英文单词组成句子,在键盘上输入一句含有标点英文语句,统计其中最长的单词以及各单词出现的次数。具体要求如下:
(1) 用户输入一个英文语句,英文语句子中要包含标点符号;若个别单词后面带有标点符号,其标点符号不能计算为单词;
(2) 输出英文语句中最长的单词,若有多个单词,则每行输出一个;
(3) 统计各单词出现的次数,按照数量从多到少排列在屏幕上输出;如果出现的数量相同,则按照单词在句子中的顺序进行显示;输出的格式为:单词左对齐,宽度为10。
例如,如果用户在键盘上输入句子:Hi, we play on the Maldive beach on October the second.
我的代码:
import string
sent = input().strip(
for c in string.punctuation:
sent =string.punctuation.strip (c," " #1去除标点符号
wordlist = sent.split( #分词
wordcounts = {}
wordlens =[]
for word in wordlist:
wordcounts[word] = wordcounts.get(word,0 + 1 #2词频统计
for word in wordcounts.keys():
wordlens.append((word,len(word #计算每个单词长度
sort_wordcounts = sorted(wordcounts,key = lambda x:x[1],reverse =True) #3按照词频排序
sort_wordlens = sorted(wordlens,key = lambda x:x[1],reverse=True #4按照单词长度排序
maxlen = 0 #5最长单词的长度
for maxlen in sort_wordlens: #6输出所有最长单词
if wlen == maxlen:
print("maxlen:{1:<3}word:{0:>10}".format(word,wlen
else:
break #7
for word, count in sort_wordcounts: #输出单词词频
print("{0:<10}{1:->5}".format(word,count #8
如何完善此代码
在你的程序基础上修改如下:
import string
sent = input()
for c in string.punctuation:
sent = sent.replace(c," ") #1去除标点符号
wordlist = sent.split() #分词
wordcounts = {}
wordlens =[]
for word in wordlist:
wordcounts[word] = wordcounts.get(word,0) + 1 #2词频统计
for word in wordcounts.keys():
wordlens.append((word,len(word))) #计算每个单词长度
sort_wordcounts = sorted(wordcounts.items(),key = lambda x:x[1],reverse =True) #3按照词频排序
sort_wordlens = sorted(wordlens,key = lambda x:x[1],reverse=True) #4按照单词长度排序
maxlen = sort_wordlens[0][1] #5最长单词的长度
for wlen in sort_wordlens: #6输出所有最长单词
if wlen[1] == maxlen:
print("maxlen:{0:<3}word:{1:>10}".format(maxlen,wlen[0]))
for word, count in sort_wordcounts: #输出单词词频
print("{0:<10}{1:->5}".format(word,count)) #8
参考一下
w_dict = {}
while True:
word = input('please input:')
if word in w_dict:
w_dict[word] = w_dict[word] + 1
elif word == '0':
break
else:
w_dict[word] = 1
re_list = sorted(w_dict.items(),key=lambda item:-item[1])
print('result:')
for i in re_list:
print(i[0],i[1])