如图表所示,以Avant-Grade流派为例,该流派一共有多首歌曲,他们分别发表于不同的年份。
现在希望求各个流派已知的年份的发型的音乐总数,并最终以列导出为CSV文件,求大佬用代码解决一下这个问题
先把数据读出来,得到歌曲和年份, 再按年份计数就好。
import pandas as pd
if __name__ == "__main__":
# 设置年份
year = 1936
df = pd.read_excel("./data.xls")
res = df.query("year == {}".format(year))["department"].value_counts()
res.to_csv("./res_{}.csv".format(year), header=False)
利用pandas的value_counts可以统计Serises中各值出现的概率
##跟之前一样,mean 改成count
import pandas as pd
df=pd.read_csv('full_music_data time.csv',usecols = list(range(3,20)))
df=df[df['danceability']!='danceability']
df['ct']=1
gp=df.groupby(['department','year']).count()[['ct']]
department=[]
year=[]
for n1,n2 in gp.index:
department.append(n1)
year.append(n2)
gp['department']=department
gp['year']=year
df2=gp.reset_index(drop=True)
df2.to_csv('out_wenjian.csv',index=0)