例如:有个总表记录了
每个教师需要对每个学生的家访次数:
df1 = pd.DataFrame({
'teacher' : ["t1", "t2", "t1", "t2", "t2"],
'student' : ["s1", "s2", "s3", "s3", "s1"],
'numb' : [3, 2, 2, 4, 2]
})
另一个表记录了实际已经拜访的次数:
df2 = pd.DataFrame({
'teacher' : ["t1", "t1", "t2", "t2"],
'student' : ["s1", "s3", "s2", "s3"],
'numb' : [2, 1, 1, 2]
})
我想得到剩余的家访次数df,该如何做呢..
问题描述的很清晰,也能直接复制,怎么没人答
df=pd.concat([df1.set_index(['teacher','student']),-df2.set_index(['teacher','student'])]).groupby(['teacher','student']).sum().reset_index()