第一张表是4条数据生产数据生产13300,第二张表是3条数据出货数据出了13300,需要把出货的13300平分到第一张表,并且要把出货时间给匹配到出货单上,匹配原则就是按照先完工的工单匹配先出货数据,完工时间相同按照工单号小的先分配,用python或者sql
import pandas as pd
# 创建DataFrame
data1 = {
'sd_date': ['20210622', '20210623', '20210624', '20210625','20210629','20210630'],
'sd_id': ['A', 'A', 'A', 'A','B','B'],
'sd_seq': [1,1,1,1,1,1],
'item_num': [5000, 2200, 2800, 3300,3000,4000]
}
data2 = {
'ch_date': ['20210725', '20210729', '20210816','20210630','20210830'],
'sd_id': ['A', 'A', 'A','B','B'],
'ship_num': [4500, 1800,6999,4000,3000]
}
df11 = pd.DataFrame(data1)
df22 = pd.DataFrame(data2)
df3 = pd.DataFrame(columns=['sd_date', 'sd_id', 'sd_seq', 'item_num', 'ch_date_new', 'ship_num_new'])
display(df11,df22)
#遍历‘sd_id’,保证每次取相同sd_id下数据
for sd in df11['sd_id'].unique():
df1 = df11[df11['sd_id'] == sd].reset_index(drop=True)
df2 = df22[df22['sd_id'] == sd].reset_index(drop=True)
residual = df2.loc[0, 'ship_num']
i = 0
j = 0
while i < len(df1) and j < len(df2) :
if int(df1.loc[i, 'sd_date']) <= int(df2.loc[j, 'ch_date']) and df1.loc[i, 'item_num'] <= residual:
new_row = {'sd_date':df1.loc[i, 'sd_date'],'sd_id':sd,'sd_seq':df1.loc[i, 'sd_seq'],'item_num':df1.loc[i, 'item_num'],'ch_date_new':df2.loc[j, 'ch_date'],'ship_num_new':df1.loc[i, 'item_num']}
df3 = df3.append(new_row , ignore_index=True)
residual -= df1.loc[i, 'item_num']
i+=1
elif int(df1.loc[i, 'sd_date']) <= int(df2.loc[j, 'ch_date']) and df1.loc[i, 'item_num'] > residual:
new_row = {'sd_date':df1.loc[i, 'sd_date'],'sd_id':sd,'sd_seq':df1.loc[i, 'sd_seq'],'item_num':df1.loc[i, 'item_num'],'ch_date_new':df2.loc[j, 'ch_date'],'ship_num_new':residual}
df3 = df3.append(new_row , ignore_index=True)
df1.loc[i, 'item_num'] -= residual
j+=1
if j< len(df2):
residual = df2.loc[j, 'ship_num']
else:
j+=1
if j< len(df2):
residual = df2.loc[j, 'ship_num']
print(df3[df3.ship_num_new!=0])
源于chatGPT仅供参考
您可以使用Python和SQL来处理这个需求。以下是一个示例的Python代码,结合了SQL查询和数据处理步骤,以实现将出货的13300平均分配给第一张表,并匹配出货时间到出货单上的功能:
```python
import sqlite3
# 连接数据库
conn = sqlite3.connect('your_database.db')
cursor = conn.cursor()
# 查询第一张表的数据
cursor.execute("SELECT * FROM table1")
table1_data = cursor.fetchall()
# 查询第二张表的数据
cursor.execute("SELECT * FROM table2 ORDER BY shipment_date, work_order ASC")
table2_data = cursor.fetchall()
# 计算每条出货数据应分配的数量
total_shipment_amount = 13300
shipment_count = len(table2_data)
avg_shipment_amount = total_shipment_amount // shipment_count
remainder = total_shipment_amount % shipment_count
# 更新第一张表的数据并插入出货时间
for i in range(len(table1_data)):
# 按照先完工的工单匹配先出货数据
if i < remainder:
shipment_amount = avg_shipment_amount + 1
else:
shipment_amount = avg_shipment_amount
# 获取出货信息
shipment_data = table2_data[i]
shipment_id = shipment_data[0]
shipment_date = shipment_data[1]
# 更新第一张表的数据
updated_amount = table1_data[i][1] + shipment_amount
cursor.execute("UPDATE table1 SET production_amount = ? WHERE id = ?", (updated_amount, table1_data[i][0]))
# 在出货单中插入出货时间
cursor.execute("UPDATE table2 SET shipment_date = ? WHERE id = ?", (shipment_date, shipment_id))
# 提交更改并关闭数据库连接
conn.commit()
cursor.close()
conn.close()
请注意,上述代码仅为示例,您需要根据实际情况调整数据库名称、表名称和字段名称。此外,示例中使用了SQLite数据库作为示例,您可以根据自己的需求将其替换为所使用的数据库类型。
以上代码将出货数量均匀分配给第一张表的数据,并将出货时间匹配到对应的出货单上。在更新数据库时,请确保提前备份数据以免出现意外情况。
```
df1 = pd.read_excel("student.xlsx")
df2 = pd.read_excel("sc.xlsx")
df3 = pd.read_excel("course.xlsx")
df4 = pd.read_excel("teacher.xlsx")
pysqldf = lambda q: sqldf(q, globals())
query1 = "select * from df1 limit 5"
query2 = "select * from df2 limit 5"
query3 = "select * from df3"
query4 = "select * from df4"
sqldf(query1)
sqldf(query2)
sqldf(query3)
sqldf(query4)
部分结果如下: