数据源文件简摘如下:
贵A**49ff272baf8c1b96e62c316223155ef7|2|02|2018-10-30 00:01:00.0|71|H|K33|520115|02|5201000072|1120132|环城高速K65+900M(观山湖区匝道口前花溪至白云2车道)|106.589432|26.631519|1190016223|1190016223080
湘A**df64d271e6a27871aea87a58f906b255|2|02|2018-10-30 00:01:06.0|62|E|K39|520115|02|5201000072|1120132|环城高速K65+900M(观山湖区匝道口前花溪至白云2车道)|106.589432|26.631519|1190016223|1190016223080
数据说明:
字段:号牌号码(加密)|号牌颜色|号牌类型|过车时间|车速|车辆颜色|车辆类型|行政区划代码|车道号|方向代码|设备代码|设备地点|经度|纬度|路段编码|路段区间编码
用Python将近100000条信息整理成表格的形式
目测,读取文件,然后把|替换为,
保存为csv即可
file_object = open('bat文件')
file_context = file_object.read()
file_context =file_context .replace("|", ",")
with open("文件名.csv",'w') as f:
f.write(file_content)
import pandas as pd
f=pd.read_table(bat文件路径名称,sep='|')
f.to_csv(csv文件路径名称)
使用csv库,代码如下,将文件名123.txt改为实际文件即可执行:
import os
import csv
def save2csv(infile, outfile):
fp = open(infile, 'r', encoding='utf-8')
fout = open(outfile, 'w', newline='')
writer = csv.writer(fout)
for line in fp.readlines():
line = line.strip()
if not line:
continue
line = line.split('|')
writer.writerow(line)
fp.close()
fout.close()
if __name__ == "__main__":
save2csv('./123.txt','./123.csv')