python 3,需要从如下字符中提取信息:
['"country":"UK","email":"abc@vip.com","x_id":123,"level":0',
'"country":"UK","email":"bcd@vip.com","x_id":234,"level":1',
'"country":"UK","email":"efg@vip.com","x_id":456]
提取出如下信息放入dataframe:
x_id level
123 0
234 1
456 NAN
代码:
import pandas as pd
import re
`data_raw=['"country":"UK","email":"abc@vip.com","x_id":123,"level":0','"country":"UK","email":"bcd@vip.com","x_id":234,"level":1','"country":"UK","email":"efg@vip.com","x_id":456]`
data=pd.DataFrame(data_raw)
data['x_id']=data.apply(lambda x:re.search(r'(\"x_id\":)\d{1-10}',x))
无法提取
不知道你这个问题是否已经解决, 如果还没有解决的话: