画出箱型图后,我要把75%以上和25%以下的数从表中据删除,该怎么操作呢?
参考一下,这段代码,换成你的数据的对应列名即可 :
import pandas as pd
import numpy as np
df =pd.read_csv('geon.csv',sep=';')
print(np.percentile(df['device'],(25,75)))
a,b=df['device'].quantile([0.25,0.75])
IQR=b-a
print(b-a)
df=df[(df['device']>=a-1.5*IQR) & (df['device']<=b+1.5*IQR)]
df=df.set_index('geoNetwork')
print(df)