df = pd.read_csv("C:\Users\极光\Desktop\NLP_project\date\entertainment_news.csv", encoding='utf-8')
df = df.dropna()
content=df.content.values.tolist()
jieba.load_userdict(u"data/user_dic.txt")
segment=[]
for line in content:
segs=jieba.lcut(line)
for seg in segs:
if len(seg)>1 and seg!='\r\n':
segment.append(seg)
File "<ipython-input-12-b85d0e965018>", line 1 df = pd.read_csv("C:\Users\极光\Desktop\NLP_project\date\entertainment_news.csv", encoding='utf-8') ^ SyntaxError: (unicode error) 'unicodeescape' codec can't decode bytes in position 2-3: truncated \UXXXXXXXX escape
要么用双斜杠 要么在字符串前面加r 一个反斜杠会被认为是转义字符的
C:\\Users\\极光\\Desktop\\NLP_project\\date\\entertainment_news.csv
这样也不行吗?