使用 Python SDK 进行文章分类,遇到如下问题:
- 'UnicodeEncodeError: 'gbk' codec can't encode character '\u2028' in position 46: illegal multibyte sequence',无论如何更改,都还是报这个错误.
- 使用的是 Python3.6.5, MaC, VS code,换到 jupyter 和 pycharm 中还是报错.
- 出错的完整函数部分:
-
def get_article_topic(shu): df = pd.read_csv('/Users/meininghang/Downloads/data.csv') #index_col=['日期']) # 处理数据集 #df = df.copy() title = str(df.题目[shu]) content = str(df.正文[shu]) # pprint.pprint(client.topic(title, content)) marked = client.topic(title, content)['item']['lv1_tag_list'][0]['tag'] try: df['Unnamed: 0'][shu] = marked print(df['Unnamed: 0'][shu]) except: pass
收藏
点赞
0
个赞
请登录后评论
TOP
切换版块
有了该怎么办呢
UnicodeEncodeError: 'gbk' codec can't encode character '\u202c' in position 26: illegal multibyte sequence
调用sentimentClassify的时候出的问题,同求解答
UnicodeEncodeError: 'gbk' codec can't encode character '\u202c' in position 26: illegal multibyte sequence
检查一下你的文本中是否有GBK格式不可编码的字符