python – Pandas读取_excel：’utf-8’编解码器无法解码位置14的字节0xa8：无效的起始字节

试图读取MS Excel文件,版本2016.文件包含几个包含数据的列表.从DataBase下载的文件,可以在MS Office中正确打开.在下面的示例中,我更改了文件名.

编辑：文件包含俄语和英语单词.最有可能使用Latin-1编码,但编码=’latin-1’没有帮助

import pandas as pd
with open('1.xlsx', 'r', encoding='utf8') as f:
        data = pd.read_excel(f)

结果：

UnicodeDecodeError: 'utf-8' codec can't decode byte 0xa8 in position 14: invalid start byte

没有encoding =’utf8′

'charmap' codec can't decode byte 0x9d in position 622: character maps to <undefined>

附：任务是处理52个文件,将每张表中的数据与52个文件中的相应表格合并.所以,请不要处理工作建议.

最佳答案

最有可能的问题是俄罗斯符号.

Charmap是默认解码方法,用于没有注意到编码的情况.

正如我所看到的,如果utf-8和latin-1没有帮助,那么尝试不读取此文件

pd.read_excel(f)

但

pd.read_table(f)

甚至只是

f.readline()

为了检查什么是符号引发一个例外并删除这个符号/符号.

点击查看更多相关文章

转载注明原文：python – Pandas读取_excel：’utf-8’编解码器无法解码位置14的字节0xa8：无效的起始字节 - 乐贴网

JAVA c c++go swift javascript Nginx UI/UE 小程序 Python C#php asp

PyCharm2020.3专业版永久激活(亲测有效，已激活至2089年！已升级到无限重置版！)

Python 4年前 26640

Python 5年前 8394

Python 5年前 6189

Python 5年前 5838

Python 5年前 5447

Python 5年前 3720

Python 5年前 3594

Python 5年前 3568

Python 5年前 3523

Python 5年前 3514

Python 5年前 3017

Python 5年前 2505