Python DictReader: how do I make the CSV column names lowercase?

人到中年有点甜 2021-03-25 13:09:22
I have a CSV file whose column names are all uppercase. I am reading the data with csv.DictReader, but I need the column names in lowercase. I found the following code in the question "Accessing csv header white space and case insensitive":

import csv

class DictReaderInsensitive(csv.DictReader):
    # This class overrides the csv.fieldnames property.
    # All fieldnames are without white space and in lower case

    @property
    def fieldnames(self):
        return [field.strip().lower() for field in super(DictReaderInsensitive, self).fieldnames]

    def __next__(self):
        # get the result from the original __next__, but store it in DictInsensitive
        dInsensitive = DictInsensitive()
        dOriginal = super(DictReaderInsensitive, self).__next__()

        # store all pairs from the old dict in the new, custom one
        for key, value in dOriginal.items():
            dInsensitive[key] = value

        return dInsensitive

class DictInsensitive(dict):
    # This class overrides the __getitem__ method to automatically strip() and lower() the input key

    def __getitem__(self, key):
        return dict.__getitem__(self, key.strip().lower())

My problem is that when I use it like this:

datafile = open(self.ifs_data_file, 'rU')
csvDict = DictReaderInsensitive(datafile)

for row in csvDict:
    print row
    #self.db.ifs_data.insert(**row)
    #self.db.commit()

I get this error:

Traceback (most recent call last):
  File "D:\Development\python\supplier_review\supplier_review.py", line 239, in update_ifs_data
    for row in csvDict:
  File "D:\Python27_5\lib\csv.py", line 103, in next
    self.fieldnames
  File "D:\Development\python\supplier_review\supplier_review.py", line 288, in fieldnames
    return [field.strip().lower() for field in super(DictReaderInsensitive, self).fieldnames]
TypeError: must be type, not classobj
3 Answers

一只斗牛犬

You can lowercase the first line of the file before passing it to DictReader:


import csv
import itertools

def lower_first(iterator):
    return itertools.chain([next(iterator).lower()], iterator)

with open(ifs_data_file, 'rU') as datafile:
    csvDict = csv.DictReader(lower_first(datafile))
    for row in csvDict:
        print row
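
The snippet above is written for Python 2. As a rough sketch of the same idea on Python 3, the version below lowercases only the header line and hands the rest of the input to csv.DictReader unchanged. The sample data and the empty-input guard are assumptions added for illustration; they are not part of the original answer.

```python
import csv
import io
import itertools


def lower_first(lines):
    """Return an iterator whose first line is lowercased; the rest is unchanged."""
    lines = iter(lines)
    try:
        header = next(lines)
    except StopIteration:
        return iter(())  # empty input: there is no header to lowercase
    return itertools.chain([header.lower()], lines)


# Hypothetical sample data standing in for the real CSV file.
sample = io.StringIO("NAME,CITY\nAlice,Paris\nBob,Oslo\n", newline="")
rows = list(csv.DictReader(lower_first(sample)))
print(rows)
```

Note that this only changes case. Unlike the class in the question, it does not strip surrounding whitespace from the column names.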


Answered 2021-03-30

慕的地10843

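In Python 2, DictReader is an old-style class, so super() cannot be used here at all; that is what "TypeError: must be type, not classobj" is telling you. You need to access the property object on the parent class directly. Also, in Python 2 you need to override the .next() method, not .__next__():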


class DictReaderInsensitive(csv.DictReader):
    # This class overrides the csv.fieldnames property.
    # All fieldnames are without white space and in lower case

    @property
    def fieldnames(self):
        # Call the parent's property getter directly instead of using super()
        return [field.strip().lower() for field in csv.DictReader.fieldnames.fget(self)]

    def next(self):
        # DictInsensitive is the dict subclass defined in the question
        return DictInsensitive(csv.DictReader.next(self))

Demo:


>>> example = '''\
... foo,Bar,BAZ
... 42,3.14159,Hello world!'''.splitlines()
>>> csvDict = DictReaderInsensitive(example)
>>> row = next(csvDict)
>>> print row
{'bar': '3.14159', 'foo': '42', 'baz': 'Hello world!'}
>>> row['BAZ']
'Hello world!'
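
For comparison, here is a minimal sketch of how the same subclass might look on Python 3, where all classes are new-style, so super() works and the iterator method is __next__. The class names LowercaseDictReader and CaseInsensitiveRow are made up for this example. It also assumes that DictReader builds each row from self.fieldnames, which is how CPython's csv module behaves.

```python
import csv
import io


class CaseInsensitiveRow(dict):
    """dict whose item lookup strips and lowercases the requested key."""

    def __getitem__(self, key):
        return super().__getitem__(key.strip().lower())


class LowercaseDictReader(csv.DictReader):
    """DictReader whose column names are stripped and lowercased."""

    @property
    def fieldnames(self):
        names = super().fieldnames
        if names is None:  # empty input: no header row was found
            return None
        return [name.strip().lower() for name in names]

    def __next__(self):
        # The parent builds the row from self.fieldnames (overridden above),
        # so the keys are already normalized; wrap for case-insensitive lookup.
        return CaseInsensitiveRow(super().__next__())


# Hypothetical sample data mirroring the demo above.
sample = io.StringIO("foo,Bar,BAZ\n42,3.14159,Hello world!\n", newline="")
row = next(LowercaseDictReader(sample))
print(row)
print(row["BAZ"])
```

Only bracket lookup is case-insensitive in this sketch; methods such as get() and the in operator still need the lowercase key.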


Answered 2021-03-30

繁星coding

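For a simpler approach, you can just update the DictReader.fieldnames attribute before you read the rows. In the session below, the row read before the update keeps the original keys, and rows read afterwards use the lowercase ones: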


>>> f = open('example-x-y-time.csv', 'rb')
>>> reader = csv.DictReader(f)
>>> reader.fieldnames
['Latitude', 'Longitude', 'Date']
>>> print next(reader)
{'Latitude': '44.8982391', 'Date': '2004-07-12', 'Longitude': '-117.7791061'}
>>> reader.fieldnames = [name.lower() for name in reader.fieldnames]
>>> print next(reader)
{'latitude': '44.6637001', 'date': '1964-04-03', 'longitude': '-123.5997009'}
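
A small Python 3 sketch of the same approach follows, with the names lowercased before any row is read so that every row gets lowercase keys. The in-memory sample reuses values from the session above in place of the real file, and the "or []" guard for an empty file is an assumption added for illustration.

```python
import csv
import io

# Hypothetical in-memory stand-in for example-x-y-time.csv.
sample = io.StringIO(
    "Latitude,Longitude,Date\n"
    "44.8982391,-117.7791061,2004-07-12\n"
    "44.6637001,-123.5997009,1964-04-03\n",
    newline="",
)

reader = csv.DictReader(sample)
# Reading .fieldnames consumes the header row; assigning to it replaces
# the keys used for every row read afterwards.
reader.fieldnames = [name.lower() for name in (reader.fieldnames or [])]

rows = list(reader)
print(rows[0]["latitude"])
```

With a real file on Python 3, open it in text mode with newline='' rather than 'rb'.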


Answered 2021-03-30