pandas DataFrame output end of csv

Question

How can I append new DataFrame data to the end of an existing csv file? The to_csv documentation doesn't mention such functionality.

Possible duplicate of How to add pandas data to an existing csv file?
– 9769953, Commented Sep 26, 2018 at 14:53

Community · Accepted Answer · 2017-05-23 11:54:27Z

80

You can append using to_csv by passing a file which is open in append mode:

with open(file_name, 'a') as f:
    df.to_csv(f, header=False)

Use header=False, so as not to append the column names.

In fact, pandas has a wrapper to do this in to_csv using the mode argument (see Joe's answer):

df.to_csv(file_name, mode='a', header=False)
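
For reference, a minimal round-trip sketch of the file-handle approach. The sample frames and the temporary path are made up for illustration, and index=False is an added assumption so that the appended rows line up with the header. The pandas documentation recommends opening the handle with newline='' when passing a file object to to_csv.

```python
import os
import tempfile

import pandas as pd

# Hypothetical sample data, used only for illustration.
first = pd.DataFrame({"a": [1, 2], "b": [3, 4]})
second = pd.DataFrame({"a": [5], "b": [6]})

path = os.path.join(tempfile.mkdtemp(), "example.csv")

# The initial write creates the file and includes the header row.
first.to_csv(path, index=False)

# Append the second frame through a handle opened in append mode.
# newline="" stops Python from translating the line endings pandas writes.
with open(path, "a", newline="") as f:
    second.to_csv(f, header=False, index=False)

# Read the file back to check that the rows were appended under one header.
combined = pd.read_csv(path)
print(combined)
```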

edited May 23, 2017 at 11:54

CommunityBot


answered Jun 16, 2013 at 15:52

Andy Hayden


10 Comments

perigee Over a year ago

Also need to close the file with f.close(). Andy, you made my day. It works like a charm. I come from a C/C++ background and need to learn the Python philosophy. Any suggestions?

perigee Over a year ago

Andy, really appreciated :-D (cannot use @ symbol :-()

Ezekiel Kruglick Over a year ago

Bonus points that this closes the file after to_csv. I have some code that hits to_csv a lot and was finding the files left open on later iterations.

Andy Hayden Over a year ago

@EzekielKruglick Were you passing an open file to to_csv or the filename? I recall a related issue where not closing the file led to a 99% speedup of their code (IIRC they were appending to the same file tens of thousands of times).

lesolorzanov Over a year ago

@perigee when "with" is used the file is closed automatically always. blog.lerner.co.il/dont-use-python-close-files-answer-depends


Leland Hepworth · Accepted Answer · 2020-09-16 19:22:45Z

47

You can also pass the file mode as an argument to the to_csv method:

df.to_csv(file_name, header=False, mode='a')
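
One possible way to use the mode argument when the file may not exist yet is to write the header only on the first call. This is a sketch under assumptions: append_to_csv is a hypothetical helper name, the sample data is invented, and index=False is chosen here so the header and rows stay aligned.

```python
import os
import tempfile

import pandas as pd


def append_to_csv(df, path):
    """Append df to path, writing the header only if the file is new or empty."""
    write_header = not os.path.exists(path) or os.path.getsize(path) == 0
    df.to_csv(path, mode="a", header=write_header, index=False)


# Hypothetical usage: two appends to a fresh temporary file.
path = os.path.join(tempfile.mkdtemp(), "log.csv")
append_to_csv(pd.DataFrame({"x": [1], "y": [2]}), path)
append_to_csv(pd.DataFrame({"x": [3], "y": [4]}), path)

result = pd.read_csv(path)
print(result)
```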

edited Sep 16, 2020 at 19:22

Leland Hepworth


answered Jul 28, 2013 at 17:14

Joe Hooper


KCzar · Accepted Answer · 2015-05-17 22:17:49Z

A little helper function I use (based on Joe Hooper's answer) with some header checking safeguards to handle it all:

import os

import pandas as pd

def appendDFToCSV_void(df, csvFilePath, sep=","):
    # New file: write the data together with its header row.
    if not os.path.isfile(csvFilePath):
        df.to_csv(csvFilePath, mode='a', index=False, sep=sep)
        return
    # Existing file: read its header once and compare it with the DataFrame.
    csvColumns = pd.read_csv(csvFilePath, nrows=1, sep=sep).columns
    if len(df.columns) != len(csvColumns):
        raise Exception("Columns do not match!! Dataframe has " + str(len(df.columns)) + " columns. CSV file has " + str(len(csvColumns)) + " columns.")
    if not (df.columns == csvColumns).all():
        raise Exception("Columns and column order of dataframe and csv file do not match!!")
    # Headers match: append the rows without repeating the header.
    df.to_csv(csvFilePath, mode='a', index=False, sep=sep, header=False)

Is there an API setting for the 3rd test case, column order not matching between dataframe and csv? I want to write without headers, but have the columns be implicitly reordered.
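
As far as I know, to_csv does not match columns against an existing file's header by itself. Its columns parameter does control which columns are written and in what order, so one possible sketch is to read the header from the file and pass it along. The helper name append_in_file_order and the sample data are hypothetical, and the sketch assumes the DataFrame holds exactly the columns named in the file.

```python
import os
import tempfile

import pandas as pd


def append_in_file_order(df, path, sep=","):
    """Append df to path, writing its columns in the order of the CSV header."""
    # nrows=0 reads only the header row of the existing file.
    file_columns = pd.read_csv(path, nrows=0, sep=sep).columns.tolist()
    if set(file_columns) != set(df.columns):
        raise ValueError("DataFrame columns do not match the CSV header")
    # columns= selects and orders the columns that get written.
    df.to_csv(path, mode="a", header=False, index=False, sep=sep,
              columns=file_columns)


# Hypothetical usage: the file has columns a,b and the new frame has b,a.
path = os.path.join(tempfile.mkdtemp(), "data.csv")
pd.DataFrame({"a": [1], "b": [2]}).to_csv(path, index=False)
append_in_file_order(pd.DataFrame({"b": [20], "a": [10]}), path)

result = pd.read_csv(path)
print(result)
```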

perigee · Accepted Answer · 2013-06-16 16:21:27Z

3

Thanks to Andy, the complete solution:

f = open(filename, 'a') # Open file in append mode
df.to_csv(f, header=False)
f.close()

answered Jun 16, 2013 at 16:21

perigee


1 Comment

Andy Hayden Over a year ago

Just to mention, this is essentially equivalent to the above, but here you have to close the file (f) yourself, whereas the with statement cleans that up for you. :)
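
To illustrate the difference, here is a small sketch comparing the two styles. The sample frame and temporary path are invented, and the try/finally wrapper is an addition to the manual version so that the file is closed even if to_csv raises, which is what the with statement does automatically.

```python
import os
import tempfile

import pandas as pd

df = pd.DataFrame({"a": [1]})  # hypothetical sample data
path = os.path.join(tempfile.mkdtemp(), "out.csv")

# Manual version: try/finally guarantees the close even if to_csv raises.
f = open(path, "a", newline="")
try:
    df.to_csv(f, header=False)
finally:
    f.close()

# Equivalent with-block: the file is closed when the block exits.
with open(path, "a", newline="") as g:
    df.to_csv(g, header=False)

# Both handles are closed at this point.
print(f.closed, g.closed)
```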
