How to add pandas data to an existing csv file?

Question

I want to know if it is possible to use the pandas to_csv() function to add a dataframe to an existing csv file. The csv file has the same structure as the loaded data.

I think method suggested by @tlingf is better only because he is using build-in functionality of pandas library. He suggests define mode as "a" . "A" stands for APPEND 'df.to_csv('my_csv.csv', mode='a', header=False)' — Ayrat
– Ayrat, Commented Oct 20, 2014 at 13:14
The answer from @KCzar considers both the cases when the CSV file is not there (i.e. add the column header) and when the CSV is already there (so add just the data rows without headers). In any case it uses the "append" mode and a custom separator, along with checks on the number of columns. — TPPZ
– TPPZ, Commented Apr 17, 2019 at 8:46

Antonio · Accepted Answer · 2022-01-20 14:28:53Z

1000

You can specify a python write mode in the pandas to_csv function. For append it is 'a'.

In your case:

df.to_csv('my_csv.csv', mode='a', header=False)

The default mode is 'w'.

If the file initially might be missing, you can make sure the header is printed at the first write using this variation:

output_path='my_csv.csv'
df.to_csv(output_path, mode='a', header=not os.path.exists(output_path))

edited Jan 20, 2022 at 14:28

Antonio

20.6k14 gold badges109 silver badges221 bronze badges

answered Jul 31, 2013 at 16:19

tlingf

10.1k2 gold badges15 silver badges4 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

datanew Over a year ago

Thanks for the answer. This will allow me append new df on row-wise. But could you let me know how can I append the new df on column-wise?

datanew Over a year ago

I was able to accomplish it by re-read the 'my_csv.csv', then concat the new df, and then save it. If you know some easier method, please DO let me know. I appreciate!

Etisha Over a year ago

How to write header for the first file and rest of the rows gets automatically appended to it?

Michele Tonutti Over a year ago

@Etisha something like df.to_csv(output_path, mode='a', header=not os.path.exists(output_path))

user35915 Over a year ago

Correct answer, of course, just a note: passing index=False will tell df.to_csv not to write the row index to the first column. Depending on the application, this might make sense to avoid a meaningless index column.

|

Andy Hayden · Accepted Answer · 2013-07-08 16:06:49Z

284

You can append to a csv by opening the file in append mode:

with open('my_csv.csv', 'a') as f:
    df.to_csv(f, header=False)

If this was your csv, foo.csv:

,A,B,C
0,1,2,3
1,4,5,6

If you read that and then append, for example, df + 6:

In [1]: df = pd.read_csv('foo.csv', index_col=0)

In [2]: df
Out[2]:
   A  B  C
0  1  2  3
1  4  5  6

In [3]: df + 6
Out[3]:
    A   B   C
0   7   8   9
1  10  11  12

In [4]: with open('foo.csv', 'a') as f:
             (df + 6).to_csv(f, header=False)

foo.csv becomes:

,A,B,C
0,1,2,3
1,4,5,6
0,7,8,9
1,10,11,12

edited Jul 8, 2013 at 16:06

answered Jul 8, 2013 at 15:57

Andy Hayden

378k110 gold badges640 silver badges546 bronze badges

2 Comments

Pouya BCD Over a year ago

Thou it is not harmful but I don't think you need a context manager for using to_csv() method.

Jingnan Jia Over a year ago

Do we really need with open('my_csv.csv', 'a') as f:??

SpiralDev · Accepted Answer · 2018-12-14 03:50:05Z

102

with open(filename, 'a') as f:
    df.to_csv(f, header=f.tell()==0)

Create file unless exists, otherwise append
Add header if file is being created, otherwise skip it

answered Dec 14, 2018 at 3:50

SpiralDev

7,3715 gold badges31 silver badges45 bronze badges

3 Comments

Gabriela Melo Over a year ago

It's missing a mode='a' as a parameter to to_csv (ie df.to_csv(f, mode='a', header=f.tell()==0)

Piyush Over a year ago

@GabrielaMelo That was passed in the function open(filename, 'a').

David Kaufman Over a year ago

I get an extra blank line between every line of data (on Windows, which I guess is vulnerable to that) unless I add some parentheses: header=(f.tell()==0) -- and also write : with open(filename, 'a', newline='') as f:

KCzar · Accepted Answer · 2015-05-17 22:49:32Z

A little helper function I use with some header checking safeguards to handle it all:

def appendDFToCSV_void(df, csvFilePath, sep=","):
    import os
    if not os.path.isfile(csvFilePath):
        df.to_csv(csvFilePath, mode='a', index=False, sep=sep)
    elif len(df.columns) != len(pd.read_csv(csvFilePath, nrows=1, sep=sep).columns):
        raise Exception("Columns do not match!! Dataframe has " + str(len(df.columns)) + " columns. CSV file has " + str(len(pd.read_csv(csvFilePath, nrows=1, sep=sep).columns)) + " columns.")
    elif not (df.columns == pd.read_csv(csvFilePath, nrows=1, sep=sep).columns).all():
        raise Exception("Columns and column order of dataframe and csv file do not match!!")
    else:
        df.to_csv(csvFilePath, mode='a', index=False, sep=sep, header=False)

@JasonGoal df = df.reindex(sorted(df.columns), axis=1); see stackoverflow.com/a/11067072/9095840.

Grant Shannon · Accepted Answer · 2018-01-25 15:51:40Z

6

Initially starting with a pyspark dataframes - I got type conversion errors (when converting to pandas df's and then appending to csv) given the schema/column types in my pyspark dataframes

Solved the problem by forcing all columns in each df to be of type string and then appending this to csv as follows:

with open('testAppend.csv', 'a') as f:
    df2.toPandas().astype(str).to_csv(f, header=False)

answered Jan 25, 2018 at 15:51

Grant Shannon

5,1132 gold badges51 silver badges39 bronze badges

Comments

Ahtisham · Accepted Answer · 2021-02-16 13:13:39Z

2

This is how I did it in 2021

Let us say I have a csv sales.csv which has the following data in it:

sales.csv:

Order Name,Price,Qty
oil,200,2
butter,180,10

and to add more rows I can load them in a data frame and append it to the csv like this:

import pandas

data = [
    ['matchstick', '60', '11'],
    ['cookies', '10', '120']
]
dataframe = pandas.DataFrame(data)
dataframe.to_csv("sales.csv", index=False, mode='a', header=False)

and the output will be:

Order Name,Price,Qty
oil,200,2
butter,180,10
matchstick,60,11
cookies,10,120

edited Feb 16, 2021 at 13:13

answered Feb 16, 2021 at 13:05

Ahtisham

10.2k6 gold badges49 silver badges62 bronze badges

2 Comments

Rafs Over a year ago

I'm not able to find the added value here over stackoverflow.com/a/17975690/3429115

LonelySoul Over a year ago

It does not add the pandas file to existing csv .

ai-shwarya · Accepted Answer · 2017-06-17 00:26:37Z

0

A bit late to the party but you can also use a context manager, if you're opening and closing your file multiple times, or logging data, statistics, etc.

from contextlib import contextmanager
import pandas as pd
@contextmanager
def open_file(path, mode):
     file_to=open(path,mode)
     yield file_to
     file_to.close()


##later
saved_df=pd.DataFrame(data)
with open_file('yourcsv.csv','r') as infile:
      saved_df.to_csv('yourcsv.csv',mode='a',header=False)`

answered Jun 17, 2017 at 0:26

ai-shwarya

1701 silver badge4 bronze badges

2 Comments

baxx Over a year ago

what's the benefit of using a context manager here?

leo Over a year ago

how is this any different from using open as a context manager?

Collectives™ on Stack Overflow

How to add pandas data to an existing csv file?

7 Answers 7

6 Comments

2 Comments

3 Comments

2 Comments

Comments

2 Comments

2 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

7 Answers 7

6 Comments

2 Comments

3 Comments

2 Comments

Comments

2 Comments

2 Comments

Linked

Related