How to save an Excel worksheet as CSV

Question

I want to write a Python script that reads in an Excel spreadsheet and saves some of its worksheets as CSV files.

How can I do this?

I have found third-party modules for reading and writing Excel files from Python, but as far as I can tell, they can only save files in Excel (i.e. *.xls) format. If I'm wrong here, some example code showing how to do what I'm trying to do with these modules would be appreciated.

I also came across one solution that I can't quite understand, but seems to be Windows-specific, and therefore would not help me anyway, since I want to do this in Unix. At any rate, it's not clear to me that this solution can be extended to do what I want to do, even under Windows.

Sayyor Y · Accepted Answer · 2021-05-23 04:16:12Z

The most basic examples using the two libraries described line by line:

Open the xls workbook
Reference the first spreadsheet
Open in binary write the target csv file
Create the default csv writer object
Loop over all the rows of the first spreadsheet
Dump the rows into the csv

import xlrd
import csv

with xlrd.open_workbook('a_file.xls') as wb:
    sh = wb.sheet_by_index(0)  # or wb.sheet_by_name('name_of_the_sheet_here')
    with open('a_file.csv', 'wb') as f:   # open('a_file.csv', 'w', newline="") for python 3
        c = csv.writer(f)
        for r in range(sh.nrows):
            c.writerow(sh.row_values(r))

import openpyxl
import csv

wb = openpyxl.load_workbook('test.xlsx')
sh = wb.active
with open('test.csv', 'wb') as f:  # open('test.csv', 'w', newline="") for python 3
    c = csv.writer(f)
    for r in sh.rows:
        c.writerow([cell.value for cell in r])

To evaluate the Excel formulas with openpyxl : wb = openpyxl.load_workbook('test.xlsx', data_only=True)
@Zeugma how can I write this csv back to a folder? (In my case aws s3) I keep getting AttributeError: '_io.TextIOWrapper' object has no attribute 'save'

Maximilian Press · Accepted Answer · 2025-02-27 07:32:09Z

18

Using pandas is a bit shorter:

import pandas as pd

df = pd.read_excel('my_file', sheet_name='my_sheet_name')  # sheet_name is optional
df.to_csv('output_file_name', index=False)  # index=False prevents pandas from writing a row index to the CSV.

# oneliner
pd.read_excel('my_file', sheet_name='my_sheet_name').to_csv('output_file_name', index=False)

edited Feb 27 at 7:32

Maximilian Press

3974 silver badges15 bronze badges

answered Jul 25, 2017 at 8:09

FabienP

3,1581 gold badge24 silver badges27 bronze badges

4 Comments

rrs Over a year ago

I don't trust pandas to do this. It's been converting all my leading zeros.

FabienP Over a year ago

Can you give some more details?

Keivan Ipchi Hagh Over a year ago

This implementation works perfectly fine for my scenario, just change sheetname to sheet_name as it's a typo.

Joey Baruch Over a year ago

@rrs why not use pd.read_excel('my_file', dtype=str) ? more info here

jtlz2 · Accepted Answer · 2021-12-02 11:06:56Z

17

As of December 2021 and Python 3:

The openpyxl API has changed sufficiently (see https://openpyxl.readthedocs.io/en/stable/usage.html) that I have updated this part of the answer by @Boud (now @Zeugma?), as follows:

import openpyxl
import csv

wb = openpyxl.load_workbook('test.xlsx')
sh = wb.active # was .get_active_sheet()
with open('test.csv', 'w', newline="") as file_handle:
    csv_writer = csv.writer(file_handle)
    for row in sh.iter_rows(): # generator; was sh.rows
        csv_writer.writerow([cell.value for cell in row])

@Leonid made some helpful comments - in particular:

csv.writer provides some additional options e.g. custom delimiter:

csv_writer = csv.writer(fout, delimiter='|', quotechar='"', quoting=csv.QUOTE_MINIMAL)

HTH

edited Dec 2, 2021 at 11:06

answered Sep 28, 2020 at 9:32

jtlz2

8,52711 gold badges74 silver badges128 bronze badges

4 Comments

eakst7 Over a year ago

A couple of typos here. The "with" needs "as f" on the end, and "sh.iter_rows" should be "sh.iter_rows()" Otherwise, works well, thanks!

jtlz2 Over a year ago

@eakst7 Huge thanks - can you believe I typed it out - now fixed - glad it helped.

Leonid Over a year ago

Thanks, that was useful. Two comments from me: 1. pylama does not like single-letter variable names and the call to csv.writer provides additional options (such as custom delimiter) which would be cool to highlight. For example: csv_writer = csv.writer(fout, delimiter='|', quotechar='"', quoting=csv.QUOTE_MINIMAL)

jtlz2 Over a year ago

@Leonid Thanks so much - updated as per your helpful comments!

Charles Duffy · Accepted Answer · 2012-05-29 16:34:02Z

5

Use the xlrd or openpyxlmodule to read xls or xlsx documents respectively, and the csv module to write.

Alternately, if using Jython, you can use the Apache POI library to read either .xls or .xlsx, and the native CSV module will still be available.

edited May 29, 2012 at 16:34

answered May 29, 2012 at 15:47

Charles Duffy

299k43 gold badges440 silver badges495 bronze badges

2 Comments

Steven Rumbalski Over a year ago

And if you need to read .xlsx files use openpyxl.

John Y Over a year ago

I prefer xlsxrd to read .xlsx files. At some point, it will be merged into xlrd.

akshayk07 · Accepted Answer · 2019-08-01 18:57:45Z

0

First read your Excel spreadsheet into Pandas. The code below will import your Excel spreadsheet into Pandas as an OrderedDict which contains all of your worksheets as DataFrames. Then, simply use the worksheet_name as a key to access specific worksheet as a DataFrame and save only the required worksheet as a csv file by using df.to_csv(). Hope this will work in your case.

import pandas as pd
df = pd.read_excel('YourExcel.xlsx', sheet_name=None)
df['worksheet_name'].to_csv('output.csv')

edited Aug 1, 2019 at 18:57

akshayk07

2,2201 gold badge25 silver badges35 bronze badges

answered Aug 1, 2019 at 17:30

Ashu007

7951 gold badge9 silver badges14 bronze badges

Collectives™ on Stack Overflow

How to save an Excel worksheet as CSV

5 Answers 5

2 Comments

4 Comments

4 Comments

2 Comments

Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

2 Comments

4 Comments

4 Comments

2 Comments

Comments

Linked

Related