How to format txt file in Python

Question

I am trying to convert a txt file into a csv file in Python. The txt file currently consists of several strings separated by spaces, and I would like to write each string into its own cell in the csv file.

The txt file has the following structure:

UserID Desktop Display (Version) (Server/Port handle), Date

etc.

My approach is the following:

with open('licfile.txt', "r+") as in_file:
    stripped = (line.strip() for line in in_file)
    lines = (line.split(" ") for line in stripped if line)

with open('licfile.csv', 'w') as out_file:
    writer = csv.writer(out_file)
    writer.writerow(('user', 'desktop', 'display', 'version', 'server', 'handle', 'date'))
    writer.writerows(lines)

Unfortunately, this is not working as expected. I get the following error: ValueError: I/O operation on closed file. Additionally, only the intended headers appear in the csv file, all in one cell.

Any tips on how to proceed? Many thanks in advance.

You should include the input and output of the script (or some part of it).
– luis.parravicini, Commented Sep 19, 2019 at 11:57
Just use read_lines = in_file.readlines() before stripped = (line.strip() for line in in_file) to read the lines in the buffer before iterating over them. Otherwise, the logic of the code seems good enough.
– najeeb khan, Commented Sep 19, 2019 at 12:55
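
One way to read the ValueError in the question: both generator expressions are lazy, so no line is actually read until writer.writerows(lines) runs, and by then the first with block has already closed licfile.txt. A minimal sketch of the same approach with both files open at once follows. The function name convert and the sample data are illustrative assumptions, not part of the question.

```python
import csv
import os
import tempfile

HEADER = ('user', 'desktop', 'display', 'version', 'server', 'handle', 'date')

def convert(txt_path, csv_path):
    """Write the space-separated fields of txt_path as CSV rows to csv_path."""
    # Both files stay open together, so the lazy generators are consumed
    # by writerows() before the input file is closed.
    with open(txt_path, 'r') as in_file, \
            open(csv_path, 'w', newline='') as out_file:
        stripped = (line.strip() for line in in_file)
        rows = (line.split(' ') for line in stripped if line)
        writer = csv.writer(out_file)
        writer.writerow(HEADER)
        writer.writerows(rows)

# Made-up sample data, only to exercise the function.
tmp_dir = tempfile.mkdtemp()
txt_path = os.path.join(tmp_dir, 'licfile.txt')
csv_path = os.path.join(tmp_dir, 'licfile.csv')
with open(txt_path, 'w') as f:
    f.write('alice pc1 :0 (v1.0) (srv/27000 101), 9/19\n'
            '\n'
            'bob pc2 :1 (v1.0) (srv/27000 102), 9/19\n')

convert(txt_path, csv_path)

with open(csv_path, newline='') as f:
    result = list(csv.reader(f))
```

Here result holds the header row followed by one row per non-empty input line. Fields that contain a comma, such as the handle with its trailing comma, are quoted by csv.writer automatically.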

SpghttCd · Accepted Answer · 2019-09-19 12:32:51Z

3

How about:

with open('licfile.txt', 'r') as in_file, open('licfile.csv', 'w') as out_file:
    for line in in_file:
        if line.strip():
            out_file.write(line.strip().replace(' ', ',') + '\n')

And for the German Excel enthusiasts...

...
    ...
        ...
            ... .replace(' ', ';') + '\n')

:)
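
A caveat on the replace-based approach: str.replace(' ', ',') turns every single space into a separator, so a run of several spaces produces empty fields. If the input may contain such runs (an assumption, since the question does not show real data), a sketch based on str.split() without arguments, which splits on any run of whitespace, could look like this. The helper name to_delimited is hypothetical.

```python
def to_delimited(line, sep=','):
    """Join the whitespace-separated fields of line with sep."""
    # split() with no argument collapses runs of whitespace and
    # ignores leading/trailing whitespace, including the newline.
    return sep.join(line.split())

sample = 'u1   pc1 :0\n'  # made-up line with a run of three spaces

comma_row = to_delimited(sample)               # comma-separated
semicolon_row = to_delimited(sample, sep=';')  # semicolon-separated
naive_row = sample.strip().replace(' ', ',')   # every space becomes a comma
```

Neither variant quotes fields that already contain the separator (for example the comma before Date in the structure shown in the question); the csv module takes care of that.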

edited Sep 19, 2019 at 12:32

answered Sep 19, 2019 at 11:56

SpghttCd


7 Comments

sim_rum Over a year ago

Thanks for the hint. This solution at least writes the data into the csv, but unfortunately it writes everything into one column. I would like to separate each record from the others.

SpghttCd Over a year ago

What do you understand by column? It puts commas in between all entries where there were spaces before. If you define column by what-is-put-in-different-columns-when-I-load-that-csv-with-Excel, then you should have a look at what Excel uses as the column separator. You're in Germany? Try ;...

SpghttCd Over a year ago

Yes, in Germany (among other places) the standard decimal sign is a comma, which is unfortunately exactly the same character that is used as the column separator in (well) comma-separated value files. Therefore the column separator here is a semicolon. This is part of the locale preferences topic in software in general.

sim_rum Over a year ago

You guessed right, thanks for the explanation. Now, if I have several rows in the txt file with equivalent columns, can I simply specify that a new row should be started after every date?

sim_rum Over a year ago

Yes, all records are printed in one line but I would like to have one row per entry in the text file.


r.ook · Accepted Answer · 2019-09-19 12:53:06Z

You can also use the built-in csv module to accomplish this easily:

import csv

with open('licfile.txt', 'r') as in_file, open('licfile.csv', 'w') as out_file:
    reader = csv.reader(in_file, delimiter=" ")
    writer = csv.writer(out_file, lineterminator='\n')
    writer.writerows(reader)

I used the lineterminator='\n' argument here because the default is \r\n. When the output file is opened in text mode without newline='', that default can end up giving you an extra blank line per row, notably on Windows.

There are also a few more arguments you could use if, say, quoting is needed or a different delimiter is desired: https://docs.python.org/3/library/csv.html#csv-fmt-params
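
As an illustration of those formatting parameters, the sketch below writes semicolon-separated output and shows how the default minimal quoting behaves when a field contains the separator. It uses in-memory io.StringIO objects and made-up sample data in place of real files; both are assumptions for the sake of a self-contained example.

```python
import csv
import io

# Made-up sample input; io.StringIO stands in for the opened files.
in_file = io.StringIO('u1 pc1 :0 (v1) (srv/27000 101), 9/19\n')
out_file = io.StringIO()

reader = csv.reader(in_file, delimiter=' ')
writer = csv.writer(
    out_file,
    delimiter=';',              # column separator in the output
    quoting=csv.QUOTE_MINIMAL,  # quote only fields that need it (the default)
    lineterminator='\n',
)
writer.writerows(reader)
semicolon_output = out_file.getvalue()

# With the default comma delimiter, a field containing a comma is quoted.
comma_file = io.StringIO()
csv.writer(comma_file, lineterminator='\n').writerow(['101),', '9/19'])
comma_output = comma_file.getvalue()
```

With real files, the csv documentation recommends opening them with newline='' so that the writer controls the line endings itself.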

Amit Nanaware · Accepted Answer · 2019-09-19 12:01:33Z

1

You are using a comprehension with round brackets, which creates a tuple object. Use square brackets instead, which return a list. See the example below:

stripped = [line.strip() for line in in_file]
lines = [line.split(" ") for line in stripped if line]

answered Sep 19, 2019 at 12:01

Amit Nanaware


1 Comment

SpghttCd Over a year ago

No, sorry, but this is simply wrong. Round brackets with a for-expression result in a generator, not a tuple. And even though OP probably did that by accident, this is IMO not the problem here...
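
The difference between the two bracket styles can be checked directly. The sketch below uses io.StringIO with made-up content as a stand-in for an open text file (an assumption for the sake of a runnable example). It suggests that a list comprehension reads everything immediately, while a generator expression reads nothing until it is consumed, which may be why switching to square brackets happens to avoid the error even though no tuple is involved.

```python
import io

in_file = io.StringIO('a b\n\nc d\n')  # stands in for an open text file

# Round brackets: a generator expression, nothing is read yet.
gen = (line.strip() for line in in_file)
kind = type(gen).__name__

# Square brackets: a list comprehension, all lines are read right away.
as_list = [line.strip() for line in io.StringIO('a b\n\nc d\n')]

# Closing the file before consuming the generator mimics leaving
# the with block in the question.
in_file.close()
try:
    next(gen)
    error = None
except ValueError as exc:
    error = str(exc)  # reading from a closed file raises ValueError
```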

shaik moeed · Accepted Answer · 2019-09-19 11:58:08Z

0

import pandas as pd
licfile_df = pd.read_csv('licfile.txt', sep=",", header=None)

edited Sep 19, 2019 at 11:58

shaik moeed


answered Sep 19, 2019 at 11:54

Akshay Salvi


3 Comments

SpghttCd Over a year ago

@shaikmoeed I think there's some misunderstanding - spaces are the separators in the source file. Target file should be a csv, which by standard is generally comma separated. AFAIU this is in fact the main conversion which the whole post is about...

Akshay Salvi Over a year ago

Using pandas, it is very easy to convert a text file into a DataFrame. sep="," applies if your text file is separated by commas; it varies according to your text file.

SpghttCd Over a year ago

Yes, correct, it's easy from a usability point of view. But on the one hand it might be a little too heavy a library for a simple task like this, and on the other hand this post is simply not tagged with pandas...
