Python string split by template

Question

I'm new in Python and I struggle with a probably an easy problem for all of you out there and maybe you could help me please

Basically I would need a function that reads a continuous string and break it as following: first 5 characters, inserts comma, next 6 characters, inserts comma, next 6 characters, inserts comma, inserts new line and then repeats

The problem: my string is:

"CARMD000000000003FEFFE000004000004BCCXT000009000025BBT01000035000025"

I need to divide this string into comma by following rule: 5-6-6 \n

Expected result:

CARMD,000000,000003, 

FEFFE,000004,000004, 

BCCXT,000009,000025,

BBT01,000035,000025,

Thank you for your help.

宏杰李 · Accepted Answer · 2016-12-18 10:38:04Z

1

import re

text = "CARMD000000000003FEFFE000004000004BCCXT000009000025BBT01000035000025"
match = re.findall(r'([A-Z]{5})(\d{6})(\d{6})', text)
lines = [','.join(item) for item in match]
print(*lines, sep='\n')

out:

CARMD,000000,000003
FEFFE,000004,000004
BCCXT,000009,000025

use regex to match text, will return a list of tuple:

[('CARMD', '000000', '000003'), ('FEFFE', '000004', '000004'), ('BCCXT', '000009', '000025')]

than use list comprehension to construct a list, each element in list is string, concatenated by the tuple using ','.

lines:

['CARMD,000000,000003', 'FEFFE,000004,000004', 'BCCXT,000009,000025']

answered Dec 18, 2016 at 10:38

宏杰李

12.2k2 gold badges32 silver badges37 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

Cheyn Shmuel Over a year ago

This Solution doesn't save the final string into a variable. It just prints it out. I guess that's good if you only want it for one-time use.

宏杰李 Over a year ago

@Cheyn Shmuel i store data in list 'lines'

RomanPerekhrest · Accepted Answer · 2016-12-18 10:44:42Z

1

"One-line" solution using re.findall() and str.join() functions:

s = "CARMD000000000003FEFFE000004000004BCCXT000009000025BBT01000035000025"
chunks = ',\n'.join(','.join(t) for t in re.findall(r'(\w{5})(\w{6})(\w{6})', s))

print(chunks)

The output:

CARMD,000000,000003,
FEFFE,000004,000004,
BCCXT,000009,000025,
BBT01,000035,000025

answered Dec 18, 2016 at 10:44

RomanPerekhrest

93.1k4 gold badges75 silver badges112 bronze badges

Comments

ettanany · Accepted Answer · 2016-12-18 10:53:00Z

1

An alternative to using regex is using list slicing with a for loop like below:

>>> s = 'CARMD000000000003FEFFE000004000004BCCXT000009000025BBT01000035000025'
>>> 
>>> for i in range(len(s) / 17):
...     temp = s[i*17:i*17+17]
...     print '{}, {}, {},'.format(temp[:5], temp[5:11], temp[11:17])
...
CARMD, 000000, 000003,
FEFFE, 000004, 000004,
BCCXT, 000009, 000025,
BBT01, 000035, 000025,

answered Dec 18, 2016 at 10:53

ettanany

20k9 gold badges49 silver badges64 bronze badges

Comments

Cheyn Shmuel · Accepted Answer · 2016-12-18 11:02:52Z

0

A simple program like this should do the trick:

s = "CARMD000000000003FEFFE000004000004BCCXT000009000025BBT01000035000025"
new_s = ''
while s:
    for x in (5, 6, 6):
        new_s += s[:x]
        s = s[x:]
        new_s += ','
    new_s += '\n'

print(new_s)

output:

CARMD,000000,000003,
FEFFE,000004,000004,
BCCXT,000009,000025,
BBT01,000035,000025,

I found a nested loop efficient.

edited Dec 18, 2016 at 11:02

answered Dec 18, 2016 at 10:47

Cheyn Shmuel

4588 silver badges15 bronze badges

Collectives™ on Stack Overflow

Python string split by template

4 Answers 4

2 Comments

Comments

Comments

Comments

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

2 Comments

Comments

Comments

Comments

Related