What is the usage of pattern before substitute command in sed

Question

There is an example in this link about sed:

To delete the first number on all lines that start with a "#" use:

sed '/^#/ s/[0-9][0-9]*//'

What is the benefit of first pattern(/^#/)? It could be simply:

sed 's/^#[0-9][0-9]*//'

@Bernhard One good reason is maximum portability. I don't think \+ is guaranteed by POSIX. — jw013
– jw013, Commented Apr 18, 2012 at 7:22
@Barnhard I just copied it from the link. But this wikipedia article says that \+ is in POSIX extended regular expressions. en.wikipedia.org/wiki/Regular_expression#Syntax — Majid Azimi
– Majid Azimi, Commented Apr 18, 2012 at 7:54
Every modern implementation of sed I've encountered has the ability to use EREs (sometimes with flag -r, other times with flag -E), and there is talk of adding this capacity to the POSIX standard for sed. @jw013 is correct though that the current POSIX standard doesn't require sed to handle anything other than BREs. EREs handle plain +; some sed implementations enhance their BREs to also handle \+, but if I remember rightly, this is not part of POSIX. Instead of p\+ you could use p\{1,\}, which is a POSIX BRE. — dubiousjim
– dubiousjim, Commented Oct 16, 2012 at 15:51

jw013 · Accepted Answer · 2012-07-25 15:25:30Z

The general format of sed commands is

[address[,address]] function

When a command has a single address, it operates on all lines that match that address. When a command has no address, it operates on every single line.

Reference: POSIX sed

Regarding your specific examples:

/^#/ s/[0-9][0-9]*//
- This command has an address, /^#/, which matches all lines beginning with a #.
- The substitution pattern is /[0-9][0-9]*/. This matches the first sequence of digits wherever it occurs in the line.
- Plain English summary: delete the first sequence of digits in every line beginning with a #.
- Example: # non-digits|5555|non-digits|5555 becomes # non-digits||non-digits|5555
s/^#[0-9][0-9]*//
- There is no address, so this command operates on every single line.
- The substitution pattern, /^#[0-9][0-9]*/, matches a sequence of consecutive digits preceded by a # anchored at the beginning of the line.
- Plain English summary: delete # followed by a sequence of digits (and only that pattern) from the beginning of every line.
- Example: #5555|non-digits|5555 becomes |non-digits|5555, but # non-digits|5555|non-digits|5555 is unchanged because the substitution pattern does not match.

Ignacio Vazquez-Abrams · Accepted Answer · 2012-04-18 06:17:33Z

2

The first will match and substitute:

#abc99

The second will not.

Plus, the second will also remove the initial #.

answered Apr 18, 2012 at 6:17

Ignacio Vazquez-Abrams

46.9k7 gold badges97 silver badges102 bronze badges

Add a comment |

Stack Exchange Network

What is the usage of pattern before substitute command in sed

2 Answers 2

You must log in to answer this question.

Hot Network Questions

What is the usage of pattern before substitute command in sed

2 Answers 2

You must log in to answer this question.

Related

Hot Network Questions