sed delete lines not containing specific string

Question

I'm new to sed and I have the following question. In this example:

some text here
blah blah 123
another new line
some other text as well
another line

I want to delete all lines except those that contain either string 'text' ~~and~~ or string 'blah', so my output file looks like this:

some text here
blah blah 123
some other text as well

Any hints how this can be done using sed?

Must the answer use sed? grep would do this very easily.

– Tim, Commented Mar 2, 2012 at 23:48
askubuntu.com/a/847004/638128

– Stack Underflow, Commented May 12, 2020 at 0:57

potong · Accepted Answer · 2012-03-03 06:43:52Z

130

This might work for you:

sed '/text\|blah/!d' file
some text here
blah blah 123
some other text as well
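
A minimal sketch for trying this out, assuming GNU sed: the `\|` alternation inside a basic regular expression is a GNU extension and is not guaranteed elsewhere. The variable names are only for illustration. The second command is a variant that avoids `\|`: `b` with no label jumps to the end of the script, where the line is printed by default, and every other line reaches `d`.

```shell
# Sample input from the question.
input='some text here
blah blah 123
another new line
some other text as well
another line'

# GNU sed: delete every line that does NOT match text or blah.
gnu_out=$(printf '%s\n' "$input" | sed '/text\|blah/!d')

# Variant without \|: matching lines branch to the end of the script
# (and are printed by default); all remaining lines are deleted.
portable_out=$(printf '%s\n' "$input" | sed -e '/text/b' -e '/blah/b' -e 'd')

printf '%s\n' "$portable_out"
```

Both commands should keep the same three lines shown above.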

answered Mar 3, 2012 at 6:43

potong

4 Comments

discipulus Over a year ago

How can I for example specify that text or blah could only appear in the last column ?

Melebius Over a year ago

@lovedynasty If you mean at the end of line, you should use $: '/text$\|blah$/!d'

burglarhobbit Over a year ago

@potong This does not work if I replace the separator char / with #, any idea why?

potong Over a year ago

@burglarhobbit I presume you mean the regexp delimiter /. In the substitution command this may be set to any character, e.g. s#...#...#. However, when the regexp is used as an address for matching, the first delimiter must be escaped with a backslash: to use # as the delimiter, write \#match#d to delete lines that match.
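
A short sketch of the delimiter rule, again assuming GNU sed for the `\|` alternation. The input and variable names are made up for illustration. Only the opening delimiter of the address takes the backslash.

```shell
input='some text here
blah blah 123
another new line'

# Default delimiter: no backslash is needed before the opening /.
slash_out=$(printf '%s\n' "$input" | sed '/text\|blah/!d')

# Custom delimiter #: the opening # of the address is written as \#.
hash_out=$(printf '%s\n' "$input" | sed '\#text\|blah#!d')

printf '%s\n' "$hash_out"
```

A custom delimiter is mostly useful when the pattern itself contains slashes, such as a file path.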

Jonathan Leffler · Accepted Answer · 2014-12-09 14:36:58Z

19

You want to print only those lines which match either 'text' or 'blah' (or both), where the distinction between 'and' and 'or' is rather crucial.

sed -n -e '/text/{p;n;}' -e '/blah/{p;n;}' your_data_file

The -n means don't print by default. The first pattern searches for 'text', prints the line if it matches, and skips to the next line; the second pattern does the same for 'blah'. If the 'n' were not there, a line containing 'text and blah' would be printed twice. Although I could have used just -e '/blah/p', the symmetry is better, especially if you need to extend the list of matched words.

If your version of sed supports extended regular expressions (for example, GNU sed does, with -r), then you can simplify that to:

sed -r -n -e '/text|blah/p' your_data_file
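
As a side note, `-E` is the spelling of this option accepted by both GNU and BSD sed (GNU sed treats `-r` as a synonym). A minimal sketch with made-up input, including one line that contains both words to show that it is printed only once:

```shell
input='some text here
text and blah
another line'

# Extended regular expressions: | is alternation without a backslash.
ere_out=$(printf '%s\n' "$input" | sed -n -E '/text|blah/p')

printf '%s\n' "$ere_out"
```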

edited Dec 9, 2014 at 14:36

answered Mar 2, 2012 at 23:46

Jonathan Leffler

9 Comments

glenn jackman Over a year ago

If sed does not support -r it probably won't support {} either. This should work with older seds: sed '/text\|blah/!d' file

Jonathan Leffler Over a year ago

The { ... } grouping of commands was in the 7th Edition UNIX version of sed; I can't think how you'd come across a version where that was not supported.

JamesThomasMoon Over a year ago

I found this easier sed -n -e '/keep-this/p' -e '/keep-that/p' -e '/keep-those/p'. No compound command required. No problems with "line would be printed twice". Additionally, in my case, adding the extra command n in {p;n;} would drop the first expression match. This may have been due to the particular input string. But this answer really helped me find where to look!

Jonathan Leffler Over a year ago

@JamesThomasMoon1979 — Be wary of a line that contains "keep-this" he said, and "keep-that", not to mention "keep-those". Your version will print the line three times; mine just once. It depends on your required output. If you want the single line printed three times, your solution is good. If not, then it leaves something to be desired.

JamesThomasMoon Over a year ago

Great feedback. In my case, I ran into this error scenario. Given that I want to print both input lines, a and a b, the following command prints only the first input line: echo -e 'a\na b' | sed -n -e '/b/{p;n;}' -e '/a/{p;n;}'. But if I change the order of the expressions, the command prints both input lines: echo -e 'a\na b' | sed -n -e '/a/{p;n;}' -e '/b/{p;n;}'.
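
The order dependence above seems to come from `n`: it loads the next input line in the middle of the script, so that line is tested only against the expressions that come after the one that matched. A sketch of one possible workaround, not taken from the answer itself, is to end the cycle with `d` instead, so that every line starts again from the first expression:

```shell
input='a
a b'

# Original form: after printing "a", n pulls in "a b", which is never
# tested against /b/ because that expression has already been passed.
n_out=$(printf '%s\n' "$input" | sed -n -e '/b/{p;n;}' -e '/a/{p;n;}')

# Variant: d ends the current cycle after printing, so the next line is
# tested against every expression and a matching line is printed once.
d_out=$(printf '%s\n' "$input" | sed -n -e '/b/{p;d;}' -e '/a/{p;d;}')

printf '%s\n' "$d_out"
```

With this input the first command prints only a, while the second prints both lines regardless of the order of the expressions.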

Avinash Raj · Accepted Answer · 2014-09-27 07:06:08Z

12

You could simply do it with awk:

$ awk '/blah|text/' file
some text here
blah blah 123
some other text as well

answered Sep 27, 2014 at 7:06

Avinash Raj

Viktor Csomor · Accepted Answer · 2020-06-08 07:43:31Z

1

Are you looking for grep? Here is an example that searches for several different strings.

cat yourfile.txt | grep "text\|blah"
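
A hedged sketch of two equivalent spellings that read the file directly instead of going through cat. The temporary file stands in for yourfile.txt, and the variable names are only for illustration. Repeating `-e` gives one pattern per option, and `-E` switches to extended regular expressions so that `|` needs no backslash.

```shell
# Temporary stand-in for yourfile.txt.
tmp=$(mktemp)
printf '%s\n' 'some text here' 'blah blah 123' 'another new line' \
    'some other text as well' 'another line' > "$tmp"

# One -e per pattern: a line is kept if it matches any of them.
multi_out=$(grep -e 'text' -e 'blah' "$tmp")

# Extended regular expression with plain | alternation.
ere_out=$(grep -E 'text|blah' "$tmp")

rm -f "$tmp"
printf '%s\n' "$multi_out"
```

To save the result, redirect the output of either command to a new file.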

answered Jun 8, 2020 at 7:43

Viktor Csomor

2 Comments

xtropicalsoothing Jan 8 at 20:50

This worked perfectly and was the simplest solution among all the other answers. Props.

xtropicalsoothing Jan 9 at 16:44

Even simpler with just grep "text\|blah" > your_new_file.txt!
