Append second field of second line to first line from file

Question

how to generate the following file ( example in ) to the file as described in example out ,

each last word in state line ( example in ) , should be added to the last line of the previos line

example in

HDFS  worker01.gtdns.com
state  STARTED
HDFS  worker02.gtdns.com
state  STOP
HDFS  worker03.gtdns.com
state  STARTED
HDFS  worker05.gtdns.com
state  STARTED
HDFS  worker06.gtdns.com
state  STARTED
HDFS  worker07.gtdns.com
state  STARTED
HDFS  worker08.gtdns.com
state  STARTED
HDFS  worker09.gtdns.com
state  STOP

example out ( expected results )

HDFS  worker01.gtdns.com STARTED
HDFS  worker02.gtdns.com STOP
HDFS  worker03.gtdns.com STARTED
HDFS  worker05.gtdns.com STARTED
HDFS  worker06.gtdns.com STARTED
HDFS  worker07.gtdns.com STARTED
HDFS  worker08.gtdns.com STARTED
HDFS  worker09.gtdns.com STOP

bash is not a text editor.

don_crissti
– don_crissti

2018-01-08 22:51:53 +00:00
Commented Jan 8, 2018 at 22:51 — don_crissti
– don_crissti, Commented Jan 8, 2018 at 22:51

DopeGhoti · Accepted Answer · 2018-01-08 22:58:42Z

1

awk '$1 == "HDFS" { printf( "%s ", $0 ) }; $1=="state" { print $2 }' /path/to/input

The awk script is fairly self-explanatory: On lines where the first field is HDFS, append a space to the line and print it as-is with no trailing newline. On lines where the first field is state, print the second field with the (implied) trailing newline.

edited Jan 8, 2018 at 22:58

answered Jan 8, 2018 at 22:51

DopeGhoti

79.2k10 gold badges107 silver badges141 bronze badges

What's the reason for removing the sed and perl tags ? They're just as valid as awk here...

don_crissti
– don_crissti

2018-01-08 22:55:36 +00:00
Commented Jan 8, 2018 at 22:55
I thought I only removed the sed tag, but may have misdoubleclicked and caught perl's tag inadvertently. Feel free to re-add.

DopeGhoti
– DopeGhoti

2018-01-08 22:56:50 +00:00
Commented Jan 8, 2018 at 22:56
I don't want to re-add anything. As I said I'd like to know what's wrong with using e.g. the sed tag here...

don_crissti
– don_crissti

2018-01-08 22:58:01 +00:00
Commented Jan 8, 2018 at 22:58
Using sed to parse multi-line input is generally more complicated and prone to error than it's worth in my opinion, and given the nature of the OP, probably not the right hammer for this particular nail.

DopeGhoti
– DopeGhoti

2018-01-08 22:59:55 +00:00
Commented Jan 8, 2018 at 22:59
That's your opinion - a very subjective one at that.

don_crissti
– don_crissti

2018-01-08 23:19:10 +00:00
Commented Jan 8, 2018 at 23:19

Add a comment |

RomanPerekhrest · Accepted Answer · 2018-01-08 23:25:11Z

0

Short GNU AWK approach:

awk -v RS='[[:space:]]+state' '{ printf "%s", $0 }' file

-v RS='[[:space:]]+state' - treat state substring with leading whitespace(s) [[:space:]]+ as input record separator RS

The output:

HDFS  worker01.gtdns.com  STARTED
HDFS  worker02.gtdns.com  STOP
HDFS  worker03.gtdns.com  STARTED
HDFS  worker05.gtdns.com  STARTED
HDFS  worker06.gtdns.com  STARTED
HDFS  worker07.gtdns.com  STARTED
HDFS  worker08.gtdns.com  STARTED
HDFS  worker09.gtdns.com  STOP

For a "2-lined" static format - you may also try the following SED approach:

sed '/^[[:space:]]*HDFS/{ N; s/[[:space:]]*state // }' file

edited Jan 8, 2018 at 23:25

answered Jan 8, 2018 at 23:14

RomanPerekhrest

30.9k5 gold badges47 silver badges68 bronze badges

The awk option accepts lines that have anything other than HDFS. Probably it doesn't matter, but worth knowing.

user232326
– user232326

2018-01-09 00:15:15 +00:00
Commented Jan 9, 2018 at 0:15

Add a comment |

Wildcard · Accepted Answer · 2018-01-08 23:34:39Z

Using ex, the POSIX-specified scriptable file editor:

printf '%s\n' 'g/state/s/^ *state *//|-j' x | ex file.txt

The s command is a standard substitution. The -j means "on the previous line (-), execute the join command (j)" which joins the subsequent line with a space separation.

Actually, because the join command ignores leading spaces on the line to be joined, and because s reuses the previous regex if no regex is supplied, the following command works just as will and gives the same result:

printf '%s\n' 'g/state/s///|-j' x | ex file.txt

Note that this saves the changes to the file. To view the changes without saving them, use the following instead:

printf '%s\n' 'g/state/s///|-j' %p | ex file.txt

George Vasiliou · Accepted Answer · 2018-01-08 23:52:23Z

0

gnu awk golfing:

$ awk '1' RS='\n[ \t]*state ' ORS='' file

Testing:

$ awk '1' RS='\n[ \t]*state ' ORS='' file
HDFS  worker01.gtdns.com STARTED
HDFS  worker02.gtdns.com STOP
HDFS  worker03.gtdns.com STARTED
HDFS  worker05.gtdns.com STARTED
HDFS  worker06.gtdns.com STARTED
HDFS  worker07.gtdns.com STARTED
HDFS  worker08.gtdns.com STARTED
HDFS  worker09.gtdns.com STOP

RS is the Input record separator
ORS is the output record separator

edited Jan 8, 2018 at 23:52

answered Jan 8, 2018 at 23:09

George Vasiliou

8,1013 gold badges24 silver badges43 bronze badges

Add a comment |

score 0 · Accepted Answer · 2018-01-09 00:10:20Z

0

Assuming there are always one state after the HDFS, This solve the issue:

awk '$1=="HDFS"{l=$0;next};$1=="state"{print(l,$2);l=""}' file

$1=="HDFS"{ … } For lines the field 1 is HDFS do ….
l=$0;next Store line in var l (elle, line) move to next line.
$1=="state"{ … } for lines that field 1 is state do …
{print(l,$2)} print the line stored in var l (elle) and field 2.
{l=""} Avoid printing stale (old) values of l.

edited Jan 9, 2018 at 0:10

answered Jan 8, 2018 at 22:59

user232326

golfing based on the condition{action} awk default synthax: awk '$1=="HDFS"{l=$0}$1=="state"{print(l,$2)}'

George Vasiliou
– George Vasiliou

2018-01-08 23:32:39 +00:00
Commented Jan 8, 2018 at 23:32
Thanks @GeorgeVasiliou answer edited, description added.

user232326
– user232326

2018-01-08 23:43:14 +00:00
Commented Jan 8, 2018 at 23:43
Just for fun, this can be golfed more if one state come always after HDFS: awk '$1=="state"{print l,$2}{l=$0}'. Give it a try.

George Vasiliou
– George Vasiliou

2018-01-08 23:48:27 +00:00
Commented Jan 8, 2018 at 23:48
Good idea. @GeorgeVasiliou Probably not much aster, but I like this better: awk '$1=="state"{print l,$2;next}{l=$0}'

user232326
– user232326

2018-01-09 00:05:11 +00:00
Commented Jan 9, 2018 at 0:05
@GeorgeVasiliou Besides, the answer as it stands now seems more robust.

user232326
– user232326

2018-01-09 00:11:56 +00:00
Commented Jan 9, 2018 at 0:11

Add a comment |

Praveen Kumar BS · Accepted Answer · 2018-01-09 02:49:39Z

0

Got the result by using below sed one liner

 sed "s/state//g" filename| sed "N;s/\n/ /g"

output

HDFS  worker01.gtdns.com   STARTED
HDFS  worker02.gtdns.com   STOP
HDFS  worker03.gtdns.com   STARTED
HDFS  worker05.gtdns.com   STARTED
HDFS  worker06.gtdns.com   STARTED
HDFS  worker07.gtdns.com   STARTED
HDFS  worker08.gtdns.com   STARTED
HDFS  worker09.gtdns.com   STOP

answered Jan 9, 2018 at 2:49

Praveen Kumar BS

5,3112 gold badges11 silver badges16 bronze badges

Add a comment |

Stack Exchange Network

Append second field of second line to first line from file

6 Answers 6

You must log in to answer this question.

Hot Network Questions

Append second field of second line to first line from file

6 Answers 6

You must log in to answer this question.

Related

Hot Network Questions