Fixing header and print

Question

I have header starting with '>' and I want fix the header by keeping first word and removing other shown as in output.txt and print it

input.txt

>AGAJ01065549.1 scaffold:Xipmac4.4.2:AGAJ01065549.1:1:500:1 REF
CGCCAGGTGTCTGGCGTAATAGCGCCAGCGCCAGGTGTCATATACGTAATAGCGCCAGGT
>RGAMMT01065456.1 scaffold:Xipmac4.4.2:AGAJ01065595.1:1:500:1 REF
GACTAGTTTTTACATATAGTAATGGTTATTCGGAAGTGTACAGACGTTTTCAGGTTTTTT
TTTGGTAGGGGTTGAGGTGTTGAGGTGAGGGGACTATGTGGAGGGAACTTTCCATAGAGG

output.txt

>AGAJ01065549.1 
CGCCAGGTGTCTGGCGTAATAGCGCCAGCGCCAGGTGTCATATACGTAATAGCGCCAGGT
>RGAMMT01065456.1 
GACTAGTTTTTACATATAGTAATGGTTATTCGGAAGTGTACAGACGTTTTCAGGTTTTTT
TTTGGTAGGGGTTGAGGTGTTGAGGTGAGGGGACTATGTGGAGGGAACTTTCCATAGAGG

potong · Accepted Answer · 2012-11-03 01:31:33Z

3

This might work for you (GNU sed):

sed -i '/^>/s/\s.*//' file

answered Nov 3, 2012 at 1:31

potong

2661 silver badge2 bronze badges

Add a comment |

qbi · Accepted Answer · 2012-11-02 22:26:42Z

2

You can do this by piping the text through awk

awk '{print $1}' input.txt

This prints out the first entry of every line (entries are separated with spaces).

edited Nov 2, 2012 at 22:26

qbi

1,4591 gold badge15 silver badges32 bronze badges

answered Nov 2, 2012 at 20:46

Mark Cohen

1,3829 silver badges12 bronze badges

pass the file to awk directly, no need to cat and pipe it. Secondly your solution would break if non-header lines contain embedded spaces

iruvar
– iruvar

2012-11-02 21:28:55 +00:00
Commented Nov 2, 2012 at 21:28
True, but it addresses the example given and shows a solution that would work as long as the sample doesn't change.

Mark Cohen
– Mark Cohen

2012-11-02 21:34:30 +00:00
Commented Nov 2, 2012 at 21:34

Add a comment |

Community · Accepted Answer · 2017-04-13 12:36:58Z

1

Similar to the answer using awk is cut:

cut -d' ' -f 1 input.txt > output.txt

The -d option sets the delimiter to one space and -f selects the first field.

However you can also use sed:

sed 's,^\([^ ]\+\) .*,\1,' input.txt > output.txt

This command substitutes an expression. It looks the beginning of a line and copies every character into a buffer which is not white space. Furthermore it matches a white space and any other character. sed replaces this line with the buffer content.

edited Apr 13, 2017 at 12:36

CommunityBot

1

answered Nov 2, 2012 at 22:11

qbi

1,4591 gold badge15 silver badges32 bronze badges

Add a comment |

Stack Exchange Network

Fixing header and print

3 Answers 3

You must log in to answer this question.

Hot Network Questions

Fixing header and print

3 Answers 3

You must log in to answer this question.

Related

Hot Network Questions