I have a data file A.tsv (tab-separated):
id  mutation
243 siti,toto,mumu
254     
267 lala,siti,sojo
289 lala
and a template file B.txt (the field separator does not matter, since it has only one line and one column):
lala,siti,mumu
I want to create a new file C.tsv that contains A.tsv plus a new column named mutation_not. That column should list only the mutations from the mutation column of A.tsv that are not present in the list in B.txt.
C.tsv should look like this:
id  mutation    mutation_not
243 siti,toto,mumu  toto
254     
267 lala,siti,sojo  sojo
289 lala
I tried with an exclude array:
awk 'NR==FNR {exclude[$0];next} !($0 in exclude)' file2 file1
but it does not give the expected result. Do you have any ideas? Thanks
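For reference, here is a minimal sketch of the filtering described above, not a definitive solution. It assumes that A.tsv really is tab-separated with a header line, that B.txt holds a single comma-separated line, and that there are no spaces around the commas and no Windows line endings. The array and variable names (items, muts, kept) are only illustrative. The first two lines just recreate the sample files shown above, so they will overwrite any existing A.tsv and B.txt in the current directory.

```shell
# Recreate the sample files from the question (A.tsv is tab-separated).
printf 'id\tmutation\n243\tsiti,toto,mumu\n254\t\n267\tlala,siti,sojo\n289\tlala\n' > A.tsv
printf 'lala,siti,mumu\n' > B.txt

awk 'BEGIN { FS = OFS = "\t" }
NR == FNR {
    # First file (B.txt): remember every comma-separated entry as excluded.
    n = split($0, items, ",")
    for (i = 1; i <= n; i++) exclude[items[i]]
    next
}
FNR == 1 {
    # Header line of A.tsv: append the new column name.
    print $0, "mutation_not"
    next
}
{
    # Data line: keep only the mutations that are not in the exclude list.
    kept = ""
    n = split($2, muts, ",")
    for (i = 1; i <= n; i++)
        if (!(muts[i] in exclude))
            kept = (kept == "" ? muts[i] : kept "," muts[i])
    print $1, $2, kept
}' B.txt A.tsv > C.tsv
```

The difference from the attempt above is that the exclude array is filled with the individual entries of B.txt (via split) instead of the whole line, and the lookup is done per mutation instead of per line. Rows with nothing left to report get an empty third field, so they end with a trailing tab.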

