I am relatively new to Linux in general and I hope someone can help me.
I would like to merge fastq.gz files from 4 different sequencing lanes. Each file has the following name: GC082_F4.lane1.1901.R1.fastq.gz with GC082_F4 the name of the sample, laneX referring to the lane (1 to 4) and R1 refers to the forward or reverse read ("R1" is reverse, "R2" is forward).
I want to concatenate the files from the 4 different lanes for each sample and this for the forward read (R1) and the reverse read (R2) separately. So for this specific sample that would be:
For the forward reads:
cat GC082_F4.lane1.1901.R1.fastq.gz \
  GC082_F4.lane2.1901.R1.fastq.gz GC082_F4.lane3.1901.R1.fastq.gz \
  GC082_F4.lane4.1901.R1.fastq.gz > GC082_F4.R1.fastq.gz
For the reverse reads:
cat GC082_F4.lane1.1901.R2.fastq.gz \
  GC082_F4.lane2.1901.R2.fastq.gz GC082_F4.lane3.1901.R2.fastq.gz \
  GC082_F4.lane4.1901.R2.fastq.gz > GC082_F4.R2.fastq.gz
But since I have to do this for +100 samples, I was wondering whether I could use one single loop for this?
Thank you in advance!


GC082_F4(the sample name) the variable you need to iterate over 100+ times? Where do the sample names come from? (A text file? A list you are manually typing? ...). Are the lanes going to be exactly "lane1", "lane2", "lane3" and "lane4" for every sample?.gzfiles. Uncompressing the concatenated file would result in a file which is the concatenation of the original uncompressed files.