changed the typography a bit

edit approved Aug 19, 2020 at 12:13

user339704

I have two big files of 400,000 lines each. I want to compare column 1 of the second file with column 1 of the first file, line by line. If they match, I would like to print the whole line. Both files are sorted.

    file1:
        name   values
        aaa    10
        aab    acc
        aac    30
        aac    abc

    file2:
        aaa
        aac
        aac
        aad

Since each file contains 400,000 lines, processing takes a long time.

My current solution looks like this:

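    #!/bin/ksh
    # For every line of file2, search file1 for that text.
    while read -r line
    do
        # grep -q only sets the exit status. The pattern is not anchored,
        # so it can match anywhere in a line of file1, not just column 1.
        if grep -q -- "$line" file1
        then
            grep -- "$line" file1 >> present
        else
            # Append (>>) so earlier "missing" entries are not overwritten.
            echo " $line missing " >> missing
        fi
    done < file2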

Because I am using grep here, the value may match somewhere in file1 other than the intended column 1. I don't want that to happen.
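To illustrate the difference between matching anywhere on the line and matching column 1 only, here is a minimal sketch using awk instead of grep. It is an assumption-laden example, not the asker's method: the sample data is made up (including an extra `zzz    aad` row, where `aad` appears only in column 2), the files are assumed to be whitespace-separated with no header row, and the `" missing"` suffix simply mirrors the original script.

```shell
#!/bin/sh
# Sketch only: work in a scratch directory with made-up sample data.
dir=$(mktemp -d) && cd "$dir" || exit 1

printf '%s\n' 'aaa    10' 'aab    acc' 'aac    30' 'aac    abc' 'zzz    aad' > file1
printf '%s\n' 'aaa' 'aac ' 'aac' 'aad' > file2

# First file on the command line (NR == FNR): remember its first field.
# Second file: act only when its first field was, or was not, remembered.
# Only column 1 is compared, so the "aad" in column 2 of file1 is ignored.
awk 'NR == FNR { want[$1]; next } $1 in want' file2 file1 > present
awk 'NR == FNR { have[$1]; next } !($1 in have) { print $1 " missing" }' file1 file2 > missing
```

With this sample data, `present` receives the `aaa` line and both `aac` lines, and `missing` receives `aad missing`. Each file is read once per awk call, rather than once per key as in the grep loop.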

My expected solution:

Compare the second file only with column 1 of the first file (even done this way, it takes a long time).
Use a Perl script with file pointers to compare column 1 of the two files. If the strings match, print the line. Otherwise, if column 1 of the first file is greater than that of the second file, advance to the next line of file 2 and compare again. If it is the other way around, advance to the next line of file 1 and compare again.
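The two-pointer walk described above could be sketched roughly as follows. This uses awk rather than Perl, which is a substitution made for this example. It assumes both files are sorted in byte order (as `LC_ALL=C sort` would produce), that file1 has no header row, and that fields are whitespace-separated. The variable and function names (`keyfile`, `next_key`, `report`, `matched`) are hypothetical, and the sample data is made up.

```shell
#!/bin/sh
# Sketch only: work in a scratch directory with made-up sample data.
dir=$(mktemp -d) && cd "$dir" || exit 1

printf '%s\n' 'aaa    10' 'aab    acc' 'aac    30' 'aac    abc' > file1
printf '%s\n' 'aaa' 'aac ' 'aac' 'aad' > file2

# awk reads file1 as its main input and pulls keys from file2 on demand,
# so each file is read exactly once.
LC_ALL=C awk -v keyfile=file2 '
# Read the next key from keyfile into "key"; set more=0 at end of file.
# Appending "" forces key to be a string, so comparisons are string-based.
function next_key(   line, f) {
    more = (getline line < keyfile) > 0
    if (more) { split(line, f); key = f[1] "" }
}
# Report a key that is being passed over without having matched.
function report() { if (key != matched) print key " missing" > "missing" }

BEGIN { matched = ""; next_key() }
{
    # file2 key sorts before the file1 key: advance file2.
    while (more && key < ($1 "")) { report(); next_key() }
    # Keys are equal: print the file1 line and stay on the same file2 key,
    # because file1 may hold several lines with that key.
    if (more && key == ($1 "")) { print > "present"; matched = key }
    # Otherwise the file1 key is smaller: awk moves on to the next file1 line.
}
END { while (more) { report(); next_key() } }
' file1
```

With this sample data, `present` receives the `aaa` line and both `aac` lines, and `missing` receives `aad missing`. The sketch depends on the sort order matching the comparison order awk uses, which is why `LC_ALL=C` is set for the awk call.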

spelling, spaces around punctuation, formatting, capitalization

edited Jun 3, 2014 at 8:59

Anthon

asked Jun 3, 2014 at 8:11

user68365

compare two files based on a column and print it
