Revisions to Counting the instances of specific words in a text, using awk [duplicate]

replaced http://unix.stackexchange.com/ with https://unix.stackexchange.com/

Source Link

edited Apr 13, 2017 at 12:36

1

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

EDIT2: To those who marked this question as duplicate: the question you point to question you point to is about counting all the words, whereas this one is about counting only the instances of a specific pre-defined set of words.

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

EDIT2: To those who marked this question as duplicate: the question you point to is about counting all the words, whereas this one is about counting only the instances of a specific pre-defined set of words.

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

EDIT2: To those who marked this question as duplicate: the question you point to is about counting all the words, whereas this one is about counting only the instances of a specific pre-defined set of words.

explaining why this is not a duplicate of the referred question

Source Link

edited Dec 11, 2014 at 12:18

mitchus

205
3
10

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

EDIT2: To those who marked this question as duplicate: the question you point to is about counting all the words, whereas this one is about counting only the instances of a specific pre-defined set of words.

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

EDIT2: To those who marked this question as duplicate: the question you point to is about counting all the words, whereas this one is about counting only the instances of a specific pre-defined set of words.

Post Closed as "Duplicate" by muru, jofel, Anthon, jimmij, Gilles 'SO- stop being evil'

Get text-file word occurrence count of all words & print output sorted

occurred Dec 10, 2014 at 22:02

sample input and output

Source Link

edited Dec 10, 2014 at 18:49

mitchus

205
3
10

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

If I have a file words_of_interest.txt with one word per line, is there a way to use awk (or some other *nix tools) to obtain the number of times each of these words occurs in another text file my_text.txt, using only one pass?

Currently I am grep -c'ing the text for each word, but this is quite slow because the text is large, and there are several hundred words to search for.

EDIT: providing sample input and output:

[words_of_interest.txt]
joe
hi

[my_text.txt]
hi joe
hi jack
nice day today

[output]
joe 1
hi 2

Source Link

asked Dec 10, 2014 at 18:33

mitchus

205
3
10

Loading

Stack Exchange Network

Return to Question