POSIXified `tr`, removed redundant `-k1`

edited Nov 30, 2016 at 9:27

1. Split the input into words, one per line.
2. Sort the resulting list of words (lines).
3. Squash multiple occurrences.
4. Sort by occurrence count.

To split the input into words, replace any character that you deem to be a word separator by a newline.

<input_file \
tr -sc '[:alpha:]' '[\n*]' | # Add digits, -, ', ... if you consider
                             # them word constituents
sort |
uniq -c |
sort -nr
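
As a rough sketch of how the pipeline behaves on sample input, it can be wrapped in a shell function and fed a few lines of text. The function names `word_freq` and `word_freq_alnum` are hypothetical, and the second variant assumes that digits and apostrophes should count as word constituents.

```shell
# Hypothetical wrapper around the pipeline above; reads text on standard input.
word_freq() {
  tr -sc '[:alpha:]' '[\n*]' |  # each run of non-letters becomes one newline
  sort |                        # bring identical words together
  uniq -c |                     # collapse each run into "count word"
  sort -nr                      # highest count first
}

# Variant (an assumption about what counts as a word): digits and apostrophes
# are kept as word constituents, so "don't" and "2016" stay in one piece.
word_freq_alnum() {
  tr -sc "[:alnum:]'" '[\n*]' |
  sort |
  uniq -c |
  sort -nr
}

# Show the most frequent word for each variant.
printf 'the cat and the hat\nthe end\n' | word_freq | head -n 1
printf "don't stop, don't\n" | word_freq_alnum | head -n 1
```

With the letters-only version, `don't` would be split into `don` and `t`, which is why the set passed to `tr` is worth adjusting to the text at hand.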

answered May 20, 2012 at 23:58

Gilles 'SO- stop being evil'

1. Split the input into words, one per line.
2. Sort the resulting list of words (lines).
3. Squash multiple occurrences.
4. Sort by occurrence count.

To split the input into words, replace any character that you deem to be a word separator by a newline.

<input_file \
tr -sc '[:alpha:]' '\n' |   # Add digits, -, \', ... if you consider them word constituents
sort |
uniq -c |
sort -k 1nr