Ignore duplicates in regex pattern

Question

I have a regex pattern that searches for words in a text file. How do I ignore duplicates?

For instance, take a look at this code:

$pattern = '/(lorem|ipsum|daboom|pahwal|ababaga)/i';
$num_found = preg_match_all( $pattern, $string, $matches );

echo "$num_found match(es) found!\n";
echo "Matched words: " . implode( ',', $matches[0] ) . "\n";

If a word such as lorem appears more than once in the article, the output looks something like this:

5 matches found!
Matched words: daboom,lorem,lorem,lorem,lorem

I want each word to be reported only once, at its first occurrence, with later repeats ignored, so the output should be:

2 matches found!
Matched words: daboom,lorem

Alin P. · Accepted Answer · 2010-12-22 09:49:40Z

Run array_unique on $matches[0]. If the deduplication should be case-insensitive, lowercase the matches first with array_map and strtolower.

$pattern = '/(lorem|ipsum|daboom|pahwal|ababaga)/i';
preg_match_all( $pattern, $string, $matches );
// Lowercase every match, then drop repeats; use an empty array when nothing matched.
$matches = $matches[0] ? array_unique( array_map( 'strtolower', $matches[0] ) ) : array();

echo count( $matches ) . " match(es) found!\n";
echo "Matched words: " . implode( ',', $matches ) . "\n";
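
One side effect of lowercasing before array_unique is that the original spelling of each match is lost, and array_unique also keeps the original array keys, so the result can have gaps in its indexes. If either matters, a variation along these lines might help. It is only a sketch: the sample $string and the $unique variable are made up for illustration and are not part of the original post.

```php
<?php
// Hypothetical sample text; the original post does not show $string.
$string = 'Daboom lorem dolor LOREM sit lorem amet Lorem';

$pattern = '/(lorem|ipsum|daboom|pahwal|ababaga)/i';
preg_match_all($pattern, $string, $matches);

// Key each match by its lowercase form so repeats collide,
// and keep the spelling of the first occurrence only.
$unique = array();
foreach ($matches[0] as $word) {
    $key = strtolower($word);
    if (!isset($unique[$key])) {
        $unique[$key] = $word;
    }
}

// Reindex from 0 so the result is a plain list.
$unique = array_values($unique);

echo count($unique) . " match(es) found!\n";
echo "Matched words: " . implode(',', $unique) . "\n";
```

With the sample text above this prints "2 match(es) found!" followed by "Matched words: Daboom,lorem", keeping the capitalisation of the first hit for each word.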

edited Dec 22, 2010 at 9:49

answered Dec 22, 2010 at 9:29


1 Comment

HyderA Over a year ago

slaps forehead Why didn't I think of that?
