How to match string with Regex in C#

Question

I have this string Sample Text <[email protected]> and this string [email protected], and I'm trying to match the preceding text ("Sample Text" in this example), if it exists, and the email without the "<" and ">" characters. There may be whitespace before and after. At first I used Regex.Split with the expression @"\s*(.*)<(.*@.*)>\s*", but it gave me 4 strings instead of 2. The 2 strings that I wanted were correct, but it also returned empty strings. Now I'm trying Regex.Matches with the expression @"\s*(.*)(?: <)?(.*@.*)(?:>)?\s*", and it finds 3 matches. Two are again the correct ones, and the other is the input string itself. As for the second string, it doesn't work. How do I fix this?

Not going to add an answer since other people seem to have basically covered it. But if you decide to go with regex instead of the MailAddress class (if, for example, you need to search for the e-mail addresses), you could write your regex very loosely and parse and clean up the strings afterwards (using MailAddress and/or calls to string.Split and string.Trim). Trying to make the regex both search for and validate the proper format, as well as clean up the strings, might make it more complicated than it needs to be.
– Merlyn Morgan-Graham, Commented May 8, 2011 at 22:34
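
A minimal sketch of the loose-search-then-clean-up approach this comment describes. The input sentence and the addresses are hypothetical placeholders (the originals are obscured above), and the loose pattern `\S+@\S+` and the set of trimmed punctuation characters are illustrative choices, not something the comment specifies.

```csharp
using System;
using System.Collections.Generic;
using System.Net.Mail;
using System.Text.RegularExpressions;

// Hypothetical text containing two placeholder addresses.
string text = "Contact Sample Text <user@example.com>, or other@example.org; thanks.";

var found = new List<string>();

// Loose search: any run of non-whitespace characters containing an '@'.
foreach (Match m in Regex.Matches(text, @"\S+@\S+"))
{
    // Clean-up: strip brackets and trailing punctuation picked up by the loose pattern.
    string candidate = m.Value.Trim('<', '>', ',', ';', '.');
    try
    {
        // Let MailAddress decide whether the cleaned candidate is well formed.
        found.Add(new MailAddress(candidate).Address);
    }
    catch (FormatException)
    {
        // Not a usable address; skip it.
    }
}

Console.WriteLine(string.Join(", ", found));
```

Keeping the pattern loose and leaving validation to MailAddress avoids encoding the address grammar in the regex itself.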

Oleks · Accepted Answer · 2011-05-08 20:02:17Z

3

This could be done without regex. Take a look at the MailAddress class; it can parse strings like the ones in your example:

var mailAddress = new MailAddress("Sample Text <[email protected]>");

Here the mailAddress.Address property will contain [email protected], and mailAddress.DisplayName will contain Sample Text.
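
A short sketch applying MailAddress to both input shapes from the question. The address `user@example.com` is a placeholder standing in for the obscured original, and the call to `Trim()` is an added precaution for the surrounding whitespace the question mentions.

```csharp
using System;
using System.Net.Mail;

// Placeholder inputs: one with a display name and padding, one bare address.
string[] inputs = { "  Sample Text <user@example.com>  ", "user@example.com" };

foreach (string raw in inputs)
{
    // Trim first so this sketch does not depend on how the parser treats padding.
    var parsed = new MailAddress(raw.Trim());
    Console.WriteLine($"DisplayName='{parsed.DisplayName}', Address='{parsed.Address}'");
}
```

For the bare address, DisplayName is an empty string, so the caller can tell the two forms apart without a second parse. The constructor throws a FormatException on input it cannot parse.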

answered May 8, 2011 at 20:02


1 Comment

Merlyn Morgan-Graham Over a year ago

@user579674: Read the Remarks section on the page he linked. The implementation is somewhat complex, but I am guessing that "Display Name <[email protected]>" is a semi-standard format. msdn.microsoft.com/en-us/library/…

David · Accepted Answer · 2011-05-08 19:29:03Z

2

Based on your test cases, this regex may work:

(.*)\s?\<(.*)\>

This will give you two results: (1) the preceding text and (2) the text contained within the <> brackets.
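
A sketch of reading the two groups from this pattern in C#, using a placeholder address in place of the obscured original. The variable names are illustrative.

```csharp
using System;
using System.Text.RegularExpressions;

const string Pattern = @"(.*)\s?\<(.*)\>";

// Placeholder input in the "text <address>" form.
string withName = "Sample Text <user@example.com>";

Match m = Regex.Match(withName, Pattern);

// Group 1 is greedy and keeps the space before '<', so trim it.
string name = m.Groups[1].Value.Trim();
// Group 2 is the text between the angle brackets.
string email = m.Groups[2].Value;
Console.WriteLine($"name='{name}', email='{email}'");

// The pattern requires a literal '<', so a bare address does not match at all.
bool bareMatches = Regex.IsMatch("user@example.com", Pattern);
Console.WriteLine($"bare address matches: {bareMatches}");
```

Because the pattern cannot match an address without angle brackets, the second string from the question needs separate handling.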

If you care about ensuring the email is valid, you may wish to look at a more thorough email regex, but I am guessing you are trying to match a string that has come from an email or mail server, so that may not be a problem.

Also, it's worth grabbing a regex-building program such as Expresso, or using one of the many online tools, to help build your regex.

edited May 8, 2011 at 19:29

answered May 8, 2011 at 19:23


4 Comments

user579674 Over a year ago

Okay, this does fix the problem for the second string. Isn't there a way to do it in one expression?

David Over a year ago

Not really. In the solution I gave, you are asking regex to match two groups for you, i.e. the parts in () parentheses. How can you return two pieces of data as one item?

user579674 Over a year ago

It turns out that the regex doesn't work for the second string. Anyway, I think that if Regex.Matches does return the initial result, then I'll just do another one for the second string.

David Over a year ago

I would be surprised if the second part didn't match anything. If you can provide any more test cases, we could fine-tune the regex.

ariel · Accepted Answer · 2011-05-08 19:36:57Z

1

Regex.Matches always returns the full match as the first group (group 0) of each match, so just ignore it and use the second and third (groups 1 and 2).

To match the second type of string (email only), it is better to try the first pattern and, if nothing is found, match the second using a separate email-only regex.
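
A sketch of this try-one-then-the-other approach. The helper name `Parse`, the two anchored patterns, and the placeholder address are assumptions made for illustration; the answer does not give exact expressions.

```csharp
using System;
using System.Text.RegularExpressions;

// Hypothetical helper: returns (name, email); name is empty for a bare address.
static (string Name, string Email) Parse(string input)
{
    // First form: optional text followed by <address>.
    Match m = Regex.Match(input, @"^\s*(.*?)\s*<(.*@.*)>\s*$");
    if (m.Success)
    {
        // Groups[0] is the whole match; the captures start at Groups[1].
        return (m.Groups[1].Value, m.Groups[2].Value);
    }

    // Second form: a bare address with optional surrounding whitespace.
    m = Regex.Match(input, @"^\s*(\S+@\S+)\s*$");
    return m.Success ? ("", m.Groups[1].Value) : ("", "");
}

var first = Parse("  Sample Text <user@example.com>  ");
var second = Parse("user@example.com");
Console.WriteLine($"name='{first.Name}', email='{first.Email}'");
Console.WriteLine($"name='{second.Name}', email='{second.Email}'");
```

Keeping the two forms in separate patterns makes each one simple enough to read at a glance, at the cost of a second match attempt for bare addresses.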

edited May 8, 2011 at 19:36

answered May 8, 2011 at 19:20


stema · Accepted Answer · 2011-05-08 22:14:29Z

0

Try this one here

\s*(.*?)(?: <)?(\S*@.*)(?:>)?\s*

I changed yours only a bit.

I added ? to the first group to make it a lazy match.
I changed the part before the @ to \S, which means anything but whitespace.

You can see it online here on Rubular
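
A sketch of one way to take the single-pattern idea further in C#. This is a variant, not the expression above: in patterns of this shape, a greedy `.*` after the `@` can also consume the closing `>`, so the version below anchors the pattern and excludes angle brackets and whitespace from the address group. The pattern and the placeholder address are assumptions made for illustration.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Group 1: optional preceding text. Group 2: the address without < or >.
var pattern = new Regex(@"^\s*(?:(.*?)\s*<)?([^<>\s]+@[^<>\s]+)>?\s*$");

string[] inputs = { "  Sample Text <user@example.com>  ", "user@example.com" };
var results = new List<(string Name, string Email)>();

foreach (string input in inputs)
{
    Match m = pattern.Match(input);
    if (m.Success)
    {
        // An unmatched optional group yields an empty string for Value.
        results.Add((m.Groups[1].Value, m.Groups[2].Value));
    }
}

foreach (var r in results)
{
    Console.WriteLine($"name='{r.Name}', email='{r.Email}'");
}
```

Restricting the address group to characters other than `<`, `>`, and whitespace is what keeps the brackets out of the capture; it does not validate the address.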

answered May 8, 2011 at 22:14

