Extract data PHP string

Question

I have used file_get_contents() to basically get the source code of a site into a single string variable.

The source contains many rows that looks like this: <td align="center"><a href="somewebsite.com/something">12345</a></td>

(and a lot of rows that don't look like that). I want to extract all the idnumbers (12345 above) and put them in an array. How can I do that? I assume I want to use some kind of regular expressions and then use the preg_match_all() function, but I'm not sure how...

Oh good Google, not another one. stackoverflow.com/questions/1732348/… — Zirak
– Zirak, Commented Apr 20, 2011 at 19:45

Strong Like Bull · Accepted Answer · 2011-04-20 19:49:28Z

4

Don't mess with regular expressions. Get the variable and let a DOM library do the mundane tasks for you. Take a look at: http://sourceforge.net/projects/simplehtmldom/

Then you can traverse your HTMl like a tree and extract stuff. If you really want to get funky, read up on xPath.

answered Apr 20, 2011 at 19:49

Strong Like Bull

11.3k37 gold badges102 silver badges169 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

SIFE · Accepted Answer · 2011-04-20 20:01:58Z

1

Try this:

preg_match('/>[0-9]+<\/a><\/td>/', $str, $matches);
for($i = 0;$i<sizeof($matches);$i++)
 $values[] = $matches[$i];

answered Apr 20, 2011 at 20:01

SIFE

5,7437 gold badges34 silver badges48 bronze badges

1 Comment

faximan Over a year ago

Thanks! This gave me a basic idea, I went with preg_match_all('/[0-9]+<\/a><\/td>/', $html, $matches); return $matches[0]; Works perfetly!

Collectives™ on Stack Overflow

Extract data PHP string

2 Answers 2

Comments

1 Comment

Linked

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

1 Comment

Linked

Related