Quote:
Originally Posted by agn
 I'm not an expert in sed. Sed is a really weird tool, but the power it contains is awesome.
Regexes are not a good tool for parsing HTML/XML data. You should use an XML parser. Right tool for the right job.
Any links you would like to share?
I've tried everything listed on Freshmeat / SourceForge for "dmoz" parsing.
Sadly, none of them really work, and all are poorly documented.
Besides, I feel like a retard when it comes to Perl.
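
For reference, the parser-based approach agn suggests could look roughly like the sketch below, using only Python's standard library rather than Perl. It streams the file with xml.etree.ElementTree.iterparse, so a large dump never has to fit in memory. The element names (ExternalPage, Title, topic), the "about" attribute, and the namespace URIs in SAMPLE are assumptions about how the dmoz RDF dump is laid out, and iter_pages and local are made-up helper names; check them against the real file before relying on this.

```python
import io
import xml.etree.ElementTree as ET

# Tiny stand-in for the dmoz content dump. The layout and namespace URIs
# are assumptions for illustration, not copied from a real dump.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<RDF xmlns:r="http://www.w3.org/TR/RDF/"
     xmlns:d="http://purl.org/dc/elements/1.0/"
     xmlns="http://dmoz.org/rdf/">
  <ExternalPage about="http://example.com/">
    <d:Title>Example Site</d:Title>
    <d:Description>A made-up entry.</d:Description>
    <topic>Top/Computers/Example</topic>
  </ExternalPage>
</RDF>"""


def local(tag):
    """Drop the '{namespace}' prefix ElementTree puts on tag names."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(source):
    """Yield (url, title, topic) for each ExternalPage element in source."""
    for _event, elem in ET.iterparse(source, events=("end",)):
        if local(elem.tag) != "ExternalPage":
            continue
        # Collect the text of each direct child, keyed by its local name.
        fields = {local(child.tag): (child.text or "").strip() for child in elem}
        yield elem.get("about"), fields.get("Title", ""), fields.get("topic", "")
        elem.clear()  # release the children once the record has been consumed


pages = list(iter_pages(io.BytesIO(SAMPLE)))
for url, title, topic in pages:
    print(url, title, topic, sep="\t")
```

To run it against a real dump, pass an open binary file (or a filename) to iter_pages instead of the io.BytesIO wrapper. Matching on local names sidesteps the namespace prefixes, which is the part a sed or regex approach tends to trip over.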