
#5
02-14-2008, 10:44 AM
manishkochar
Join Date: Jun 2007
My distro: Debian
Posts: 2

Quote:
Originally Posted by agn
Am not an expert in sed. Sed is a really weird tool, but the power it contains is awesome.

Regexes are not a good tool for parsing HTML/XML data. You should use an XML parser. Right tool for the right job.
Any links you would like to share?
I've tried everything listed on Freshmeat and SourceForge for "dmoz" parsing.
Sadly, none of them really work, and all are poorly documented.
Besides, I feel completely out of my depth when it comes to Perl.
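
For reference, here is a rough sketch of what the "use an XML parser" advice could look like without sed or Perl, using only Python's standard library. The sample data is made up, and the element names (ExternalPage, Title, Description) and the "about" attribute are assumptions about the dump layout, so check them against the real file before relying on this.

```python
import io
import xml.etree.ElementTree as ET

# Made-up sample input. The element and attribute names are assumptions,
# not copied from a real dump.
SAMPLE = b"""<?xml version="1.0" encoding="UTF-8"?>
<RDF>
  <ExternalPage about="http://example.com/">
    <Title>Example Site</Title>
    <Description>A placeholder entry.</Description>
  </ExternalPage>
  <ExternalPage about="http://example.org/">
    <Title>Another Site</Title>
    <Description>Second placeholder entry.</Description>
  </ExternalPage>
</RDF>
"""


def local_name(tag):
    """Strip an optional '{namespace}' prefix from an element tag."""
    return tag.rsplit("}", 1)[-1]


def iter_pages(stream):
    """Yield (url, title, description) for each ExternalPage element.

    iterparse reads the input incrementally instead of loading it all at once.
    """
    for _event, elem in ET.iterparse(stream, events=("end",)):
        if local_name(elem.tag) != "ExternalPage":
            continue
        # Collect the text of each direct child, keyed by its bare tag name.
        fields = {local_name(child.tag): (child.text or "").strip() for child in elem}
        yield elem.get("about"), fields.get("Title", ""), fields.get("Description", "")
        # Drop the element's children and text to limit memory use on big files.
        elem.clear()


pages = list(iter_pages(io.BytesIO(SAMPLE)))
for url, title, description in pages:
    print(url, "|", title, "|", description)
```

For a real dump you would pass an open file (opened in binary mode) to iter_pages instead of the in-memory sample. The tag matching ignores namespaces on purpose, so it should not matter whether the file declares them.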