I don’t need this, but it looks interesting: HTML Screen Scraping and sgrep, a structured grep. At a quick glance, it looks like a job that XSLT, or just XPath, would be good for.

HTML screen scraping and sgrep
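
To make the XPath idea concrete, here is a minimal sketch using Python's standard-library `xml.etree.ElementTree`, which supports a limited XPath subset. The sample page, the `wishlist` markup, and the `extract_links` helper are all made up for illustration, and the approach assumes the input is well-formed XML, which real-world HTML often is not.

```python
import xml.etree.ElementTree as ET

# Hypothetical, well-formed sample page (an assumption for this sketch;
# real pages are rarely this tidy).
PAGE = """<html>
  <body>
    <ul id="wishlist">
      <li class="item"><a href="/book/1">First Book</a></li>
      <li class="item"><a href="/book/2">Second Book</a></li>
      <li class="ad"><a href="/promo">Promotion</a></li>
    </ul>
  </body>
</html>"""


def extract_links(markup):
    """Return (text, href) pairs for links inside <li class="item"> elements."""
    root = ET.fromstring(markup)
    # ElementTree understands a subset of XPath, including
    # attribute predicates such as [@class='item'].
    return [(a.text, a.get("href"))
            for a in root.findall(".//li[@class='item']/a")]


if __name__ == "__main__":
    for text, href in extract_links(PAGE):
        print(text, href)
```

The path expression does the selecting, so the scraping logic stays a one-liner as long as the parser accepts the page in the first place.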

Comments

I tried it for my Amazon wishlist optimizer, but Amazon's HTML makes most XML parsers barf. The solution that worked for me is BeautifulSoup.

Sometimes, XML is a hammer looking for a nail-shaped solution.
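
The failure described in this comment can be sketched roughly as follows. BeautifulSoup is a third-party package, so to stay self-contained this example uses the standard library's lenient `html.parser` as a stand-in for it. The `SOUP` markup is invented tag soup, not Amazon's actual HTML, and `strict_parse_ok`, `LinkCollector`, and `collect_links` are hypothetical names.

```python
import xml.etree.ElementTree as ET
from html.parser import HTMLParser

# Invented tag soup: bare ampersand, unclosed <p>, <br>, and <li>,
# and no closing </html>.
SOUP = ('<html><body><p>Books & more<br><ul>'
        '<li><a href="/book/1">First Book</a>'
        '<li><a href="/book/2">Second Book</a>'
        '</ul></body>')


def strict_parse_ok(markup):
    """Return True only if a strict XML parser accepts the markup."""
    try:
        ET.fromstring(markup)
        return True
    except ET.ParseError:
        return False


class LinkCollector(HTMLParser):
    """Lenient parser that collects (text, href) pairs from <a> tags."""

    def __init__(self):
        super().__init__()
        self.links = []
        self._href = None   # href of the <a> currently open, if any
        self._text = []     # text chunks seen inside that <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._href is not None:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._href is not None:
            self.links.append(("".join(self._text).strip(), self._href))
            self._href = None


def collect_links(markup):
    parser = LinkCollector()
    parser.feed(markup)
    parser.close()
    return parser.links


if __name__ == "__main__":
    print("strict XML parser accepts it:", strict_parse_ok(SOUP))
    print("lenient parser found:", collect_links(SOUP))
```

The strict parser rejects the whole document at the first well-formedness error, while the lenient one keeps going and still recovers the links, which is roughly the trade-off that makes a forgiving parser the practical choice for scraped pages.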
