Negative Space: XHTML
- Automatically distributing images within XHTML
- One of the nice things about XHTML is that the tools for reading XML have finally matured. So if, for example, I want to have a series of images automatically placed within my web page, I can parse the XHTML of the content to ensure that nothing is broken.
- Caching DTDs using lxml and etree
- It’s not just a good idea to cache public DTDs when validating your web pages with an automated script. If you don’t do it, you might get blocked from w3.org.
- HTML & XHTML: The Definitive Guide
-
I’ve never found an HTML book that I’ve considered great. This one is probably the best. It covers just about everything in HTML, as well as a little bit of Cascading Style Sheets.
More Information
- RegEx match open tags except XHTML self-contained tags
-
“You can't parse [X]HTML with regex. Because HTML can't be parsed by regex. Regex is not a tool that can be used to correctly parse HTML.”
- W3C's Excessive DTD Traffic
-
“A while ago we put a system in place to monitor our servers for abusive request patterns and send 503 Service Unavailable responses with custom text depending on the nature of the abuse. Our hope was that the authors of misbehaving software and the administrators of sites who deployed it would notice these errors and make the necessary fixes to the software responsible.”