I am still in the pondering phase of how to do my own publishing software. Since a lot of smart people use python (quack!) I am considering to start there; it seems like a neat addition to my skill set. (First, you can never know too many script languages and, second, I also need some counter balance to my latest addition of C# (which, since you ask, is an Ok language, but there is nothing exciting about it (only the HTTP stack is exciting in the hammer-to-your-head sense.))).
Where was I?
So I read some up on python during the last days, bought the cookbook, parsed some XML and had a look at feedparser by Mark Pilgrim. Seems a nice and well tested piece of code and the documentation at his site is excellent! Kudos.
And then as a warm-up task I examine the html5lib by Anne van Kesteren a bit and ponder some ideas around an API on top of it. I was thinking that something SAX-like is excellent (i did such in the past), but Anne wrote that this is not sufficient. Might see in the code what he really means.
So much code, so little time.
Technorati Tags: html5lib, feedparser, python
