htmlcxx is a simple non-validating css1 and html parser for C++. See also: http://htmlcxx.sourceforge.net/