On 9 May, 08:25, Mitchel Haas <mh...@[EMAIL PROTECTED]
> wrote:
> Hello,
>
> For anyone with a need to generate or parse (x)html, I'd like to
> announce a relatively new lightweight library for generating xhtml and
> parsing xhtml and html.
> If you have any need of xhtml/
> html generation or parsing, I hope you can find the library useful.
Thanks for making this available. I'm sure many will find it useful.
But I am not so sure about my case. My need is to parse HTML for use
by a screen scraper. The trouble is, most web pages, including the
ones I am scraping, have ill-formed HTML. How does your library cope
with that? I eventually gave up trying to do this in C++ and used
python instead. It has a package called BeautifulSoup which is
designed specifically to cope with ill-formed HTML.
>
> Thanks,
>
> Mitchel Haas
Regards,
Andrew Marlow
--
[ See http://www.gotw.ca/resources/clcm.htm
for info about ]
[ comp.lang.c++.moderated. First time posters: Do this! ]


|