Re: Open source library for generating and parsing (x)html

by "AnonMail2005@[EMAIL PROTECTED]" <AnonMail2005@[EMAIL PROTECTED]> May 10, 2008 at 07:25 PM

{ Accepted as follow-up.  Further discussion of general tools for HTML
tidying would be off-topic unless there is some C++ content. -mod }

On May 10, 8:14 am, marlow.and...@[EMAIL PROTECTED] wrote:
> On 9 May, 08:25, Mitchel Haas <mh...@[EMAIL PROTECTED]> wrote:
>
> > Hello,
>
> > For anyone with a need to generate or parse (x)html, I'd like to
> > announce a relatively new lightweight library for generating xhtml and
> > parsing xhtml and html.
> > If you have any need of xhtml/html generation or parsing, I hope
> > you can find the library useful.
>
> Thanks for making this available. I'm sure many will find it useful.
> But I am not so sure about my case. My need is to parse HTML for use
> by a screen scraper. The trouble is, most web pages, including the
> ones I am scraping, have ill-formed HTML. How does your library cope
> with that? I eventually gave up trying to do this in C++ and used
> python instead. It has a package called BeautifulSoup which is
> designed specifically to cope with ill-formed HTML.
>
>
>
> > Thanks,
>
> > Mitchel Haas
>
> Regards,
>
> Andrew Marlow

Try http://www.w3.org/People/Raggett/tidy/ to make the HTML well formed.
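
For some C++ content, here is a minimal sketch of what "well formed" means in the narrow sense of tag nesting. It is not how Tidy works and it is not part of the library announced upthread; is_well_nested is a hypothetical helper written only for illustration. It assumes the input has no comments, CDATA sections, or DOCTYPE, and no '>' inside attribute values, and it compares tag names case-sensitively as XML does.

```cpp
#include <cctype>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical helper, for illustration only (not Tidy, not the announced
// library). Returns true if every opening tag is closed in the right order.
// Assumes no comments, CDATA, DOCTYPE, or '>' inside attribute values.
bool is_well_nested(const std::string& markup) {
    std::vector<std::string> open_tags;  // stack of currently open elements
    std::size_t pos = 0;
    while ((pos = markup.find('<', pos)) != std::string::npos) {
        const std::size_t end = markup.find('>', pos);
        if (end == std::string::npos) return false;  // tag never terminated
        const std::string body = markup.substr(pos + 1, end - pos - 1);
        pos = end + 1;
        if (body.empty()) return false;              // "<>" is not a tag

        const bool closing = body.front() == '/';      // e.g. </p>
        const bool self_closing = body.back() == '/';  // e.g. <br/>

        // The tag name is the leading run of alphanumeric characters.
        std::string name;
        for (std::size_t i = closing ? 1 : 0;
             i < body.size() &&
             std::isalnum(static_cast<unsigned char>(body[i]));
             ++i) {
            name += body[i];
        }
        if (name.empty()) return false;

        if (closing) {
            // A closing tag must match the most recently opened element.
            if (open_tags.empty() || open_tags.back() != name) return false;
            open_tags.pop_back();
        } else if (!self_closing) {
            open_tags.push_back(name);
        }
    }
    return open_tags.empty();  // anything left open is an error
}
```

Typical scraped markup such as "<p>one<p>two" or "<p><b>text</p></b>" fails this check, which is the kind of input a repair pass is meant to fix before it reaches a strict parser.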


-- 
      [ See http://www.gotw.ca/resources/clcm.htm for info about ]
      [ comp.lang.c++.moderated.    First time posters: Do this! ]
 




 5 Posts in Topic:
Open source library for generating and parsing (x)html
Mitchel Haas <mhaas@[E  2008-05-09 09:25:17 
Re: Open source library for generating and parsing (x)html
marlow.andrew@[EMAIL PROT  2008-05-10 06:14:38 
Re: Open source library for generating and parsing (x)html
Ian Collins <ian-news@  2008-05-10 19:26:36 
Re: Open source library for generating and parsing (x)html
"AnonMail2005@[EMAIL  2008-05-10 19:25:28 
Re: Open source library for generating and parsing (x)html
Mitchel Haas <mhaas@[E  2008-05-11 09:47:14 
