unixdev.net


Switch to SpeakEasy.net DSL

The Modular Manual Browser

Home Page
Manual: (Debian-5.0)
Page:
Section:
Apropos / Subsearch:
optional field

HTML::Tree(3pm)       User Contributed Perl Documentation      HTML::Tree(3pm)



NAME
       HTML::Tree - overview of HTML::TreeBuilder et al

VERSION
       3.23

SYNOPSIS
           use HTML::TreeBuilder;
           my $tree = HTML::TreeBuilder->new();
           $tree->parse_file($filename);

               # Then do something with the tree, using HTML::Element
               # methods -- for example:

           $tree->dump

               # Finally:

           $tree->delete;

DESCRIPTION
       HTML-Tree is a suite of Perl modules for making parse trees out of HTML
       source.  It consists of mainly two modules, whose documentation you
       should refer to: HTML::TreeBuilder and HTML::Element.

       HTML::TreeBuilder is the module that builds the parse trees.  (It uses
       HTML::Parser to do the work of breaking the HTML up into tokens.)

       The tree that TreeBuilder builds for you is made up of objects of the
       class HTML::Element.

       If you find that you do not properly understand the documentation for
       HTML::TreeBuilder and HTML::Element, it may be because you are unfamil-
       iar with tree-shaped data structures, or with object-oriented modules
       in general. Sean Burke has written some articles for The Perl Journal
       ("www.tpj.com") that seek to provide that background.  The full text of
       those articles is contained in this distribution, as:

       HTML::Tree::AboutObjects
           "User's View of Object-Oriented Modules" from TPJ17.

       HTML::Tree::AboutTrees
           "Trees" from TPJ18

       HTML::Tree::Scanning
           "Scanning HTML" from TPJ19

       Readers already familiar with object-oriented modules and tree-shaped
       data structures should read just the last article.  Readers without
       that background should read the first, then the second, and then the
       third.

SUPPORT
       You can find documentation for this module with the perldoc command.

           perldoc HTML::Tree

           You can also look for information at:

       * AnnoCPAN: Annotated CPAN documentation
           <http://annocpan.org/dist/HTML-Tree>;

       * CPAN Ratings
           <http://cpanratings.perl.org/d/HTML-Tree>;

       * RT: CPAN's request tracker
           <http://rt.cpan.org/NoAuth/Bugs.html?Dist=HTML-Tree>;

       * Search CPAN
           <http://search.cpan.org/dist/HTML-Tree>;

SEE ALSO
       HTML::TreeBuilder, HTML::Element, HTML::Tagset, HTML::Parser,
       HTML::DOMbo

       The book Perl &amp; LWP by Sean M. Burke published by O'Reilly and Asso-
       ciates, 2002.  ISBN: 0-596-00178-9

       It has several chapters to do with HTML processing in general, and
       HTML-Tree specifically.  There's more info at:

           http://www.oreilly.com/catalog/perllwp/

           http://www.amazon.com/exec/obidos/ASIN/0596001789

SOURCE REPOSITORY
       HTML::Tree is maintained in Subversion hosted at perl.org.

           http://svn.perl.org/modules/HTML-Tree

       The latest development work is always at:

           http://svn.perl.org/modules/HTML-Tree/trunk

       Any patches sent should be diffed against this repository.

ACKNOWLEDGEMENTS
       Thanks to Gisle Aas, Sean Burke and Andy Lester for their original
       work.

       Thanks to Chicago Perl Mongers (http://chicago.pm.org) for their
       patches submitted to HTML::Tree as part of the Phalanx project
       (http://qa.perl.org/phalanx).

       Thanks to the following people for additional patches and documenta-
       tion: Terrence Brannon, Gordon Lack, Chris Madsen and Ricardo Signes.

AUTHOR
       Original HTML-Tree author Gisle Aas.  Handed off to Sean M. Burke.  and
       Andy Lester.  Currently maintained by Pete Krawczyk "<petekATcpan.org>".

COPYRIGHT
       Copyright 1995-1998 Gisle Aas; 1999-2004 Sean M. Burke; 2005 Andy
       Lester; 2006 Pete Krawczyk.  (Except the articles contained in
       HTML::Tree::AboutObjects, HTML::Tree::AboutTrees, and HTML::Tree::Scan-
       ning, which are all copyright 2000 The Perl Journal.)

       Except for those three TPJ articles, the whole HTML-Tree distribution,
       of which this file is a part, is free software; you can redistribute it
       and/or modify it under the same terms as Perl itself.

       Those three TPJ articles may be distributed under the same terms as
       Perl itself.

       The programs in this library are distributed in the hope that they will
       be useful, but without any warranty; without even the implied warranty
       of merchantability or fitness for a particular purpose.



perl v5.8.8                       2006-11-15                   HTML::Tree(3pm)