HTML::Parse man page on BSDOS

Man page or keyword search:  
man Server   6284 pages
apropos Keyword Search (all sections)
Output format
BSDOS logo
[printable version]



lib::HTML::ParsUser Contributed Perl Documentalib::HTML::Parse(3)

NAME
       parse_html - Parse HTML text

       parse_htmlfile - Parse HTML text from file

SYNOPSIS
	use HTML::Parse;
	$h = parse_htmlfile("test.html");
	print $h->dump;
	$h = parse_html("<p>Some more <i>italic</i> text", $h);
	$h->delete;

	print parse_htmlfile("index.html")->as_HTML;  # tidy up markup in a file

DESCRIPTION
       Disclaimer: This module is provided for backwards
       compatibility with earlier versions of this library.  New
       code will probably prefer to use the HTML::Parser and
       HTML::TreeBuilder modules directly.

       The HTML::Parse module provides functions to parse HTML
       documents.  There are two functions exported by this
       module:

       parse_html($html, [$obj])
	   This function is really just a synonym for
	   $obj->parse($html) and $obj is assumed to be a
	   subclass of HTML::Parser.  Refer to the HTML::Parser
	   manpage for more documentation.

	   The $obj will default to a internally created
	   HTML::TreeBuilder object.  This class implements a
	   parser that builds (and is) a HTML syntax tree with
	   HTML::Element objects as nodes.

	   The return value from parse_html() is $obj.

       parse_htmlfile($file, [$obj])
	   Same as parse_html(), but obtains HTML text from the
	   named file.

	   Returns undef if the file could not be opened, or $obj
	   otherwise.

       When a HTML::TreeBuilder object is created, the following
       variables control how parsing takes place:

       $HTML::Parse::IMPLICIT_TAGS
	   Setting this variable to true will instruct the parser
	   to try to deduce implicit elements and implicit end
	   tags.  If this variable is false you get a parse tree
	   that just reflects the text as it stands.  Might be
	   useful for quick & dirty parsing.  Default is true.

24/Aug/1997	       perl 5.005, patch 03			1

lib::HTML::ParsUser Contributed Perl Documentalib::HTML::Parse(3)

	   Implicit elements have the implicit() attribute set.

       $HTML::Parse::IGNORE_UNKNOWN
	   This variable contols whether unknow tags should be
	   represented as elements in the parse tree.  Default is
	   true.

       $HTML::Parse::IGNORE_TEXT
	   Do not represent the text content of elements.  This
	   saves space if all you want is to examine the
	   structure of the document.  Default is false.

       $HTML::Parse::WARN
	   Call warn() with an apropriate message for syntax
	   errors.  Default is false.

SEE ALSO
       the HTML::Parser manpage, the HTML::TreeBuilder manpage,
       the HTML::Element manpage

COPYRIGHT
       Copyright 1995-1996 Gisle Aas. All rights reserved.

       This library is free software; you can redistribute it
       and/or modify it under the same terms as Perl itself.

AUTHOR
       Gisle Aas <aas@sn.no>

24/Aug/1997	       perl 5.005, patch 03			2

[top]
                             _         _         _ 
                            | |       | |       | |     
                            | |       | |       | |     
                         __ | | __ __ | | __ __ | | __  
                         \ \| |/ / \ \| |/ / \ \| |/ /  
                          \ \ / /   \ \ / /   \ \ / /   
                           \   /     \   /     \   /    
                            \_/       \_/       \_/ 
More information is available in HTML format for server BSDOS

List of man pages available for BSDOS

Copyright (c) for man pages and the logo by the respective OS vendor.

For those who want to learn more, the polarhome community provides shell access and support.

[legal] [privacy] [GNU] [policy] [cookies] [netiquette] [sponsors] [FAQ]
Tweet
Polarhome, production since 1999.
Member of Polarhome portal.
Based on Fawad Halim's script.
....................................................................
Vote for polarhome
Free Shell Accounts :: the biggest list on the net