Module sgmllib :: Class SGMLParser

_ClassType SGMLParser

markupbase.ParserBase --+
                        |
                       SGMLParser
Known Subclasses:

Instance Methods
 
__init__(self, verbose=0)
Initialize and reset this instance.
 
reset(self)
Reset this instance.
 
setnomoretags(self)
Enter literal mode (CDATA) till EOF.
 
setliteral(self, *args)
Enter literal mode (CDATA).
 
feed(self, data)
Feed some data to the parser.
 
close(self)
Handle the remaining data.
 
error(self, message)
 
goahead(self, end)
 
parse_pi(self, i)
 
get_starttag_text(self)
 
parse_starttag(self, i)
 
_convert_ref(self, match)
 
parse_endtag(self, i)
 
finish_shorttag(self, tag, data)
 
finish_starttag(self, tag, attrs)
 
finish_endtag(self, tag)
 
handle_starttag(self, tag, method, attrs)
 
handle_endtag(self, tag, method)
 
report_unbalanced(self, tag)
 
convert_charref(self, name)
Convert character reference, may be overridden.
 
convert_codepoint(self, codepoint)
 
handle_charref(self, name)
Handle character reference, no need to override.
 
convert_entityref(self, name)
Convert entity references.
 
handle_entityref(self, name)
Handle entity references, no need to override.
 
handle_data(self, data)
 
handle_comment(self, data)
 
handle_decl(self, decl)
 
handle_pi(self, data)
 
unknown_starttag(self, tag, attrs)
 
unknown_endtag(self, tag)
 
unknown_charref(self, ref)
 
unknown_entityref(self, ref)

Inherited from markupbase.ParserBase: getpos, parse_comment, parse_declaration, parse_marked_section, unknown_decl, updatepos

Class Variables
  entity_or_charref = re.compile(r'&(?:([a-zA-Z][-\.a-zA-Z0-9]*)...
  _decl_otherchars = '='
  entitydefs = {'amp': '&', 'apos': '\'', 'gt': '>', 'lt': '<', ...
Method Details

__init__(self, verbose=0)
(Constructor)

 

Initialize and reset this instance.

Overrides: markupbase.ParserBase.__init__

reset(self)

 

Reset this instance. Loses all unprocessed data.

Overrides: markupbase.ParserBase.reset

setnomoretags(self)

 

Enter literal mode (CDATA) till EOF.

Intended for derived classes only.

setliteral(self, *args)

 

Enter literal mode (CDATA).

Intended for derived classes only.

feed(self, data)

 
Feed some data to the parser.

        Call this as often as you want, with as little or as much text
        as you want (may include '\n').  (This just saves the text,
        all the processing is done by goahead().)
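
The incremental use of feed() and close() described above can be sketched as follows. This is a minimal illustration, not part of the module: TextCollector and collect_text are hypothetical names, and the import is guarded because sgmllib is only available in Python 2, so the helper returns None where the module cannot be imported.

```python
try:
    import sgmllib  # Python 2 only; the module was removed in Python 3
except ImportError:
    sgmllib = None


def collect_text(chunks):
    """Feed each chunk to a parser and return the character data it saw.

    Returns None when sgmllib cannot be imported (hypothetical helper).
    """
    if sgmllib is None:
        return None

    class TextCollector(sgmllib.SGMLParser):  # hypothetical subclass
        def reset(self):
            # __init__ calls reset(), so per-parse state is set up here.
            sgmllib.SGMLParser.reset(self)
            self.pieces = []

        def handle_data(self, data):
            self.pieces.append(data)

    parser = TextCollector()
    for chunk in chunks:
        parser.feed(chunk)  # may be called as often as needed
    parser.close()          # handle the remaining data
    return "".join(parser.pieces)


# The markup is split mid-element to show that chunk boundaries are arbitrary.
text = collect_text(["<p>Hello, ", "world</p>"])
```

State is initialised in reset() rather than __init__ so that calling reset() on an existing instance also clears the collected pieces.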
        

error(self, message)

 
Overrides: markupbase.ParserBase.error

convert_entityref(self, name)

 

Convert entity references.

As an alternative to overriding this method, one can tailor the results by setting up the self.entitydefs mapping appropriately.
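
A stand-alone sketch of the mapping-based approach, which does not import sgmllib. It assumes that conversion is a plain lookup in the mapping and that unknown names yield None. The function below is a hypothetical stand-in for the method, and 'nbsp' is an illustrative extra entry.

```python
# Default mapping, copied from the entitydefs value listed in this document.
entitydefs = {'amp': '&', 'apos': '\'', 'gt': '>', 'lt': '<', 'quot': '"'}


def convert_entityref(name, table):
    """Look up an entity name in `table`; return None if it is unknown.

    Stand-in for the method, assuming a plain dictionary lookup.
    """
    return table.get(name)


# Tailor the results by extending a copy of the mapping instead of
# overriding the conversion logic.
custom_defs = dict(entitydefs)
custom_defs['nbsp'] = ' '  # hypothetical extra entity

known = convert_entityref('amp', entitydefs)
missing = convert_entityref('nbsp', entitydefs)
added = convert_entityref('nbsp', custom_defs)
```

Extending a copy keeps the shared class-level mapping unchanged for other parser instances.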


Class Variable Details

entity_or_charref

Value:
re.compile(r'&(?:([a-zA-Z][-\.a-zA-Z0-9]*)|#([0-9]+))(;?)')
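
To illustrate what this pattern captures, the sketch below compiles the same expression with the standard re module and reports its three groups: the entity name, the decimal character number, and the optional trailing semicolon. The classify helper is a hypothetical name used only for this example.

```python
import re

# Same pattern as the class variable above.
entity_or_charref = re.compile(r'&(?:([a-zA-Z][-\.a-zA-Z0-9]*)|#([0-9]+))(;?)')


def classify(text):
    """Return a (kind, value, has_semicolon) tuple for each reference found."""
    found = []
    for match in entity_or_charref.finditer(text):
        name, number, semicolon = match.groups()
        if name is not None:
            found.append(('entity', name, semicolon == ';'))
        else:
            found.append(('charref', number, semicolon == ';'))
    return found


refs = classify('a &lt; b &#38 c &amp;')
# refs == [('entity', 'lt', True), ('charref', '38', False), ('entity', 'amp', True)]
```

Because the final group is `(;?)`, a reference without its terminating semicolon still matches, as `&#38` does here.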

entitydefs

Value:
{'amp': '&', 'apos': '\'', 'gt': '>', 'lt': '<', 'quot': '"'}