Module HTMLParser

A parser for HTML and XHTML.

Classes
HTMLParseError
Exception raised for all parse errors.
HTMLParser
Find tags and other markup and call handler functions.
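
The handler-function model can be illustrated with a small subclass. This is a minimal sketch, not part of the module: the class name TagCollector and its events list are hypothetical, and the fallback import assumes the same class is available as html.parser.HTMLParser on Python 3.

```python
try:
    from HTMLParser import HTMLParser  # Python 2 module documented here
except ImportError:
    from html.parser import HTMLParser  # Python 3 location of the class


class TagCollector(HTMLParser):
    """Hypothetical subclass that records every event the parser reports."""

    def __init__(self):
        # Explicit base-class call; works on both Python 2 and Python 3.
        HTMLParser.__init__(self)
        self.events = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs
        self.events.append(("start", tag, attrs))

    def handle_endtag(self, tag):
        self.events.append(("end", tag))

    def handle_data(self, data):
        self.events.append(("data", data))


collector = TagCollector()
collector.feed('<a href="x.html">link</a>')
collector.close()
```

After close(), collector.events holds one start event, one data event, and one end event, in document order.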
Variables
  interesting_normal = re.compile(r'[&<]')
  interesting_cdata = re.compile(r'<(/|\Z)')
  incomplete = re.compile(r'&[a-zA-Z#]')
  entityref = re.compile(r'&([a-zA-Z][-\.a-zA-Z0-9]*)[^a-zA-Z0-9]')
  charref = re.compile(r'&#(?:[0-9]+|[xX][0-9a-fA-F]+)[^0-9a-fA-...
  starttagopen = re.compile(r'<[a-zA-Z]')
  piclose = re.compile(r'>')
  commentclose = re.compile(r'--\s*>')
  tagfind = re.compile(r'[a-zA-Z][-\.a-zA-Z0-9:_]*')
  attrfind = re.compile(r'\s*([a-zA-Z_][-\.:a-zA-Z_0-9]*)(\s*=\s...
  locatestarttagend = re.compile(r'(?x)<[a-zA-Z][-\.a-zA-Z0-9:_]...
  endendtag = re.compile(r'>')
  endtagfind = re.compile(r'</\s*([a-zA-Z][-\.a-zA-Z0-9:_]*)\s*>')
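
A few of the shorter patterns can be tried directly with the re module. The sketch below copies four patterns from the listing and applies them to made-up sample strings; the result variable names are illustrative only.

```python
import re

# Patterns copied verbatim from the variable listing.
incomplete = re.compile(r'&[a-zA-Z#]')
entityref = re.compile(r'&([a-zA-Z][-\.a-zA-Z0-9]*)[^a-zA-Z0-9]')
commentclose = re.compile(r'--\s*>')
endtagfind = re.compile(r'</\s*([a-zA-Z][-\.a-zA-Z0-9:_]*)\s*>')

# entityref captures the entity name, but only when a terminating
# character follows it.
entity_name = entityref.match('&amp; more').group(1)

# Without a terminator entityref does not match, while incomplete does.
unterminated_entity = entityref.match('&amp')
looks_incomplete = incomplete.match('&amp') is not None

# endtagfind tolerates whitespace around the tag name.
end_tag_name = endtagfind.match('</ div >').group(1)

# commentclose allows whitespace between '--' and '>'.
comment_end = commentclose.search('<!-- note -- >').start()
```

Because entityref requires a character after the name, input that ends in the middle of a reference matches incomplete but not entityref.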

Imports: markupbase, re


Variables Details

charref

Value:
re.compile(r'&#(?:[0-9]+|[xX][0-9a-fA-F]+)[^0-9a-fA-F]')
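
As an illustration of what this pattern accepts, the sketch below decodes a decimal or hexadecimal character reference. The helper decode_charref is hypothetical and is not part of the module. It uses chr, which covers all code points on Python 3 but only values below 256 on Python 2.

```python
import re

# Pattern copied from the Value line above.
charref = re.compile(r'&#(?:[0-9]+|[xX][0-9a-fA-F]+)[^0-9a-fA-F]')


def decode_charref(text):
    """Hypothetical helper: decode a reference at the start of text.

    Returns the decoded character, or None when the pattern does not match.
    """
    m = charref.match(text)
    if m is None:
        return None
    body = m.group()[2:-1]  # drop '&#' and the terminating character
    if body[:1] in ('x', 'X'):
        code = int(body[1:], 16)  # hexadecimal form, e.g. 'x41'
    else:
        code = int(body)          # decimal form, e.g. '65'
    return chr(code)


decimal = decode_charref('&#65;')
hexadecimal = decode_charref('&#x41;')
unterminated = decode_charref('&#65')
```

The last call returns None because the pattern requires one non-hexadecimal character after the digits.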

attrfind

Value:
re.compile(r'\s*([a-zA-Z_][-\.:a-zA-Z_0-9]*)(\s*=\s*(\'[^\']*\'|"[^"]*"|[-a-zA-Z0-9\./,:;\+\*%\?!&\$\(\)_#=~@]*))?')

locatestarttagend

Value:
re.compile(r'(?x)<[a-zA-Z][-\.a-zA-Z0-9:_]*(?:\s+(?:[a-zA-Z_][-\.:a-zA-Z0-9_]*(?:\s*=\s*(?:\'[^\']*\'|"[^"]*"|[^\'">\s]+))?))*\s*')
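
The sketch below shows one way tagfind, attrfind, and locatestarttagend could be combined to split a start tag into its name and attributes. It is an illustration under stated assumptions, not the module's own implementation: the helper split_start_tag is hypothetical, and it handles only a tag that begins at the first character of the input and ends with '>'.

```python
import re

# Patterns copied from this page; long values are split across adjacent
# raw-string literals, which Python concatenates into a single pattern.
tagfind = re.compile(r'[a-zA-Z][-\.a-zA-Z0-9:_]*')
attrfind = re.compile(
    r'\s*([a-zA-Z_][-\.:a-zA-Z_0-9]*)(\s*=\s*'
    r'(\'[^\']*\'|"[^"]*"|[-a-zA-Z0-9\./,:;\+\*%\?!&\$\(\)_#=~@]*))?')
locatestarttagend = re.compile(
    r'(?x)<[a-zA-Z][-\.a-zA-Z0-9:_]*(?:\s+(?:[a-zA-Z_][-\.:a-zA-Z0-9_]*'
    r'(?:\s*=\s*(?:\'[^\']*\'|"[^"]*"|[^\'">\s]+))?))*\s*')


def split_start_tag(text):
    """Hypothetical helper: return (tag, attrs) for a start tag at text[0].

    Returns None when the tag is missing or not closed by '>'.
    """
    extent = locatestarttagend.match(text)
    if extent is None or text[extent.end():extent.end() + 1] != '>':
        return None
    name = tagfind.match(text, 1)  # the tag name follows the '<'
    tag = name.group().lower()
    attrs = []
    pos = name.end()
    while pos < extent.end():
        m = attrfind.match(text, pos)
        if m is None:
            break
        attr, _, value = m.group(1, 2, 3)
        # Strip matching quotes; value stays None for a bare attribute.
        if value is not None and value[:1] in ('"', "'") \
                and value[:1] == value[-1:]:
            value = value[1:-1]
        attrs.append((attr.lower(), value))
        pos = m.end()
    return tag, attrs


result = split_start_tag('<A HREF="x.html" class=main disabled>link</a>')
truncated = split_start_tag('<a href="x')
```

Here locatestarttagend only measures how far the tag extends. The per-attribute work is left to attrfind, applied repeatedly from the end of the tag name.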