Module unicodedata

This module provides access to the Unicode Character Database, which defines character properties for all Unicode characters. The data in this database is based on version 4.1.0 of the UnicodeData.txt file, which is publicly available from ftp://ftp.unicode.org/.

The module uses the same names and symbols as defined by the UnicodeData File Format 4.1.0 (see http://www.unicode.org/Public/4.1.0/ucd/UCD.html).
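As a rough illustration of the kind of property lookup the module offers, the sketch below gathers a few properties for one character. It assumes a Python 3 interpreter (under the Python 2 release this page describes, the literals would need a u prefix), and describe is a hypothetical helper, not part of the module.

```python
import unicodedata

def describe(ch):
    """Hypothetical helper: collect a few database properties for one character."""
    return {
        "name": unicodedata.name(ch, "<unnamed>"),
        "category": unicodedata.category(ch),
        "east_asian_width": unicodedata.east_asian_width(ch),
    }

info_latin = describe("A")       # Latin capital letter: category 'Lu', width 'Na'
info_cjk = describe("\u4e2d")    # CJK ideograph: category 'Lo', width 'W'
```

The two-letter category codes and the width codes are the symbols defined by the Unicode Character Database itself, so they can be compared against the UCD documentation directly.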

Classes
UCD
Functions

bidirectional(unichr)
Returns the bidirectional category assigned to the Unicode character unichr as a string.

category(unichr)
Returns the general category assigned to the Unicode character unichr as a string.

combining(unichr)
Returns the canonical combining class assigned to the Unicode character unichr as an integer.

decimal(unichr, default=...)
Returns the decimal value assigned to the Unicode character unichr as an integer.

decomposition(unichr)
Returns the character decomposition mapping assigned to the Unicode character unichr as a string.

digit(unichr, default=...)
Returns the digit value assigned to the Unicode character unichr as an integer.

east_asian_width(unichr)
Returns the East Asian width assigned to the Unicode character unichr as a string.

lookup(name)
Look up character by name.

mirrored(unichr)
Returns the mirrored property assigned to the Unicode character unichr as an integer.

name(unichr, default=...)
Returns the name assigned to the Unicode character unichr as a string.

normalize(form, unistr)
Return the normal form 'form' for the Unicode string unistr.

numeric(unichr, default=...)
Returns the numeric value assigned to the Unicode character unichr as a float.
Variables
  ucd_3_2_0 = <unicodedata.UCD object at 0x40198980>
  ucnhash_CAPI = <PyCObject object at 0x40198998>
  unidata_version = '4.1.0'
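
The variables can be inspected as in the sketch below, which assumes Python 3. The value of unidata_version depends on the interpreter build, so it will generally differ from the '4.1.0' shown here. The ucd_3_2_0 object offers the same functions as the module, backed by the Unicode 3.2.0 database.

```python
import unicodedata

# Version string of the main database; varies with the interpreter build.
current_version = unicodedata.unidata_version

# The UCD instance is bound to the older Unicode 3.2.0 data.
legacy_version = unicodedata.ucd_3_2_0.unidata_version
legacy_category = unicodedata.ucd_3_2_0.category("A")   # 'Lu'
```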
Function Details

bidirectional(unichr)

 

Returns the bidirectional category assigned to the Unicode character unichr as a string. If no such value is defined, an empty string is returned.
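
A brief sketch of typical return values, assuming Python 3 string literals. The variable names are illustrative only.

```python
import unicodedata

latin = unicodedata.bidirectional("A")         # 'L'  (left-to-right)
arabic = unicodedata.bidirectional("\u0627")   # 'AL' (ARABIC LETTER ALEF)
number = unicodedata.bidirectional("7")        # 'EN' (European number)
```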

combining(unichr)

 

Returns the canonical combining class assigned to the Unicode character unichr as an integer. Returns 0 if no combining class is defined.
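
One common use of the combining class is removing accents. The sketch below assumes Python 3. strip_marks is a hypothetical helper that decomposes the text and drops every character with a non-zero combining class.

```python
import unicodedata

acute = unicodedata.combining("\u0301")   # COMBINING ACUTE ACCENT -> 230
plain = unicodedata.combining("a")        # ordinary letter -> 0

def strip_marks(text):
    """Hypothetical helper: drop combining marks after canonical decomposition."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if unicodedata.combining(ch) == 0)

stripped = strip_marks("caf\u00e9")       # 'cafe'
```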

decimal(unichr, default=...)

 

Returns the decimal value assigned to the Unicode character unichr as an integer. If no such value is defined, default is returned; if default is not given, ValueError is raised.
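
The two failure modes can be seen side by side in the following sketch, which assumes Python 3. The default is passed positionally.

```python
import unicodedata

ascii_nine = unicodedata.decimal("9")          # 9
arabic_three = unicodedata.decimal("\u0663")   # ARABIC-INDIC DIGIT THREE -> 3
fallback = unicodedata.decimal("x", -1)        # no decimal value, default returned

try:
    unicodedata.decimal("x")                   # no default supplied
    raised = False
except ValueError:
    raised = True
```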

decomposition(unichr)

 

Returns the character decomposition mapping assigned to the Unicode character unichr as a string. An empty string is returned if no such mapping is defined.
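
The mapping is returned as space-separated hexadecimal code points, optionally preceded by a formatting tag such as <fraction>. The sketch below assumes Python 3. decomposition_chars is a hypothetical helper that converts the fields back into characters.

```python
import unicodedata

e_acute = unicodedata.decomposition("\u00e9")    # '0065 0301'
one_half = unicodedata.decomposition("\u00bd")   # '<fraction> 0031 2044 0032'
no_mapping = unicodedata.decomposition("a")      # ''

def decomposition_chars(ch):
    """Hypothetical helper: map the hex fields to characters, skipping any <tag>."""
    fields = unicodedata.decomposition(ch).split()
    return [chr(int(field, 16)) for field in fields if not field.startswith("<")]

parts = decomposition_chars("\u00e9")            # ['e', '\u0301']
```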

digit(unichr, default=...)

 

Returns the digit value assigned to the Unicode character unichr as an integer. If no such value is defined, default is returned; if default is not given, ValueError is raised.
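
Some characters carry a digit value without a decimal value. SUPERSCRIPT TWO is one such case, as this Python 3 sketch suggests.

```python
import unicodedata

superscript_two = "\u00b2"

as_digit = unicodedata.digit(superscript_two)            # 2
as_decimal = unicodedata.decimal(superscript_two, None)  # None: no decimal value defined
```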

lookup(name)

 

Look up character by name. If a character with the given name is found, return the corresponding Unicode character. If not found, KeyError is raised.
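
Callers that prefer a fallback value to an exception can wrap the call. The sketch assumes Python 3, and safe_lookup is a hypothetical wrapper, not part of the module.

```python
import unicodedata

def safe_lookup(name, default=None):
    """Hypothetical wrapper: return default instead of raising KeyError."""
    try:
        return unicodedata.lookup(name)
    except KeyError:
        return default

snowman = safe_lookup("SNOWMAN")                  # '\u2603'
missing = safe_lookup("NO SUCH CHARACTER NAME")   # None
```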

mirrored(unichr)

 

Returns the mirrored property assigned to the Unicode character unichr as an integer. Returns 1 if the character has been identified as a "mirrored" character in bidirectional text, 0 otherwise.
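
Paired punctuation is the usual example of a mirrored character. A short Python 3 sketch:

```python
import unicodedata

paren = unicodedata.mirrored("(")       # 1: glyph is mirrored in right-to-left text
less_than = unicodedata.mirrored("<")   # 1
letter = unicodedata.mirrored("a")      # 0
```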

name(unichr, default=...)

 

Returns the name assigned to the Unicode character unichr as a string. If no name is defined, default is returned, or, if not given, ValueError is raised.
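
Control characters are a typical case where no name is defined. The sketch below assumes Python 3 and also shows that name and lookup act as inverses for a named character.

```python
import unicodedata

letter_name = unicodedata.name("a")                  # 'LATIN SMALL LETTER A'
newline_name = unicodedata.name("\n", "<no name>")   # control character, default returned

euro = "\u20ac"
round_trip = unicodedata.lookup(unicodedata.name(euro))   # back to the euro sign
```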

normalize(form, unistr)

 

Return the normal form 'form' for the Unicode string unistr. Valid values for form are 'NFC', 'NFKC', 'NFD', and 'NFKD'.
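
The four forms differ in whether they compose or decompose, and in whether compatibility mappings are applied. A minimal Python 3 sketch:

```python
import unicodedata

composed = "\u00e9"       # e with acute accent as one code point
decomposed = "e\u0301"    # 'e' followed by COMBINING ACUTE ACCENT

nfc = unicodedata.normalize("NFC", decomposed)    # canonical composition
nfd = unicodedata.normalize("NFD", composed)      # canonical decomposition
nfkc = unicodedata.normalize("NFKC", "\ufb01")    # LATIN SMALL LIGATURE FI -> 'fi'
nfkd = unicodedata.normalize("NFKD", "\u00b2")    # SUPERSCRIPT TWO -> '2'
```

Normalizing both sides to the same form before comparing strings avoids mismatches between visually identical text.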

numeric(unichr, default=...)

 

Returns the numeric value assigned to the Unicode character unichr as a float. If no such value is defined, default is returned; if default is not given, ValueError is raised.
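
Unlike decimal and digit, numeric also covers fractions and other numeric characters. A short Python 3 sketch:

```python
import unicodedata

half = unicodedata.numeric("\u00bd")           # VULGAR FRACTION ONE HALF -> 0.5
roman_twelve = unicodedata.numeric("\u216b")   # ROMAN NUMERAL TWELVE -> 12.0
not_numeric = unicodedata.numeric("x", None)   # no numeric value, default returned
```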