Next: Composition of Unicode characters, Up: Normalization forms (composition and decomposition) <uninorm.h>
[Contents][Index]
The following enumerated values are the possible types of decomposition of a Unicode character.
Denotes canonical decomposition.
UCD marker: <font>
. Denotes a font variant (e.g. a blackletter form).
UCD marker: <noBreak>
.
Denotes a no-break version of a space or hyphen.
UCD marker: <initial>
.
Denotes an initial presentation form (Arabic).
UCD marker: <medial>
.
Denotes a medial presentation form (Arabic).
UCD marker: <final>
.
Denotes a final presentation form (Arabic).
UCD marker: <isolated>
.
Denotes an isolated presentation form (Arabic).
UCD marker: <circle>
.
Denotes an encircled form.
UCD marker: <super>
.
Denotes a superscript form.
UCD marker: <sub>
.
Denotes a subscript form.
UCD marker: <vertical>
.
Denotes a vertical layout presentation form.
UCD marker: <wide>
.
Denotes a wide (or zenkaku) compatibility character.
UCD marker: <narrow>
.
Denotes a narrow (or hankaku) compatibility character.
UCD marker: <small>
.
Denotes a small variant form (CNS compatibility).
UCD marker: <square>
.
Denotes a CJK squared font variant.
UCD marker: <fraction>
.
Denotes a vulgar fraction form.
UCD marker: <compat>
.
Denotes an otherwise unspecified compatibility character.
The following constant denotes the maximum size of decomposition of a single Unicode character.
This macro expands to a constant that is the required size of buffer passed to
the uc_decomposition
and uc_canonical_decomposition
functions.
The following functions decompose a Unicode character.
Returns the character decomposition mapping of the Unicode character uc.
decomposition must point to an array of at least
UC_DECOMPOSITION_MAX_LENGTH
ucs_t
elements.
When a decomposition exists, decomposition[0..n-1]
and
*decomp_tag
are filled and n is returned. Otherwise -1 is
returned.
Returns the canonical character decomposition mapping of the Unicode character
uc. decomposition must point to an array of at least
UC_DECOMPOSITION_MAX_LENGTH
ucs_t
elements.
When a decomposition exists, decomposition[0..n-1]
is filled
and n is returned. Otherwise -1 is returned.
Note: This function returns the (simple) “canonical decomposition” of
uc. If you want the “full canonical decomposition” of uc,
that is, the recursive application of “canonical decomposition”, use the
function u*_normalize
with argument UNINORM_NFD
instead.