GNormalizeMode

Defines how a Unicode string is transformed into a canonical form, standardizing such issues as whether an accented character is represented as a base character followed by a combining accent or as a single precomposed character. Unicode strings should generally be normalized before they are compared.
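To illustrate why normalization matters before comparison, here is a minimal sketch. It uses Python's standard `unicodedata` module as a stand-in rather than GLib itself, on the assumption that both apply the same Unicode normalization forms. The helper name `equal_normalized` is hypothetical and chosen only for this example.

```python
import unicodedata

# Two spellings of the same visible text "café".
precomposed = "caf\u00e9"   # 'é' as one code point, U+00E9
decomposed = "cafe\u0301"   # 'e' followed by COMBINING ACUTE ACCENT, U+0301


def equal_normalized(a: str, b: str, form: str = "NFD") -> bool:
    """Compare two strings after bringing both to the same normalization form.

    Hypothetical helper for illustration; not part of GLib.
    """
    return unicodedata.normalize(form, a) == unicodedata.normalize(form, b)


# A raw comparison looks at code points, so the two spellings differ.
raw_equal = precomposed == decomposed

# After normalization both strings use the same representation.
normalized_equal = equal_normalized(precomposed, decomposed)

print(raw_equal, normalized_equal)
```

Which form is used for the comparison matters less than applying the same form to both operands.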

Values

Value    Meaning

DEFAULT = 0

Standardize differences that do not affect the text content, such as the above-mentioned accent representation.

NFD = 0

Another name for G_NORMALIZE_DEFAULT.

DEFAULT_COMPOSE = 1

Like G_NORMALIZE_DEFAULT, but with composed forms rather than a maximally decomposed form.

NFC = 1

Another name for G_NORMALIZE_DEFAULT_COMPOSE.

ALL = 2

In addition to the G_NORMALIZE_DEFAULT transformations, also standardize the "compatibility" characters in Unicode, for example mapping SUPERSCRIPT THREE to its standard form (DIGIT THREE). Formatting information may be lost, but for most text operations such characters should be considered the same.

NFKD = 2

Another name for G_NORMALIZE_ALL.

ALL_COMPOSE = 3

Like G_NORMALIZE_ALL, but with composed forms rather than a maximally decomposed form.

NFKC = 3

Another name for G_NORMALIZE_ALL_COMPOSE.
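The differences between the four modes can be seen side by side on one small input. The sketch below again uses Python's `unicodedata` module rather than GLib, on the assumption that the NFD, NFC, NFKD and NFKC aliases listed above correspond to the Unicode forms of the same names. The `MODE_TO_FORM` table and `normalize_by_mode` helper are hypothetical names introduced for illustration.

```python
import unicodedata

# Hypothetical mapping from the enum value names above to the
# Unicode form names accepted by unicodedata.normalize().
MODE_TO_FORM = {
    "DEFAULT": "NFD",
    "DEFAULT_COMPOSE": "NFC",
    "ALL": "NFKD",
    "ALL_COMPOSE": "NFKC",
}

# 'e' + COMBINING ACUTE ACCENT (U+0301), then SUPERSCRIPT THREE (U+00B3).
sample = "e\u0301\u00b3"


def normalize_by_mode(text: str, mode: str) -> str:
    """Normalize text using the Unicode form that matches the given mode name."""
    return unicodedata.normalize(MODE_TO_FORM[mode], text)


results = {mode: normalize_by_mode(sample, mode) for mode in MODE_TO_FORM}

for mode, out in results.items():
    # Show code points, since composed and decomposed text render alike.
    code_points = " ".join("U+%04X" % ord(ch) for ch in out)
    print(mode.ljust(16), code_points)
```

In this sketch the two DEFAULT modes leave SUPERSCRIPT THREE untouched and differ only in whether the accent is combined into U+00E9, while the two ALL modes additionally replace the superscript with DIGIT THREE (U+0033).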

Meta