Character Values

The class library provides a name, through a set of enumerations, for every character in the Unicode set, with the exception of most of the Han ideographic characters. Names are provided for some particularly significant ideographs, such as digits and the 214 KangXi radicals.

To refer to specific Unicode values, use character names rather than code points. For example, refer to UGeneralPunctuation::kQuestionMark rather than the value U+003F.

Because of the large number of characters, the names are scoped into a set of classes based on script or function. These classes are provided only for referencing the enumerated names they contain--do not use them for any other reason. These classes are listed in the following table. See the enumeration in the referenced header file for specific character names.

Character Name Enumerations

Category Header file Classes
General utility characters iugnrl.hpp UASCII
UCombining
UControlCode
UGeneralPunctuation
ULatin
ULatin1
UModifierLetter
UUnicodeDigit
UUnicodeSpecial
East Asian scripts iueasia.hpp UBopomofo
UHangulChoseong
UHangulJongseong
UHangulJungseong
UHangulLetter
UHangzhouNumeral
UHanNumeral
UHiragana
UIdeographicAnnotation
UKangXiRadical
UKatakana
UKatakanaHiragana
South and Southeast Asian scripts iusasia.hpp UBengali
UDevanagari
UGujarati
UGurmukhi
UKannada
ULao
UMalayalam
UOriya
UTamil
UTelugu
UThai
Eastern European scripts iueeuro.hpp UCoptic
UCyrillic
UGeorgian
UGreek
Mideastern scripts iumeast.hpp UArabic
UArmenian
UHebrew
Characters provided for compatibility with other standards iucmpbty.hpp UArabicCompatibility
UArabicLigature
UArmenianSmallLigature
UBlocks
UBoxDrawings
UCircledDigit
UCircledHangul
UCircledIdeograph
UCircledKatakana
UCircledLatin
UCircledNumber
UCJKCompatibility
UCJKSquaredAbbreviations
UCJKSquaredWords
UCNSCompatibility
UFullStopDigit
UFullStopNumber
UFullwidth
UHalfwidth
UHalfwidthHangulLetter
UHangulSyllable
UIdeographicTelegraph
ULatinSmallLigature
UParenthesizedDigit
UParenthesizedHangul
UParenthesizedIdeograph
UParenthesizedLatin
UParenthesizedNumber
UPresentationFormForVertical
URomanNumeral
USmallVariants
USubscript
USuperscript
UVulgarFraction
Symbols iusyms.hpp UAPLFunctionalSymbol
UArrow
UCJKSymbols
UControlCodePicture
UCurrency
UDingbats
UGeometricShapes
UHarpoon
ULetterLikeSymbol
UMathematicalOperators
UMiscellaneousTechnical
UOCR
UStandardPhonetic
UZapfDingbats

Some character names may refer to characters of types other than UniChar. In such cases, you may need to cast characters before using interfaces that take UniChar parameters.