com.ibm.icu.lang
Interface UProperty


public interface UProperty

Selection constants for Unicode properties.

These constants are used in functions like UCharacter.hasBinaryProperty(int) to select one of the Unicode properties.

The properties APIs are intended to reflect Unicode properties as defined in the Unicode Character Database (UCD) and Unicode Technical Reports (UTR).

For details about the properties see http://www.unicode.org.

For names of Unicode properties see the UCD file PropertyAliases.txt.

Important: If ICU is built with UCD files from Unicode versions below 3.2, then properties marked with "new" are not or not fully available. Check UCharacter.getUnicodeVersion() to be sure.

Author:
Syn Wee Quek
See Also:
UCharacter
Status:
Stable ICU 2.6.

Nested Class Summary
static interface UProperty.NameChoice
          Selector constants for UCharacter.getPropertyName() and UCharacter.getPropertyValueName().
 
Field Summary
static int AGE
          String property Age.
static int ALPHABETIC
          Binary property Alphabetic.
static int ASCII_HEX_DIGIT
          Binary property ASCII_Hex_Digit (0-9 A-F a-f).
static int BIDI_CLASS
          Enumerated property Bidi_Class.
static int BIDI_CONTROL
          Binary property Bidi_Control.
static int BIDI_MIRRORED
          Binary property Bidi_Mirrored.
static int BIDI_MIRRORING_GLYPH
          String property Bidi_Mirroring_Glyph.
static int BINARY_LIMIT
          One more than the last constant for binary Unicode properties.
static int BINARY_START
          First constant for binary Unicode properties.
static int BLOCK
          Enumerated property Block.
static int CANONICAL_COMBINING_CLASS
          Enumerated property Canonical_Combining_Class.
static int CASE_FOLDING
          String property Case_Folding.
static int CASE_SENSITIVE
          Binary property Case_Sensitive.
static int DASH
          Binary property Dash.
static int DECOMPOSITION_TYPE
          Enumerated property Decomposition_Type.
static int DEFAULT_IGNORABLE_CODE_POINT
          Binary property Default_Ignorable_Code_Point (new).
static int DEPRECATED
          Binary property Deprecated (new).
static int DIACRITIC
          Binary property Diacritic.
static int DOUBLE_LIMIT
          One more than the last constant for double Unicode properties.
static int DOUBLE_START
          First constant for double Unicode properties.
static int EAST_ASIAN_WIDTH
          Enumerated property East_Asian_Width.
static int EXTENDER
          Binary property Extender.
static int FULL_COMPOSITION_EXCLUSION
          Binary property Full_Composition_Exclusion.
static int GENERAL_CATEGORY
          Enumerated property General_Category.
static int GENERAL_CATEGORY_MASK
          Bitmask property General_Category_Mask.
static int GRAPHEME_BASE
          Binary property Grapheme_Base (new).
static int GRAPHEME_CLUSTER_BREAK
          Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1).
static int GRAPHEME_EXTEND
          Binary property Grapheme_Extend (new).
static int GRAPHEME_LINK
          Binary property Grapheme_Link (new).
static int HANGUL_SYLLABLE_TYPE
          Enumerated property Hangul_Syllable_Type, new in Unicode 4.
static int HEX_DIGIT
          Binary property Hex_Digit.
static int HYPHEN
          Binary property Hyphen.
static int ID_CONTINUE
          Binary property ID_Continue.
static int ID_START
          Binary property ID_Start.
static int IDEOGRAPHIC
          Binary property Ideographic.
static int IDS_BINARY_OPERATOR
          Binary property IDS_Binary_Operator (new).
static int IDS_TRINARY_OPERATOR
          Binary property IDS_Trinary_Operator (new).
static int INT_LIMIT
          One more than the last constant for enumerated/integer Unicode properties.
static int INT_START
          First constant for enumerated/integer Unicode properties.
static int ISO_COMMENT
          String property ISO_Comment.
static int JOIN_CONTROL
          Binary property Join_Control.
static int JOINING_GROUP
          Enumerated property Joining_Group.
static int JOINING_TYPE
          Enumerated property Joining_Type.
static int LEAD_CANONICAL_COMBINING_CLASS
          Enumerated property Lead_Canonical_Combining_Class.
static int LINE_BREAK
          Enumerated property Line_Break.
static int LOGICAL_ORDER_EXCEPTION
          Binary property Logical_Order_Exception (new).
static int LOWERCASE
          Binary property Lowercase.
static int LOWERCASE_MAPPING
          String property Lowercase_Mapping.
static int MASK_LIMIT
          One more than the last constant for bit-mask Unicode properties.
static int MASK_START
          First constant for bit-mask Unicode properties.
static int MATH
          Binary property Math.
static int NAME
          String property Name.
static int NFC_INERT
          Binary property NFC_Inert.
static int NFC_QUICK_CHECK
          Enumerated property NFC_Quick_Check.
static int NFD_INERT
          Binary property NFD_Inert.
static int NFD_QUICK_CHECK
          Enumerated property NFD_Quick_Check.
static int NFKC_INERT
          Binary property NFKC_Inert.
static int NFKC_QUICK_CHECK
          Enumerated property NFKC_Quick_Check.
static int NFKD_INERT
          Binary property NFKD_Inert.
static int NFKD_QUICK_CHECK
          Enumerated property NFKD_Quick_Check.
static int NONCHARACTER_CODE_POINT
          Binary property Noncharacter_Code_Point.
static int NUMERIC_TYPE
          Enumerated property Numeric_Type.
static int NUMERIC_VALUE
          Double property Numeric_Value.
static int PATTERN_SYNTAX
          Binary property Pattern_Syntax (new in Unicode 4.1).
static int PATTERN_WHITE_SPACE
          Binary property Pattern_White_Space (new in Unicode 4.1).
static int POSIX_ALNUM
          Binary property alnum (a C/POSIX character class).
static int POSIX_BLANK
          Binary property blank (a C/POSIX character class).
static int POSIX_GRAPH
          Binary property graph (a C/POSIX character class).
static int POSIX_PRINT
          Binary property print (a C/POSIX character class).
static int POSIX_XDIGIT
          Binary property xdigit (a C/POSIX character class).
static int QUOTATION_MARK
          Binary property Quotation_Mark.
static int RADICAL
          Binary property Radical (new).
static int S_TERM
          Binary property STerm (new in Unicode 4.0.1).
static int SCRIPT
          Enumerated property Script.
static int SEGMENT_STARTER
          Binary Property Segment_Starter.
static int SENTENCE_BREAK
          Enumerated property Sentence_Break (new in Unicode 4.1).
static int SIMPLE_CASE_FOLDING
          String property Simple_Case_Folding.
static int SIMPLE_LOWERCASE_MAPPING
          String property Simple_Lowercase_Mapping.
static int SIMPLE_TITLECASE_MAPPING
          String property Simple_Titlecase_Mapping.
static int SIMPLE_UPPERCASE_MAPPING
          String property Simple_Uppercase_Mapping.
static int SOFT_DOTTED
          Binary property Soft_Dotted (new).
static int STRING_LIMIT
          One more than the last constant for string Unicode properties.
static int STRING_START
          First constant for string Unicode properties.
static int TERMINAL_PUNCTUATION
          Binary property Terminal_Punctuation.
static int TITLECASE_MAPPING
          String property Titlecase_Mapping.
static int TRAIL_CANONICAL_COMBINING_CLASS
          Enumerated property Trail_Canonical_Combining_Class.
static int UNICODE_1_NAME
          String property Unicode_1_Name.
static int UNIFIED_IDEOGRAPH
          Binary property Unified_Ideograph (new).
static int UPPERCASE
          Binary property Uppercase.
static int UPPERCASE_MAPPING
          String property Uppercase_Mapping.
static int VARIATION_SELECTOR
          Binary property Variation_Selector (new in Unicode 4.0.1).
static int WHITE_SPACE
          Binary property White_Space.
static int WORD_BREAK
          Enumerated property Word_Break (new in Unicode 4.1).
static int XID_CONTINUE
          Binary property XID_Continue.
static int XID_START
          Binary property XID_Start.
 

Field Detail

ALPHABETIC

static final int ALPHABETIC

Binary property Alphabetic.

Property for UCharacter.isUAlphabetic(), different from the property in UCharacter.isalpha().

Lu + Ll + Lt + Lm + Lo + Nl + Other_Alphabetic.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

BINARY_START

static final int BINARY_START
First constant for binary Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

ASCII_HEX_DIGIT

static final int ASCII_HEX_DIGIT
Binary property ASCII_Hex_Digit (0-9 A-F a-f).

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

BIDI_CONTROL

static final int BIDI_CONTROL

Binary property Bidi_Control.

Format controls which have specific functions in the Bidi Algorithm.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

BIDI_MIRRORED

static final int BIDI_MIRRORED

Binary property Bidi_Mirrored.

Characters that may change display in RTL text.

Property for UCharacter.isMirrored().

See Bidi Algorithm; UTR 9.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

DASH

static final int DASH

Binary property Dash.

Variations of dashes.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

DEFAULT_IGNORABLE_CODE_POINT

static final int DEFAULT_IGNORABLE_CODE_POINT

Binary property Default_Ignorable_Code_Point (new).

Property that indicates codepoint is ignorable in most processing.

Codepoints (2060..206F, FFF0..FFFB, E0000..E0FFF) + Other_Default_Ignorable_Code_Point + (Cf + Cc + Cs - White_Space)

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

DEPRECATED

static final int DEPRECATED

Binary property Deprecated (new).

The usage of deprecated characters is strongly discouraged.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

DIACRITIC

static final int DIACRITIC

Binary property Diacritic.

Characters that linguistically modify the meaning of another character to which they apply.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

EXTENDER

static final int EXTENDER

Binary property Extender.

Extend the value or shape of a preceding alphabetic character, e.g. length and iteration marks.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

FULL_COMPOSITION_EXCLUSION

static final int FULL_COMPOSITION_EXCLUSION

Binary property Full_Composition_Exclusion.

CompositionExclusions.txt + Singleton Decompositions + Non-Starter Decompositions.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

GRAPHEME_BASE

static final int GRAPHEME_BASE

Binary property Grapheme_Base (new).

For programmatic determination of grapheme cluster boundaries. [0..10FFFF]-Cc-Cf-Cs-Co-Cn-Zl-Zp-Grapheme_Link-Grapheme_Extend-CGJ

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

GRAPHEME_EXTEND

static final int GRAPHEME_EXTEND

Binary property Grapheme_Extend (new).

For programmatic determination of grapheme cluster boundaries.

Me+Mn+Mc+Other_Grapheme_Extend-Grapheme_Link-CGJ

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

GRAPHEME_LINK

static final int GRAPHEME_LINK

Binary property Grapheme_Link (new).

For programmatic determination of grapheme cluster boundaries.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

HEX_DIGIT

static final int HEX_DIGIT

Binary property Hex_Digit.

Characters commonly used for hexadecimal numbers.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

HYPHEN

static final int HYPHEN

Binary property Hyphen.

Dashes used to mark connections between pieces of words, plus the Katakana middle dot.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

ID_CONTINUE

static final int ID_CONTINUE

Binary property ID_Continue.

Characters that can continue an identifier.

ID_Start+Mn+Mc+Nd+Pc

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

ID_START

static final int ID_START

Binary property ID_Start.

Characters that can start an identifier.

Lu+Ll+Lt+Lm+Lo+Nl

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

IDEOGRAPHIC

static final int IDEOGRAPHIC

Binary property Ideographic.

CJKV ideographs.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

IDS_BINARY_OPERATOR

static final int IDS_BINARY_OPERATOR

Binary property IDS_Binary_Operator (new).

For programmatic determination of Ideographic Description Sequences.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

IDS_TRINARY_OPERATOR

static final int IDS_TRINARY_OPERATOR

Binary property IDS_Trinary_Operator (new).

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

JOIN_CONTROL

static final int JOIN_CONTROL

Binary property Join_Control.

Format controls for cursive joining and ligation.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

LOGICAL_ORDER_EXCEPTION

static final int LOGICAL_ORDER_EXCEPTION

Binary property Logical_Order_Exception (new).

Characters that do not use logical order and require special handling in most processing.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

LOWERCASE

static final int LOWERCASE

Binary property Lowercase.

Same as UCharacter.isULowercase(), different from UCharacter.islower().

Ll+Other_Lowercase

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

MATH

static final int MATH

Binary property Math.

Sm+Other_Math

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

NONCHARACTER_CODE_POINT

static final int NONCHARACTER_CODE_POINT

Binary property Noncharacter_Code_Point.

Code points that are explicitly defined as illegal for the encoding of characters.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

QUOTATION_MARK

static final int QUOTATION_MARK

Binary property Quotation_Mark.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

RADICAL

static final int RADICAL

Binary property Radical (new).

For programmatic determination of Ideographic Description Sequences.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

SOFT_DOTTED

static final int SOFT_DOTTED

Binary property Soft_Dotted (new).

Characters with a "soft dot", like i or j.

An accent placed on these characters causes the dot to disappear.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

TERMINAL_PUNCTUATION

static final int TERMINAL_PUNCTUATION

Binary property Terminal_Punctuation.

Punctuation characters that generally mark the end of textual units.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

UNIFIED_IDEOGRAPH

static final int UNIFIED_IDEOGRAPH

Binary property Unified_Ideograph (new).

For programmatic determination of Ideographic Description Sequences.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

UPPERCASE

static final int UPPERCASE

Binary property Uppercase.

Same as UCharacter.isUUppercase(), different from UCharacter.isUpperCase().

Lu+Other_Uppercase

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

WHITE_SPACE

static final int WHITE_SPACE

Binary property White_Space.

Same as UCharacter.isUWhiteSpace(), different from UCharacter.isSpace() and UCharacter.isWhitespace().

Space characters+TAB+CR+LF-ZWSP-ZWNBSP

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

XID_CONTINUE

static final int XID_CONTINUE

Binary property XID_Continue.

ID_Continue modified to allow closure under normalization forms NFKC and NFKD.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

XID_START

static final int XID_START

Binary property XID_Start.

ID_Start modified to allow closure under normalization forms NFKC and NFKD.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

CASE_SENSITIVE

static final int CASE_SENSITIVE

Binary property Case_Sensitive.

Either the source of a case mapping or _in_ the target of a case mapping. Not the same as the general category Cased_Letter.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

S_TERM

static final int S_TERM
Binary property STerm (new in Unicode 4.0.1). Sentence Terminal. Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/)

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

VARIATION_SELECTOR

static final int VARIATION_SELECTOR
Binary property Variation_Selector (new in Unicode 4.0.1). Indicates all those characters that qualify as Variation Selectors. For details on the behavior of these characters, see StandardizedVariants.html and 15.6 Variation Selectors.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

NFD_INERT

static final int NFD_INERT
Binary property NFD_Inert. ICU-specific property for characters that are inert under NFD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions. There is one such property per normalization form. These properties are computed as follows - an inert character is: a) unassigned, or ALL of the following: b) of combining class 0. c) not decomposed by this normalization form. AND if NFC or NFKC, d) can never compose with a previous character. e) can never compose with a following character. f) can never change if another character is added. Example: a-breve might satisfy all but f, but if you add an ogonek it changes to a-ogonek + breve See also com.ibm.text.UCD.NFSkippable in the ICU4J repository, and icu/source/common/unormimp.h .

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

NFKD_INERT

static final int NFKD_INERT
Binary property NFKD_Inert. ICU-specific property for characters that are inert under NFKD, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

See Also:
NFD_INERT, Constant Field Values
Status:
Stable ICU 3.0.

NFC_INERT

static final int NFC_INERT
Binary property NFC_Inert. ICU-specific property for characters that are inert under NFC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

See Also:
NFD_INERT, Constant Field Values
Status:
Stable ICU 3.0.

NFKC_INERT

static final int NFKC_INERT
Binary property NFKC_Inert. ICU-specific property for characters that are inert under NFKC, i.e., they do not interact with adjacent characters. Used for example in normalizing transforms in incremental mode to find the boundary of safely normalizable text despite possible text additions.

See Also:
NFD_INERT, Constant Field Values
Status:
Stable ICU 3.0.

SEGMENT_STARTER

static final int SEGMENT_STARTER
Binary Property Segment_Starter. ICU-specific property for characters that are starters in terms of Unicode normalization and combining character sequences. They have ccc=0 and do not occur in non-initial position of the canonical decomposition of any character (like " in NFD(a-umlaut) and a Jamo T in an NFD(Hangul LVT)). ICU uses this property for segmenting a string for generating a set of canonically equivalent strings, e.g. for canonical closure while processing collation tailoring rules.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

PATTERN_SYNTAX

static final int PATTERN_SYNTAX
Binary property Pattern_Syntax (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

PATTERN_WHITE_SPACE

static final int PATTERN_WHITE_SPACE
Binary property Pattern_White_Space (new in Unicode 4.1). See UAX #31 Identifier and Pattern Syntax (http://www.unicode.org/reports/tr31/)

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

POSIX_ALNUM

static final int POSIX_ALNUM
Binary property alnum (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

POSIX_BLANK

static final int POSIX_BLANK
Binary property blank (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

POSIX_GRAPH

static final int POSIX_GRAPH
Binary property graph (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

POSIX_PRINT

static final int POSIX_PRINT
Binary property print (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

POSIX_XDIGIT

static final int POSIX_XDIGIT
Binary property xdigit (a C/POSIX character class). Implemented according to the UTS #18 Annex C Standard Recommendation. See the UCharacter class documentation.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

BINARY_LIMIT

static final int BINARY_LIMIT

One more than the last constant for binary Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

BIDI_CLASS

static final int BIDI_CLASS
Enumerated property Bidi_Class. Same as UCharacter.getDirection(int), returns UCharacterDirection values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

INT_START

static final int INT_START
First constant for enumerated/integer Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

BLOCK

static final int BLOCK
Enumerated property Block. Same as UCharacter.UnicodeBlock.of(int), returns UCharacter.UnicodeBlock values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

CANONICAL_COMBINING_CLASS

static final int CANONICAL_COMBINING_CLASS
Enumerated property Canonical_Combining_Class. Same as UCharacter.getCombiningClass(int), returns 8-bit numeric values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

DECOMPOSITION_TYPE

static final int DECOMPOSITION_TYPE
Enumerated property Decomposition_Type. Returns UCharacter.DecompositionType values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

EAST_ASIAN_WIDTH

static final int EAST_ASIAN_WIDTH
Enumerated property East_Asian_Width. See http://www.unicode.org/reports/tr11/ Returns UCharacter.EastAsianWidth values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

GENERAL_CATEGORY

static final int GENERAL_CATEGORY
Enumerated property General_Category. Same as UCharacter.getType(int), returns UCharacterCategory values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

JOINING_GROUP

static final int JOINING_GROUP
Enumerated property Joining_Group. Returns UCharacter.JoiningGroup values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

JOINING_TYPE

static final int JOINING_TYPE
Enumerated property Joining_Type. Returns UCharacter.JoiningType values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

LINE_BREAK

static final int LINE_BREAK
Enumerated property Line_Break. Returns UCharacter.LineBreak values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

NUMERIC_TYPE

static final int NUMERIC_TYPE
Enumerated property Numeric_Type. Returns UCharacter.NumericType values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

SCRIPT

static final int SCRIPT
Enumerated property Script. Same as UScript.getScript(int), returns UScript values.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

HANGUL_SYLLABLE_TYPE

static final int HANGUL_SYLLABLE_TYPE
Enumerated property Hangul_Syllable_Type, new in Unicode 4. Returns HangulSyllableType values.

See Also:
Constant Field Values
Status:
Stable ICU 2.6.

NFD_QUICK_CHECK

static final int NFD_QUICK_CHECK
Enumerated property NFD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

NFKD_QUICK_CHECK

static final int NFKD_QUICK_CHECK
Enumerated property NFKD_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

NFC_QUICK_CHECK

static final int NFC_QUICK_CHECK
Enumerated property NFC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

NFKC_QUICK_CHECK

static final int NFKC_QUICK_CHECK
Enumerated property NFKC_Quick_Check. Returns numeric values compatible with Normalizer.QuickCheckResult.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

LEAD_CANONICAL_COMBINING_CLASS

static final int LEAD_CANONICAL_COMBINING_CLASS
Enumerated property Lead_Canonical_Combining_Class. ICU-specific property for the ccc of the first code point of the decomposition, or lccc(c)=ccc(NFD(c)[0]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

TRAIL_CANONICAL_COMBINING_CLASS

static final int TRAIL_CANONICAL_COMBINING_CLASS
Enumerated property Trail_Canonical_Combining_Class. ICU-specific property for the ccc of the last code point of the decomposition, or lccc(c)=ccc(NFD(c)[last]). Useful for checking for canonically ordered text; see Normalizer.FCD and http://www.unicode.org/notes/tn5/#FCD . Returns 8-bit numeric values like CANONICAL_COMBINING_CLASS.

See Also:
Constant Field Values
Status:
Stable ICU 3.0.

GRAPHEME_CLUSTER_BREAK

static final int GRAPHEME_CLUSTER_BREAK
Enumerated property Grapheme_Cluster_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UGraphemeClusterBreak values.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

SENTENCE_BREAK

static final int SENTENCE_BREAK
Enumerated property Sentence_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns USentenceBreak values.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

WORD_BREAK

static final int WORD_BREAK
Enumerated property Word_Break (new in Unicode 4.1). Used in UAX #29: Text Boundaries (http://www.unicode.org/reports/tr29/) Returns UWordBreakValues values.

See Also:
Constant Field Values
Status:
Stable ICU 3.4.

INT_LIMIT

static final int INT_LIMIT
One more than the last constant for enumerated/integer Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

GENERAL_CATEGORY_MASK

static final int GENERAL_CATEGORY_MASK
Bitmask property General_Category_Mask. This is the General_Category property returned as a bit mask. When used in UCharacter.getIntPropertyValue(c), returns bit masks for UCharacterCategory values where exactly one bit is set. When used with UCharacter.getPropertyValueName() and UCharacter.getPropertyValueEnum(), a multi-bit mask is used for sets of categories like "Letters".

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

MASK_START

static final int MASK_START
First constant for bit-mask Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

MASK_LIMIT

static final int MASK_LIMIT
One more than the last constant for bit-mask Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

NUMERIC_VALUE

static final int NUMERIC_VALUE
Double property Numeric_Value. Corresponds to UCharacter.getUnicodeNumericValue(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

DOUBLE_START

static final int DOUBLE_START
First constant for double Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

DOUBLE_LIMIT

static final int DOUBLE_LIMIT
One more than the last constant for double Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

AGE

static final int AGE
String property Age. Corresponds to UCharacter.getAge(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

STRING_START

static final int STRING_START
First constant for string Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

BIDI_MIRRORING_GLYPH

static final int BIDI_MIRRORING_GLYPH
String property Bidi_Mirroring_Glyph. Corresponds to UCharacter.getMirror(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

CASE_FOLDING

static final int CASE_FOLDING
String property Case_Folding. Corresponds to UCharacter.foldCase(String, boolean).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

ISO_COMMENT

static final int ISO_COMMENT
String property ISO_Comment. Corresponds to UCharacter.getISOComment(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

LOWERCASE_MAPPING

static final int LOWERCASE_MAPPING
String property Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(String).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

NAME

static final int NAME
String property Name. Corresponds to UCharacter.getName(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

SIMPLE_CASE_FOLDING

static final int SIMPLE_CASE_FOLDING
String property Simple_Case_Folding. Corresponds to UCharacter.foldCase(int, boolean).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

SIMPLE_LOWERCASE_MAPPING

static final int SIMPLE_LOWERCASE_MAPPING
String property Simple_Lowercase_Mapping. Corresponds to UCharacter.toLowerCase(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

SIMPLE_TITLECASE_MAPPING

static final int SIMPLE_TITLECASE_MAPPING
String property Simple_Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

SIMPLE_UPPERCASE_MAPPING

static final int SIMPLE_UPPERCASE_MAPPING
String property Simple_Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

TITLECASE_MAPPING

static final int TITLECASE_MAPPING
String property Titlecase_Mapping. Corresponds to UCharacter.toTitleCase(String).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

UNICODE_1_NAME

static final int UNICODE_1_NAME
String property Unicode_1_Name. Corresponds to UCharacter.getName1_0(int).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

UPPERCASE_MAPPING

static final int UPPERCASE_MAPPING
String property Uppercase_Mapping. Corresponds to UCharacter.toUpperCase(String).

See Also:
Constant Field Values
Status:
Stable ICU 2.4.

STRING_LIMIT

static final int STRING_LIMIT
One more than the last constant for string Unicode properties.

See Also:
Constant Field Values
Status:
Stable ICU 2.4.


Copyright (c) 2009 IBM Corporation and others.