A Unicode block is one of several contiguous ranges of numeric character codes ( code points ) of the Unicode character set that are defined by the Unicode Consortium for administrative and documentation purposes. Typically, proposals such as the addition of new glyphs are discussed and evaluated by considering the relevant block or blocks as a whole.
9-875: Control Pictures is a Unicode block containing characters for graphically representing the C0 control codes , and other control characters. Its block name in Unicode 1.0 was Pictures for Control Codes . The following Unicode-related documents record the purpose and process of defining specific characters in the Control Pictures block: Unicode block Each block is generally, but not always, meant to supply glyphs used by one or more specific languages, or in some general application area such as mathematics , surveying , decorative typesetting , social forums, etc. Unicode blocks are identified by unique names, which use only ASCII characters and are usually descriptive of
18-502: A script property , specifying which writing system it is intended for, or whether it is intended for multiple writing systems. This, also, is independent of block. In descriptions of the Unicode system, a block may be subdivided into more specific subgroups, such as the " Chess symbols " in the Miscellaneous Symbols block (not to be confused with the separate Chess Symbols block). Those subgroups are not "blocks" in
27-444: The code point. ) The size of a block may range from the minimum of 16 to a maximum of 65,536 code points. Every assigned code point has a glyph property called "Block", whose value is a character string naming the unique block that owns that point. However, a block may also contain unassigned code points, usually reserved for future additions of characters that "logically" should belong to that block. Code points not belonging to any of
36-525: The named blocks, e.g. in the unassigned planes 4–13, have the value block="No_Block". Simply belonging to a particular Unicode block does not guarantee the certain particular properties of the characters it is or will be expected to contain. The identity of any character is determined by its properties stated in the Unicode Character Database. For example, the contiguous range of 32 noncharacter code points U+FDD0..U+FDEF share none of
45-462: The nature of the symbols, in English ; such as "Tibetan" or "Supplemental Arrows-A". (When comparing block names, one is supposed to equate uppercase with lowercase letters, and ignore any whitespace, hyphens, and underbars; so the last name is equivalent to "supplemental_arrows__a" and "SUPPLEMENTALARROWSA". Blocks are pairwise disjoint ; that is, they do not overlap. The starting code point and
54-498: The properties common to the other characters in the Arabic Presentation Forms-A block, that they are certainly not Arabic script characters or "right-to-left noncharacters", and are assigned there as a filler to this block given that it has been agreed that no further Arabic compatibility characters will be encoded. Each Unicode point also has a property called " General Category ", that attempts to describe
63-531: The role of the corresponding symbol in the languages or applications for whose sake it was included in the system. Examples of General Categories are "Lu" (meaning upper-case letter), "Nd" (decimal digit), "Pi" (open-quote punctuation), and "Mn" (non-spacing mark, i.e. a diacritic for the preceding glyph). This division is completely independent of code blocks: the code points with a given General Category generally span many blocks, and do not have to be consecutive, not even within each block. Each code point also has
72-497: The size (number of code points) of each block are always multiples of 16; therefore, in the hexadecimal notation, the starting (smallest) point is U+ xxx 0 and the ending (largest) point is U+ yyy F, where xxx and yyy are three or more hexadecimal digits. (These constraints are intended to simplify the display of glyphs in Unicode Consortium documents, as tables with 16 rows labeled with the last hexadecimal digit of
81-2013: The technical sense used by the Unicode consortium, and are named only for the convenience of users. Unicode 16.0 defines 338 blocks: The Unicode Stability Policy requires that a character, once assigned, may not be moved or removed, although it may be deprecated. This applies to Unicode 2.0 and all subsequent versions. Prior to this, the following former blocks were moved: 0000–0FFF 1000–1FFF 2000–2FFF 3000–3FFF 4000–4FFF 5000–5FFF 6000–6FFF 7000–7FFF 8000–8FFF 9000–9FFF A000–AFFF B000–BFFF C000–CFFF D000–DFFF E000–EFFF F000–FFFF 10000–10FFF 11000–11FFF 12000–12FFF 13000–13FFF 14000–14FFF 16000–16FFF 17000–17FFF 18000–18FFF 1A000–1AFFF 1B000–1BFFF 1C000–1CFFF 1D000–1DFFF 1E000–1EFFF 1F000–1FFFF 20000–20FFF 21000–21FFF 22000–22FFF 23000–23FFF 24000–24FFF 25000–25FFF 26000–26FFF 27000–27FFF 28000–28FFF 29000–29FFF 2A000–2AFFF 2B000–2BFFF 2C000–2CFFF 2D000–2DFFF 2E000–2EFFF 2F000–2FFFF 30000–30FFF 31000–31FFF 32000–32FFF E0000–E0FFF 15: SPUA-A F0000–FFFFF 16: SPUA-B 100000–10FFFF Unicode Too Many Requests If you report this error to
#196803