Character Map is a utility included with Microsoft Windows operating systems and is used to view the characters in any installed font , to check what keyboard input ( Alt code ) is used to enter those characters, and to copy characters to the clipboard in lieu of typing them. Other operating systems have apps which do the same things that Character Map does; for example, Apple MacOS Character Viewer (formerly Character Palette).
21-465: The tool is usually useful for entering special characters . It can be opened via the command-line interface or Run command dialog using the 'charmap' command. The "Advanced view" check box can be used to inspect the character sets in a font according to different encodings ( code pages ), including Unicode code ranges, to locate particular characters by their Unicode code point and to search for characters by their Unicode name. For Unicode fonts,
42-664: A character entity reference refers to a character by the name of an entity which has the desired character as its replacement text . The entity must either be predefined (built into the markup language) or explicitly declared in a Document Type Definition (DTD). The format is the same as for any entity reference: where name is the case-sensitive name of the entity. The semicolon is required. Because numbers are harder for humans to remember than names, character entity references are most often written by humans, while numeric character references are most often produced by computer programs. 65 characters, including DEL . All belong to
63-458: A character by a predefined name. A numeric character reference uses the format or where nnnn is the code point in decimal form, and hhhh is the code point in hexadecimal form. The x must be lowercase in XML documents. The nnnn or hhhh may be any number of digits and may include leading zeros. The hhhh may mix uppercase and lowercase, though uppercase is the usual style. In contrast,
84-530: A complete SGML document, it is known in the calling document as an SGML document entity . An SGML document is a text document with SGML markup defined in an SGML prologue (i.e., the DTD and subdocuments). A complete SGML document comprises not only the document instance itself, but also the prologue and, optionally, the SGML declaration (which defines the document's markup syntax and declares the character encoding ). An entity
105-448: A literal value, or may have some combination of an optional SYSTEM identifier , which allows SGML parsers to process an entity's string referent as a resource identifier, and an optional PUBLIC identifier, which identifies the entity independent of any particular representation. In XML , a subset of SGML , an entity declaration may not have a PUBLIC identifier without a SYSTEM identifier. When an external entity references
126-405: A value that is either a literal string, or a parsed string comprising markup and entities defined in the same document (such as a Document Type Declaration or subdocument). In contrast, an external entity has a declaration that invokes an external document, thereby necessitating the intervention of an entity manager to resolve the external document reference. An entity declaration may have
147-583: Is a need to use characters that are not easily typed or that are not widely supported by legacy character encodings. Each such entity consists of just one character from the Universal Character Set . Although any character can be referenced using a numeric character reference , a character entity reference allows characters to be referenced by name instead of code point . For example, HTML 4 has 252 built-in character entities that do not need to be explicitly declared, while XML has five. XHTML has
168-535: Is accessible in a text field on Windows 10 computers, using the keyboard shortcut ⊞ Win + . , or the 😀 key in Windows 10's virtual touch keyboard, which is mainly used for the purposes of using emoji , but also allows access to a smaller set of special characters. The Windows NT series of operating systems from Workstation and Server 4.0 build 1381 and the Windows 9x -series from Windows 95 onwards also contain
189-405: Is defined via an entity declaration in a document's document type definition (DTD). For example: This DTD markup declares the following: Names for entities must follow the rules for SGML names , and there are limitations on where entities can be referenced. Parameter entities are referenced by placing the entity name between % and ; . Parsed general entities are referenced by placing
210-521: Is meant when an organization says a password "requires punctuation marks". 96 characters; the 62 letters, and two ordinal indicators belong to the Latin script. The remaining 32 belong to the common script. Uppercase Uppercase Lowercase Lowercase 128 characters; all belong to the Latin script. 208 characters; all belong to the Latin script; 33 in the MES-2 subset. 256 characters; all belong to
231-871: The Standard Generalized Markup Language (SGML), an entity is a primitive data type , which associates a string with either a unique alias (such as a user-specified name) or an SGML reserved word (such as #DEFAULT ). Entities are foundational to the organizational structure and definition of SGML documents. The SGML specification defines numerous entity types , which are distinguished by keyword qualifiers and context. An entity string value may variously consist of plain text , SGML tags, and/or references to previously defined entities. Certain entity types may also invoke external documents. Entities are called by reference . Entities are classified as general or parameter: Entities are also further classified as parsed or unparsed: An internal entity has
SECTION 10
#1732844776166252-456: The common script. Footnotes: The Unicode Standard (version 16.0) classifies 1,487 characters as belonging to the Latin script. 95 characters; the 52 alphabet characters belong to the Latin script. The remaining 43 belong to the common script. The 33 characters classified as ASCII Punctuation & Symbols are also sometimes referred to as ASCII special characters . Often only these characters (and not other Unicode punctuation) are what
273-489: The 1,062 characters in the Multilingual European Character Set 2 ( MES-2 ) subset, and some additional related characters. HTML and XML provide ways to reference Unicode characters when the characters themselves either cannot or should not be used. A numeric character reference refers to a character by its Universal Character Set / Unicode code point , and a character entity reference refers to
294-757: The Latin script; 23 in the MES-2 subset. 96 characters; all belong to the Latin script; three in the MES-2 subset. 80 characters; 15 in the MES-2 subset. 144 code points; 135 assigned characters; 85 in the MES-2 subset. For polytonic orthography . 256 code points; 233 assigned characters, all in the MES-2 subset (#670 – 902). 256 characters; 191 in the MES-2 subset. The range from U+0900 to U+0DFF includes Devanagari , Bengali script , Gurmukhi , Gujarati script , Odia alphabet , Tamil script , Telugu script , Kannada script , Malayalam script , and Sinhala script . Other Brahmic and Indic scripts in Unicode include: 112 code points; 111 assigned characters; 24 in
315-1612: The MES-2 subset. 0000–0FFF 1000–1FFF 2000–2FFF 3000–3FFF 4000–4FFF 5000–5FFF 6000–6FFF 7000–7FFF 8000–8FFF 9000–9FFF A000–AFFF B000–BFFF C000–CFFF D000–DFFF E000–EFFF F000–FFFF 10000–10FFF 11000–11FFF 12000–12FFF 13000–13FFF 14000–14FFF 16000–16FFF 17000–17FFF 18000–18FFF 1A000–1AFFF 1B000–1BFFF 1C000–1CFFF 1D000–1DFFF 1E000–1EFFF 1F000–1FFFF 20000–20FFF 21000–21FFF 22000–22FFF 23000–23FFF 24000–24FFF 25000–25FFF 26000–26FFF 27000–27FFF 28000–28FFF 29000–29FFF 2A000–2AFFF 2B000–2BFFF 2C000–2CFFF 2D000–2DFFF 2E000–2EFFF 2F000–2FFFF 30000–30FFF 31000–31FFF 32000–32FFF E0000–E0FFF 15: SPUA-A F0000–FFFFF 16: SPUA-B 100000–10FFFF SGML entity In
336-691: The Start / Run dialog box. On Windows 2000 , Windows XP , Windows Vista , and Windows 7 , the utility is in All Programs → Accessories → System Tools → Character Map in the Start Menu . On Windows 10 , the utility is in the Windows Accessories folder in the Start Menu . Beginning with Windows Vista, the user can also type the name of the utility in the Start Menu search box. A secondary character map program
357-606: The character map called Characters Map is available from third parties for systems from OS/2 Warp 3 onwards to current ArcaOS versions. The MacOS version is included in the Font Book app, and is shown when viewing the "Repertoire" of a font. A Linux GNUstep character map application, "Charmap", is developed by GNU Savannah . Special characters As of Unicode version 16.0, there are 155,063 characters with code points , covering 168 modern and historical scripts , as well as multiple symbol sets. This article includes
378-519: The character map, as do versions of Windows CE using a GUI based on these systems' explorer.exe, introduced with Windows 95. Another version of the character map is found in the Progman.exe-based Windows 3.11 and Windows NT 3.51 . Other operating systems such as some Unix - Linux variants with GUIs , the HP-48 series graphing calculators and others also have a similar accessory. The OS/2 analogue of
399-452: The characters can be grouped by their Unicode subrange. Although the Unicode standard already extends character field to plane 16 and many codepoints of plane 1 are assigned with characters, this tool still only supports code points on plane 0 (between U+0000 and U+FFFF). Additionally, it does not display certain characters in that range for reasons unexplained. With all versions of Windows the utility can be started by entering charmap in
420-419: The entity name between " & " and " ; ". Unparsed entities are referenced by placing the entity name in the value of an attribute declared as type ENTITY. The general entities from the example above might be referenced in a document as follows: When parsed, this document would be reported to the downstream application the same as if it has been written as follows, assuming the hello.txt file contains
441-441: The text Salutations : A reference to an undeclared entity is an error unless a default entity has been defined. For example: Additional markup constructs and processor options may affect whether and how entities are processed. For example, a processor may optionally ignore external entities. Standard entity sets for SGML and some of its derivatives have been developed as mnemonic devices, to ease document authoring when there
SECTION 20
#1732844776166#165834