This table was prepared from SGML.TXT, available at ftp.unicode.org:
ftp://ftp.unicode.org/MAPPINGS/VENDORS/MISC/SGML.TXT
(if using ftp, try cd Public/MAPPINGS/VENDORS/MISC)

Original comment:

# Author: John Cowan <cowan@ccil.org>
# Date: 25 July 1997
#
# The following table maps SGML character entities from various
# public sets (namely, ISOamsa, ISOamsb, ISOamsc, ISOamsn, ISOamso,
# ISOamsr, ISObox, ISOcyr1, ISOcyr2, ISOdia, ISOgrk1, ISOgrk2,
# ISOgrk3, ISOgrk4, ISOlat1, ISOlat2, ISOnum, ISOpub, ISOtech,
# HTMLspecial, HTMLsymbol) to corresponding Unicode characters.
#
# The table has four tab-separated columns:
#   Column 1: SGML character entity name
#   Column 2: SGML public entity set
#   Column 3: Unicode 2.0 character code
#   Column 4: Unicode 2.0 character name (UPPER CASE)
# Entries which don't have Unicode equivalents have "0x????"
# in Column 3 and a lower case description (from the public entity
# set DTD) in Column 4. The mapping is not reversible, because many
# distinctions are unified away in Unicode, particularly between
# mathematical symbols.
#
# The table is sorted case-blind by SGML character entity name.
#
# The contents of this table are drawn from various sources, and
# are in the public domain.
#

This test illustrates Unicode numeric entities like ⊫
The entities are sorted by Unicode number.
You should see visible characters if your display character set supports
them, or otherwise a substitution string picked up from
src/chrtrans/def7_uni.tbl
If you see something like 㓒, either this number is unknown to
def7_uni.tbl or the browser's internal implementation is broken.

Leonid Pauzner.

0x0021  !  # EXCLAMATION MARK
0x0022  "  # QUOTATION MARK
0x0023  #  # NUMBER SIGN
0x0024  $  # DOLLAR SIGN
0x0025  %  # PERCENT SIGN
0x0026  &  # AMPERSAND
0x0028  (  # LEFT PARENTHESIS
0x0029  )  # RIGHT PARENTHESIS
0x002A  *  # ASTERISK
0x002B  +  # PLUS SIGN
0x002C  ,  # COMMA
0x002D  -  # HYPHEN-MINUS
0x002E  .  # FULL STOP
0x002F  /  # SOLIDUS
0x003A  :  # COLON
0x003B  ;  # SEMICOLON
0x003C  <  # LESS-THAN SIGN
0x003D  =  # EQUALS SIGN
0x003E  >  # GREATER-THAN SIGN
0x003F  ?  # QUESTION MARK
0x0040  @  # COMMERCIAL AT
0x005B  [  # LEFT SQUARE BRACKET
0x005C  \  # REVERSE SOLIDUS
0x005C  \  # REVERSE SOLIDUS
0x005D  ]  # RIGHT SQUARE BRACKET
0x005F  _  # LOW LINE
0x0060  `  # GRAVE ACCENT
0x007B  {  # LEFT CURLY BRACKET
0x007C  |  # VERTICAL LINE
0x007D  }  # RIGHT CURLY BRACKET
0x00A0     # NO-BREAK SPACE
0x00A1  ¡  # INVERTED EXCLAMATION MARK
0x00A2  ¢  # CENT SIGN
0x00A3  £  # POUND SIGN
0x00A4  ¤  # CURRENCY SIGN
0x00A5  ¥  # YEN SIGN
0x00A6  ¦  # BROKEN BAR
0x00A7  §  # SECTION SIGN
0x00A8  ¨  # DIAERESIS
0x00A9  ©  # COPYRIGHT SIGN
0x00AA  ª  # FEMININE ORDINAL INDICATOR
0x00AB  «  # LEFT-POINTING DOUBLE ANGLE QUOTATION MARK
0x00AC  ¬  # NOT SIGN
0x00AD  ­  # SOFT HYPHEN
0x00AE  ®  # REGISTERED SIGN
0x00AF  ¯  # MACRON
0x00B0  °  # DEGREE SIGN
0x00B1  ±  # PLUS-MINUS SIGN
0x00B2  ²  # SUPERSCRIPT TWO
0x00B3  ³  # SUPERSCRIPT THREE
0x00B4  ´  # ACUTE ACCENT
0x00B5  µ  # MICRO SIGN
0x00B6  ¶  # PILCROW SIGN
0x00B7  ·  # MIDDLE DOT
0x00B8  ¸  # CEDILLA
0x00B9  ¹  # SUPERSCRIPT ONE
0x00BA  º  # MASCULINE ORDINAL INDICATOR
0x00BB  »