This table prepared from SGML.TXT available at ftp.unicode.org ftp://ftp.unicode.org/MAPPINGS/VENDORS/MISC/SGML.TXT (if doing ftp, try cd Public/MAPPINGS/VENDORS/MISC) original comment: # Author: John Cowan <cowan@ccil.org> # Date: 25 July 1997 # # The following table maps SGML character entities from various # public sets (namely, ISOamsa, ISOamsb, ISOamsc, ISOamsn, ISOamso, # ISOamsr, ISObox, ISOcyr1, ISOcyr2, ISOdia, ISOgrk1, ISOgrk2, # ISOgrk3, ISOgrk4, ISOlat1, ISOlat2, ISOnum, ISOpub, ISOtech, # HTMLspecial, HTMLsymbol) to corresponding Unicode characters. # # The table has four tab-separated columns: # Column 1: SGML character entity name # Column 2: SGML public entity set # Column 3: Unicode 2.0 character code # Column 4: Unicode 2.0 character name (UPPER CASE) # Entries which don't have Unicode equivalents have "0x????" # in Column 3 and a lower case description (from the public entity # set DTD) in Column 4. The mapping is not reversible, because many # distinctions are unified away in Unicode, particularly between # mathematical symbols. # # The table is sorted case-blind by SGML character entity name. # # The contents of this table are drawn from various sources, and # are in the public domain. # This test illuminating SGML character entities implementation in your browser. We sort the entities according to unicode numbers. You should see visible character if your display character set support it or some substitution string picked up from src/chrtrans/def7_uni.tbl. If you see &somename; - this name is not implemented yet, you may search for &. (Sorry, ISOgrk4 which hold a dot in its name seems to be nonvisible for most browsers. Keep in mind that this table is much wider than in HTML 4.0 draft). Leonid Pauzner. 0x0021 ! ISOnum # EXCLAMATION MARK 0x0022 " ISOnum # QUOTATION MARK 0x0023 # ISOnum # NUMBER SIGN 0x0024 $ ISOnum # DOLLAR SIGN 0x0025 % ISOnum # PERCENT SIGN 0x0026 & ISOnum # AMPERSAND 0x0028 ( ISOnum # LEFT PARENTHESIS 0x0029 ) ISOnum # RIGHT PARENTHESIS 0x002A * ISOnum # ASTERISK 0x002B + ISOnum # PLUS SIGN 0x002C , ISOnum # COMMA 0x002D ‐ ISOnum # HYPHEN-MINUS 0x002E . ISOnum # FULL STOP 0x002F / ISOnum # SOLIDUS 0x003A : ISOnum # COLON 0x003B ; ISOnum # SEMICOLON 0x003C < ISOnum # LESS-THAN SIGN 0x003D = ISOnum # EQUALS SIGN 0x003E > ISOnum # GREATER-THAN SIGN 0x003F ? ISOnum # QUESTION MARK 0x0040 @ ISOnum # COMMERCIAL AT 0x005B [ ISOnum # LEFT SQUARE BRACKET 0x005C \ ISOnum # REVERSE SOLIDUS 0x005C &sbsol; ISOamso # REVERSE SOLIDUS 0x005D ] ISOnum # RIGHT SQUARE BRACKET 0x005F _ ISOnum # LOW LINE 0x0060 ` ISOdia # GRAVE ACCENT 0x007B { ISOnum # LEFT CURLY BRACKET 0x007C | ISOnum # VERTICAL LINE 0x007D } ISOnum # RIGHT CURLY BRACKET 0x00A0 ISOnum # NO-BREAK SPACE 0x00A1 ¡ ISOnum # INVERTED EXCLAMATION MARK 0x00A2 ¢ ISOnum # CENT SIGN 0x00A3 £ ISOnum # POUND SIGN 0x00A4 ¤ ISOnum # CURRENCY SIGN 0x00A5 ¥ ISOnum # YEN SIGN 0x00A6 ¦ ISOnum # BR