Unicode

Short introduction
Unicode is an industry standard allowing computers to consistently represent and manipulate text expressed in most of the world's writing systems. In text processing, Unicode takes the role of providing a unique code point — a number, not a glyph — for each character. In other words, Unicode represents a character in an abstract way and leaves the visual rendering (size, shape, font or style) to other software, such as a web browser or word processor.

Info material

 * Slides of a talk that Dr. Romary gave on encoding of languages, locales, and characters: [[Media:Talk_laurent_1.2-Malta-BasicStandards_V2_UNICODE.pdf|PDF]]
 * Current view on UTF-8 within PubMan: [[Media:UTF_8_pubman.pdf|PDF]]
 * Slides of Jost Gippert's talk on Unicode during the eScience Seminar at 25 October 2007: [[Media:ESci_Standards_Unicode_Gippert.pdf|PDF]]

Tools

 * Tools for Unicode on Titus (Univ. Frankfurt)

External resources

 * The unicode standard
 * UTF-8 and Unicode Standards
 * Information about Unicode in Wikipedia
 * Problems with Unicode (in Japan)