Unicode is an industry standard that allows computers to consistently represent and manipulate text written in most of the world's writing systems. In text processing, Unicode assigns each character a unique code point (a number, not a glyph). In other words, Unicode represents a character abstractly and leaves the visual rendering (size, shape, font, or style) to other software, such as a web browser or word processor.
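The distinction between a character's code point and its appearance can be illustrated with a minimal Python sketch. It uses only the standard library; the helper name `describe` and the sample string are assumptions made for illustration and do not come from any of the materials listed on this page. UTF-8 is shown as one possible byte encoding of a code point.

```python
import unicodedata


def describe(text):
    """Return one (character, code point, name, UTF-8 bytes) tuple per character.

    `describe` is a hypothetical helper written for this illustration.
    """
    rows = []
    for ch in text:
        code_point = f"U+{ord(ch):04X}"              # the abstract number Unicode assigns
        name = unicodedata.name(ch, "<unnamed>")     # the character's formal Unicode name
        utf8_bytes = ch.encode("utf-8")              # one possible byte serialization
        rows.append((ch, code_point, name, utf8_bytes))
    return rows


if __name__ == "__main__":
    # Sample string: LATIN CAPITAL LETTER A, e with acute, EURO SIGN
    for row in describe("A\u00e9\u20ac"):
        print(row)
```

Nothing in the result says how a character should be drawn: the same code point, for example U+00E9, looks different in every font, while its number and name stay fixed.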
- Slides of a talk that Dr. Romary gave on encoding of languages, locales, and characters: PDF
- Current view on UTF-8 within PubMan: PDF
- Slides of Jost Gippert's talk on Unicode at the eScience Seminar on 25 October 2007: PDF