The Unicode Standard 5.0

Front Cover
Addison-Wesley, 2007 - Computers - 1417 pages

"Hard copy versions of the Unicode Standard have been among the most crucial and most heavily used reference books in my personal library for years."

--Donald E. Knuth, The Art of Computer Programming

"For more than a decade, Unicode has been a foundation for many Microsoft products and technologies; Unicode Standard Version 5.0 will help us deliver important new benefits to users."

--Bill Gates, chairman, Microsoft Corporation

"The path W3C follows to making text on the Web truly global is Unicode."

--Sir Tim Berners-Lee, kbe, Web inventor and director of the World Wide Consortium (W3C)

"Without Unicode, Java wouldn't be Java, and the Internet would have a harder time connecting the people of the world."

--James Gosling, Inventor of Java, Sun Microsystems, Inc.

These and other software luminaries recognize that Unicode has become an indispensable tool for supporting an increasingly global marketplace (see inside for more acclaim). A comprehensive system of standards for representing alphabets throughout the world, Unicode is the basis for modern programming-- Windows, XML, Python, PERL, Mac OS, Linux--and every major search engine and browser in operation today.

New to Unicode Version 5.0
  • A stable foundation for Unicode Security Mechanisms
  • Property data for the Unicode Collation Algorithm and Common Locale Data Repository
  • Improvements to the Unicode Encoding Model for UTF-8
  • Rigorous stability of case folding and identifiers for improved interoperability and backward compatibility--enabling additional new ways to optimize code
  • A systematic framework for improved text processing for greater reliability--covering combining characters, Unicode strings, line breaking, and segmentation

This new edition of Unicode's official reference manual has been substantially updated to document the latest revisions to the Unicode Standard, with hundreds of pages of new information. It includes major revisions to text, figures, tables, definitions, and conformance clauses, and provides clear and practical answers to common questions. For the first time, the book contains the Unicode Standard Annexes, which specify vital processes such as text normalization and identifier parsing.

These improvements are so important that Version 5.0 is the basis for Microsoft's Vista generation of operating systems, and is included in upgrade plans for Google, Yahoo!, and ICU, to name but a few.

This is the one book all developers using Unicode must have.

From inside the book

Contents

List of Figures
xxxi
Figures
xxxiii
Conformance
xxxiv
Copyright

85 other sections not shown

Common terms and phrases

About the author (2007)

The Unicode Consortium is a non-profit organization founded to develop, extend and promote use of the Unicode Standard, which specifies the representation of text in modern software products and standards. The membership of the consortium represents a broad spectrum of corporations and organizations in the computer and information processing industry. The Unicode Consortium actively cooperates with many of the leading standards development organizations, such as ISO/IEC JTC1, W3C, IETF, and ECMA. The Consortium is based in Mountain View, CA, with members located around the world.

Bibliographic information