ILIAS  Release_4_4_x_branch Revision 61816
 All Data Structures Namespaces Files Functions Variables Groups Pages
UtfNormal Namespace Reference

Unicode normalization routines for working with UTF-8 strings. More...

Detailed Description

Unicode normalization routines for working with UTF-8 strings.

Some of these functions are adapted from places in MediaWiki.

This script generates UniNormalData.inc from the Unicode Character Database and supplementary files.

Currently assumes that input strings are valid UTF-8!

Not as fast as I'd like, but should be usable for most purposes. UtfNormal::toNFC() will bail early if given ASCII text or text it can quickly deterimine is already normalized.

All functions can be called static.

See description of forms at http://www.unicode.org/reports/tr15/

private

Should probably merge them for consistency.

public