ILIAS
Release_4_4_x_branch Revision 61816
|
Unicode normalization routines for working with UTF-8 strings. More...
Unicode normalization routines for working with UTF-8 strings.
Some of these functions are adapted from places in MediaWiki.
This script generates UniNormalData.inc from the Unicode Character Database and supplementary files.
Currently assumes that input strings are valid UTF-8!
Not as fast as I'd like, but should be usable for most purposes. UtfNormal::toNFC() will bail early if given ASCII text or text it can quickly deterimine is already normalized.
All functions can be called static.
See description of forms at http://www.unicode.org/reports/tr15/
private
Should probably merge them for consistency.
public