ILIAS  trunk Revision v11.0_alpha-3011-gc6b235a2e85
Sanitizer Class Reference

Static Public Member Functions

static decodeCharReferences ($text)
 Decode any character references, numeric or named entities, in the text and return a UTF-8 string.
 
static decodeCharReferencesCallback ($matches)
 
static decodeChar ($codepoint)
 Return UTF-8 string for a codepoint if that is a valid character reference, otherwise U+FFFD REPLACEMENT CHARACTER.
 
static decodeEntity ($name)
 If the named entity is defined in the HTML 4.0/XHTML 1.0 DTD, return the UTF-8 encoding of that character.
 

Static Private Member Functions

static validateCodepoint ($codepoint)
 Returns true if a given Unicode codepoint is a valid character in XML.
 

Detailed Description

Definition at line 347 of file Sanitizer.php.

Member Function Documentation

◆ decodeChar()

static Sanitizer::decodeChar (   $codepoint)
static

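Return the UTF-8 string for a codepoint if it is a valid character reference; otherwise return U+FFFD REPLACEMENT CHARACTER. In the listing below, the codepointToUtf8() call is commented out, so the valid-codepoint branch returns an empty string.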

Parameters
int $codepoint
Returns
string

Definition at line 408 of file Sanitizer.php.

{
    if (Sanitizer::validateCodepoint($codepoint)) {
        return "";
        //return codepointToUtf8($codepoint);

References validateCodepoint() (Sanitizer.php:354).
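
The listing above names a codepointToUtf8() helper that this page does not show. As a rough illustration of the documented behaviour, the sketch below encodes a codepoint by hand using the standard UTF-8 bit layout and falls back to U+FFFD otherwise. The names sketchCodepointToUtf8 and sketchDecodeChar are hypothetical, and the validity check is passed in as a callable so that the example does not presume how validateCodepoint() is implemented.

```php
<?php
// Hypothetical stand-in for the codepointToUtf8() helper named in the listing.
// Assumes 0 <= $codepoint <= 0x10FFFF.
function sketchCodepointToUtf8(int $codepoint): string
{
    if ($codepoint < 0x80) {
        // 1 byte: 0xxxxxxx
        return chr($codepoint);
    }
    if ($codepoint < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        return chr(0xc0 | ($codepoint >> 6))
            . chr(0x80 | ($codepoint & 0x3f));
    }
    if ($codepoint < 0x10000) {
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return chr(0xe0 | ($codepoint >> 12))
            . chr(0x80 | (($codepoint >> 6) & 0x3f))
            . chr(0x80 | ($codepoint & 0x3f));
    }
    // 4 bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return chr(0xf0 | ($codepoint >> 18))
        . chr(0x80 | (($codepoint >> 12) & 0x3f))
        . chr(0x80 | (($codepoint >> 6) & 0x3f))
        . chr(0x80 | ($codepoint & 0x3f));
}

// Hypothetical counterpart of decodeChar(): $isValid is an assumed predicate
// supplied by the caller, not the ILIAS validateCodepoint() method.
function sketchDecodeChar(int $codepoint, callable $isValid): string
{
    return $isValid($codepoint)
        ? sketchCodepointToUtf8($codepoint)
        : "\u{FFFD}";
}
```

With a predicate that accepts 0x20AC, sketchDecodeChar(0x20AC, $isValid) yields the three-byte sequence E2 82 AC (the euro sign); a rejected codepoint yields EF BF BD.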

◆ decodeCharReferences()

static Sanitizer::decodeCharReferences (   $text)
static

Decode any character references, numeric or named entities, in the text and return a UTF-8 string.

Parameters
string $text
Returns
string

Definition at line 373 of file Sanitizer.php.

{
    return preg_replace_callback(

References MW_CHAR_REFS_REGEX (Sanitizer.php:22).

Referenced by Title\newFromText().

◆ decodeCharReferencesCallback()

static Sanitizer::decodeCharReferencesCallback (   $matches)
static
Parameters
array $matches
Returns
string

Definition at line 386 of file Sanitizer.php.

{
    if ($matches[1] != '') {
        return Sanitizer::decodeEntity($matches[1]);
    } elseif ($matches[2] != '') {
        return Sanitizer::decodeChar(intval($matches[2]));
    } elseif ($matches[3] != '') {
        return Sanitizer::decodeChar(hexdec($matches[3]));
    } elseif ($matches[4] != '') {
        return Sanitizer::decodeChar(hexdec($matches[4]));

References decodeChar() (Sanitizer.php:408) and decodeEntity() (Sanitizer.php:426).
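
The callback above branches on which capture group matched: group 1 carries an entity name, group 2 decimal digits, and groups 3 and 4 hexadecimal digits. The sketch below illustrates that dispatch with preg_replace_callback(). The pattern SKETCH_CHAR_REFS_REGEX is an assumption made for this example, since the real MW_CHAR_REFS_REGEX is not shown on this page, and sketchDispatch is a hypothetical function that only reports which branch was taken instead of decoding.

```php
<?php
// Hypothetical pattern with four capture groups:
// 1 = entity name, 2 = decimal digits,
// 3 = hex digits after "&#x", 4 = hex digits after "&#X".
const SKETCH_CHAR_REFS_REGEX =
    '/&([A-Za-z0-9]+);|&#([0-9]+);|&#x([0-9A-Fa-f]+);|&#X([0-9A-Fa-f]+);/';

// Hypothetical callback: reports the branch taken rather than decoding.
function sketchDispatch(array $matches): string
{
    // PCRE omits unmatched trailing groups from $matches, hence the "?? ''".
    if (($matches[1] ?? '') !== '') {
        return 'entity(' . $matches[1] . ')';
    }
    if (($matches[2] ?? '') !== '') {
        return 'char(' . intval($matches[2]) . ')';
    }
    if (($matches[3] ?? '') !== '') {
        return 'char(' . hexdec($matches[3]) . ')';
    }
    if (($matches[4] ?? '') !== '') {
        return 'char(' . hexdec($matches[4]) . ')';
    }
    // Nothing recognised: leave the matched text untouched.
    return $matches[0];
}

function sketchDescribeCharReferences(string $text): string
{
    return preg_replace_callback(SKETCH_CHAR_REFS_REGEX, 'sketchDispatch', $text);
}
```

Under these assumptions, the decimal and both hexadecimal spellings of the same reference reach the same codepoint branch, while a named reference is routed to the entity branch.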

◆ decodeEntity()

static Sanitizer::decodeEntity (   $name)
static

If the named entity is defined in the HTML 4.0/XHTML 1.0 DTD, return the UTF-8 encoding of that character.

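Otherwise, return the pseudo-entity source (e.g. &foo;). In the listing below, the codepointToUtf8() call is commented out, so a defined entity yields an empty string.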

Parameters
string $name
Returns
string

Definition at line 426 of file Sanitizer.php.

{
    if (isset($wgHtmlEntityAliases[$name])) {
        $name = $wgHtmlEntityAliases[$name];
    }
    if (isset($wgHtmlEntities[$name])) {
        return "";
        //return codepointToUtf8($wgHtmlEntities[$name]);

References $wgHtmlEntities (Sanitizer.php:55, the list of all named character entities defined in HTML 4.01, http://www.w3.org/TR/html4/sgml/entities....) and $wgHtmlEntityAliases (Sanitizer.php:313, the character entity aliases accepted by MediaWiki).
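
The lookup order described above (resolve an alias first, then look the name up in the entity table, otherwise hand back the pseudo-entity source) can be sketched as follows. The two tables are tiny hypothetical excerpts, the alias 'eur' is invented for the example, and sketchDecodeEntity is not part of ILIAS. The core function html_entity_decode() is used here only as a convenient way to turn a codepoint into UTF-8.

```php
<?php
// Hypothetical excerpt of a name => codepoint table.
const SKETCH_HTML_ENTITIES = [
    'amp'  => 38,
    'nbsp' => 160,
    'euro' => 8364,
];

// Hypothetical alias table; 'eur' is invented for this example.
const SKETCH_HTML_ENTITY_ALIASES = [
    'eur' => 'euro',
];

function sketchDecodeEntity(string $name): string
{
    // Step 1: map an alias to its canonical entity name.
    $canonical = SKETCH_HTML_ENTITY_ALIASES[$name] ?? $name;

    // Step 2: known entity, so return its UTF-8 encoding.
    if (isset(SKETCH_HTML_ENTITIES[$canonical])) {
        $reference = '&#' . SKETCH_HTML_ENTITIES[$canonical] . ';';
        return html_entity_decode($reference, ENT_QUOTES, 'UTF-8');
    }

    // Step 3: unknown entity, so return the pseudo-entity source unchanged.
    return '&' . $name . ';';
}
```

For instance, sketchDecodeEntity('foo') returns the literal text &foo;, while 'euro' and the invented alias 'eur' both resolve to the euro sign.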

◆ validateCodepoint()

static Sanitizer::validateCodepoint (   $codepoint)
static private

Returns true if a given Unicode codepoint is a valid character in XML.

Parameters
int $codepoint
Returns
bool

Definition at line 354 of file Sanitizer.php.

{
    return ($codepoint == 0x09)
        || ($codepoint == 0x0a)
        || ($codepoint == 0x0d)
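
The listing above is cut off after the first three comparisons. For orientation, the XML 1.0 specification defines a valid character (the Char production) as tab, line feed, carriage return, or a codepoint in the ranges 0x20 to 0xD7FF, 0xE000 to 0xFFFD, or 0x10000 to 0x10FFFF. The sketch below checks that full production, on the assumption that this is what the method is meant to cover; sketchIsXmlChar is a hypothetical name and not the ILIAS code.

```php
<?php
// Hypothetical check against the XML 1.0 Char production.
function sketchIsXmlChar(int $codepoint): bool
{
    return $codepoint === 0x09                                  // tab
        || $codepoint === 0x0a                                  // line feed
        || $codepoint === 0x0d                                  // carriage return
        || ($codepoint >= 0x20 && $codepoint <= 0xd7ff)         // below the surrogates
        || ($codepoint >= 0xe000 && $codepoint <= 0xfffd)       // above the surrogates
        || ($codepoint >= 0x10000 && $codepoint <= 0x10ffff);   // supplementary planes
}
```

Under this definition, control characters such as 0x00 and 0x0B, the surrogate range, and the noncharacters 0xFFFE and 0xFFFF are all rejected.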

The documentation for this class was generated from the file Sanitizer.php.