ILIAS trunk Revision v11.0_alpha-1769-g99a433fe2dc
Sanitizer Class Reference
Static Public Member Functions

static decodeCharReferences ($text)
 Decode any character references, numeric or named entities, in the text and return a UTF-8 string.

static decodeCharReferencesCallback ($matches)

static decodeChar ($codepoint)
 Return UTF-8 string for a codepoint if that is a valid character reference, otherwise U+FFFD REPLACEMENT CHARACTER.

static decodeEntity ($name)
 If the named entity is defined in the HTML 4.0/XHTML 1.0 DTD, return the UTF-8 encoding of that character.


Static Private Member Functions

static validateCodepoint ($codepoint)
 Returns true if a given Unicode codepoint is a valid character in XML.


Detailed Description

Definition at line 347 of file Sanitizer.php.

Member Function Documentation

◆ decodeChar()

static Sanitizer::decodeChar ($codepoint)
static

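Return the UTF-8 string for a codepoint if it is a valid character reference; otherwise return U+FFFD REPLACEMENT CHARACTER.

Note that in the listing for this revision the call to codepointToUtf8() is commented out, so the valid-codepoint branch returns an empty string.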

Parameters
int $codepoint
Returns
string

Definition at line 408 of file Sanitizer.php.

{
    if (Sanitizer::validateCodepoint($codepoint)) {
        return "";
        //return codepointToUtf8($codepoint);
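
The listing above has the codepointToUtf8() call commented out, so it does not show how a codepoint becomes a UTF-8 string. The following is a minimal sketch of that conversion, assuming the standard UTF-8 encoding rules. The name sketchCodepointToUtf8() is hypothetical and not part of Sanitizer.php, and its validity check (Unicode range, no surrogates) is an assumption that is looser than validateCodepoint().

```php
<?php
// Hypothetical helper for illustration only; not part of Sanitizer.php.
function sketchCodepointToUtf8(int $cp): string
{
    // Assumed validity rule: inside the Unicode range and not a surrogate.
    if ($cp < 0 || $cp > 0x10FFFF || ($cp >= 0xD800 && $cp <= 0xDFFF)) {
        return "\xEF\xBF\xBD"; // U+FFFD REPLACEMENT CHARACTER in UTF-8
    }
    if ($cp < 0x80) {
        // One byte: 0xxxxxxx
        return chr($cp);
    }
    if ($cp < 0x800) {
        // Two bytes: 110xxxxx 10xxxxxx
        return chr(0xC0 | ($cp >> 6))
            . chr(0x80 | ($cp & 0x3F));
    }
    if ($cp < 0x10000) {
        // Three bytes: 1110xxxx 10xxxxxx 10xxxxxx
        return chr(0xE0 | ($cp >> 12))
            . chr(0x80 | (($cp >> 6) & 0x3F))
            . chr(0x80 | ($cp & 0x3F));
    }
    // Four bytes: 11110xxx 10xxxxxx 10xxxxxx 10xxxxxx
    return chr(0xF0 | ($cp >> 18))
        . chr(0x80 | (($cp >> 12) & 0x3F))
        . chr(0x80 | (($cp >> 6) & 0x3F))
        . chr(0x80 | ($cp & 0x3F));
}
```

For example, bin2hex(sketchCodepointToUtf8(0xE9)) yields c3a9, the two-byte UTF-8 encoding of U+00E9.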

◆ decodeCharReferences()

static Sanitizer::decodeCharReferences ($text)
static

Decode any character references, numeric or named entities, in the text and return a UTF-8 string.

Parameters
string $text
Returns
string

Definition at line 373 of file Sanitizer.php.

Referenced by Title\newFromText().

{
    return preg_replace_callback(
const MW_CHAR_REFS_REGEX
Definition: Sanitizer.php:22

◆ decodeCharReferencesCallback()

static Sanitizer::decodeCharReferencesCallback ($matches)
static
Parameters
array $matches
Returns
string

Definition at line 386 of file Sanitizer.php.

{
    if ($matches[1] != '') {
        return Sanitizer::decodeEntity($matches[1]);
    } elseif ($matches[2] != '') {
        return Sanitizer::decodeChar(intval($matches[2]));
    } elseif ($matches[3] != '') {
        return Sanitizer::decodeChar(hexdec($matches[3]));
    } elseif ($matches[4] != '') {
        return Sanitizer::decodeChar(hexdec($matches[4]));
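
The callback above dispatches on which capture group matched: group 1 for a named entity, group 2 for a decimal reference, groups 3 and 4 for hexadecimal references. The sketch below illustrates that dispatch with preg_replace_callback(). The pattern SKETCH_CHAR_REFS_REGEX and the function sketchDecodeCallback() are hypothetical stand-ins, since the actual MW_CHAR_REFS_REGEX is not shown on this page, and the callback returns tagged placeholders instead of decoded characters so that the chosen branch is visible.

```php
<?php
// Hypothetical pattern with four capture groups; NOT the real MW_CHAR_REFS_REGEX.
const SKETCH_CHAR_REFS_REGEX =
    '/&([A-Za-z0-9]+);|&#([0-9]+);|&#x([0-9A-Fa-f]+);|&#X([0-9A-Fa-f]+);/';

// Hypothetical callback mirroring the branch order of the listing above.
function sketchDecodeCallback(array $m): string
{
    // preg_replace_callback() omits trailing groups that did not match,
    // so default the missing ones to the empty string.
    $m += [1 => '', 2 => '', 3 => '', 4 => ''];

    if ($m[1] !== '') {
        return '[entity:' . $m[1] . ']';              // named entity
    } elseif ($m[2] !== '') {
        return sprintf('[U+%04X]', intval($m[2]));    // decimal reference
    } elseif ($m[3] !== '') {
        return sprintf('[U+%04X]', hexdec($m[3]));    // hex reference, lowercase x
    } elseif ($m[4] !== '') {
        return sprintf('[U+%04X]', hexdec($m[4]));    // hex reference, uppercase X
    }
    return $m[0];                                     // leave anything else untouched
}

function sketchDecodeCharReferences(string $text): string
{
    return preg_replace_callback(SKETCH_CHAR_REFS_REGEX, 'sketchDecodeCallback', $text);
}
```

With these assumptions, sketchDecodeCharReferences('&amp; &#65; &#x41;') returns '[entity:amp] [U+0041] [U+0041]'.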

◆ decodeEntity()

static Sanitizer::decodeEntity ($name)
static

If the named entity is defined in the HTML 4.0/XHTML 1.0 DTD, return the UTF-8 encoding of that character.

Otherwise, returns the pseudo-entity source (e.g. &foo;).

Parameters
string $name
Returns
string

Definition at line 426 of file Sanitizer.php.

{
    if (isset($wgHtmlEntityAliases[$name])) {
        $name = $wgHtmlEntityAliases[$name];
    }
    if (isset($wgHtmlEntities[$name])) {
        return "";
        //return codepointToUtf8($wgHtmlEntities[$name]);
global $wgHtmlEntities
List of all named character entities defined in HTML 4.01 http://www.w3.org/TR/html4/sgml/entities.html.
Definition: Sanitizer.php:55
global $wgHtmlEntityAliases
Character entity aliases accepted by MediaWiki.
Definition: Sanitizer.php:313
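
The lookup described above resolves an alias first and then consults the entity table. A minimal sketch of that two-step lookup follows. The tables SKETCH_ENTITIES and SKETCH_ALIASES are tiny hypothetical stand-ins for $wgHtmlEntities and $wgHtmlEntityAliases (the alias 'und' is made up), the name-to-codepoint layout is an assumption, and the fallback branch follows the documented behavior of returning the pseudo-entity source.

```php
<?php
// Hypothetical miniature tables; the real ones are far larger.
const SKETCH_ENTITIES = ['amp' => 38, 'lt' => 60, 'gt' => 62, 'eacute' => 233];
const SKETCH_ALIASES  = ['und' => 'amp']; // made-up alias for illustration

// Hypothetical function; not part of Sanitizer.php.
function sketchDecodeEntity(string $name): string
{
    // Step 1: map an alias to its canonical entity name, if one exists.
    $name = SKETCH_ALIASES[$name] ?? $name;

    // Step 2: look up the codepoint for the canonical name.
    $codepoint = SKETCH_ENTITIES[$name] ?? null;
    if ($codepoint !== null) {
        // Core-only way to turn a codepoint into UTF-8:
        // decode the equivalent numeric character reference.
        return html_entity_decode('&#' . $codepoint . ';', ENT_QUOTES | ENT_HTML5, 'UTF-8');
    }

    // Unknown name: return the pseudo-entity source unchanged.
    return "&$name;";
}
```

Under these assumptions, sketchDecodeEntity('und') and sketchDecodeEntity('amp') both return '&', while sketchDecodeEntity('foo') returns '&foo;'.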

◆ validateCodepoint()

static Sanitizer::validateCodepoint ($codepoint)
static private

Returns true if a given Unicode codepoint is a valid character in XML.

Parameters
int $codepoint
Returns
bool

Definition at line 354 of file Sanitizer.php.

{
    return ($codepoint == 0x09)
        || ($codepoint == 0x0a)
        || ($codepoint == 0x0d)
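
The listing above shows only the first three comparisons (tab, line feed, carriage return). As a sketch of what a complete check could look like, the function below follows the Char production of XML 1.0, which also admits U+0020 to U+D7FF, U+E000 to U+FFFD, and U+10000 to U+10FFFF. The name sketchIsXmlChar() is hypothetical, and the remaining ranges are an assumption taken from the XML specification, not from Sanitizer.php.

```php
<?php
// Hypothetical check based on the XML 1.0 "Char" production;
// the ranges after the first three comparisons are not shown in the source listing.
function sketchIsXmlChar(int $codepoint): bool
{
    return ($codepoint === 0x09)                              // tab
        || ($codepoint === 0x0A)                              // line feed
        || ($codepoint === 0x0D)                              // carriage return
        || ($codepoint >= 0x20 && $codepoint <= 0xD7FF)       // BMP below the surrogates
        || ($codepoint >= 0xE000 && $codepoint <= 0xFFFD)     // BMP above the surrogates
        || ($codepoint >= 0x10000 && $codepoint <= 0x10FFFF); // supplementary planes
}
```

Under this rule, control characters such as U+0000 and U+000B, the surrogate range, and U+FFFE and U+FFFF are rejected.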

The documentation for this class was generated from the following file: