ILIAS
release_5-3 Revision v5.3.23-19-g915713cf615
Parser that uses PHP 5's DOM extension (part of the core).
Public Member Functions
__construct ()
tokenizeHTML ($html, $config, $context)
muteErrorHandler ($errno, $errstr)
An error handler that mutes all errors.
callbackUndoCommentSubst ($matches)
Callback function for undoing escaping of stray angled brackets in comments.
callbackArmorCommentEntities ($matches)
Callback function that entity-izes ampersands in comments so that callbackUndoCommentSubst doesn't clobber them.
Public Member Functions inherited from HTMLPurifier_Lexer
__construct ()
parseText ($string, $config)
parseAttr ($string, $config)
parseData ($string, $is_attr, $config)
Parses special entities into the proper characters.
tokenizeHTML ($string, $config, $context)
Lexes an HTML string into tokens.
normalize ($html, $config, $context)
Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff.
extractBody ($html)
Takes a string of HTML (fragment or document) and returns the content.
Protected Member Functions
tokenizeDOM ($node, &$tokens, $config)
Iterative function that tokenizes a node, putting it into an accumulator.
createStartNode ($node, &$tokens, $collect, $config)
createEndNode ($node, &$tokens)
transformAttrToAssoc ($node_map)
Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array.
wrapHTML ($html, $config, $context, $use_div=true)
Wraps an HTML fragment in the necessary HTML.
Private Attributes
$factory
HTMLPurifier_TokenFactory
Additional Inherited Members
Static Public Member Functions inherited from HTMLPurifier_Lexer
static create ($config)
Retrieves or sets the default Lexer as a Prototype Factory.
Data Fields inherited from HTMLPurifier_Lexer
$tracksLineNumbers = false
Whether or not this lexer implements line-number/column-number tracking.
Static Protected Member Functions inherited from HTMLPurifier_Lexer
static escapeCDATA ($string)
Translates CDATA sections into regular sections (through escaping).
static escapeCommentedCDATA ($string)
Special CDATA case that is especially convoluted for <script>
static removeIEConditional ($string)
Special Internet Explorer conditional comments should be removed.
static CDATACallback ($matches)
Callback function for escapeCDATA() that does the work.
Protected Attributes inherited from HTMLPurifier_Lexer
$_special_entity2str
Most common entity to raw value conversion table for special entities.
Parser that uses PHP 5's DOM extension (part of the core).
In PHP 5, the DOM XML extension was revamped into DOM and added to the core. It gives us a forgiving HTML parser, which we use to transform the HTML into a DOM and then into tokens. It is blazingly fast (for large documents, it performs twenty times faster than HTMLPurifier_Lexer_DirectLex) and is the default choice for PHP 5.
Definition at line 27 of file DOMLex.php.
HTMLPurifier_Lexer_DOMLex::__construct ()
Definition at line 35 of file DOMLex.php.
HTMLPurifier_Lexer_DOMLex::callbackArmorCommentEntities ($matches)
Callback function that entity-izes ampersands in comments so that callbackUndoCommentSubst doesn't clobber them.
Parameters:
array $matches
Definition at line 251 of file DOMLex.php.
HTMLPurifier_Lexer_DOMLex::callbackUndoCommentSubst ($matches)
Callback function for undoing escaping of stray angled brackets in comments.
Parameters:
array $matches
Definition at line 240 of file DOMLex.php.
References array.
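The two comment callbacks work as a pair around a step that escapes stray "<" characters. The following is a minimal sketch of that round trip, not the class's actual code: the function names, the regular expressions, and the sample input are assumptions chosen for illustration, and the sketch runs a single pass only.

```php
<?php
// Assumed pattern: group 1 is the comment body, group 2 is the
// terminator ("-->") or the end of the input.
$comment = '/<!--(.*?)(-->|\z)/is';

// Entity-ize ampersands inside comments so that a literal "&lt;" written
// by the author is not confused with an escaped bracket later on.
function armor_comment_entities($matches)
{
    return '<!--' . str_replace('&', '&amp;', $matches[1]) . $matches[2];
}

// Undo both the bracket escaping and the ampersand armoring in comments.
function undo_comment_subst($matches)
{
    return '<!--' . strtr($matches[1], array('&amp;' => '&', '&lt;' => '<')) . $matches[2];
}

$original = '<!-- a < b &lt; c -->';

// 1. Armor the ampersands that are already inside comments.
$html = preg_replace_callback($comment, 'armor_comment_entities', $original);
// 2. Escape every "<" that cannot start a tag (this also touches comments).
$html = preg_replace('/<([^a-z!\/])/i', '&lt;\\1', $html);
// 3. Restore the comment bodies to what the author wrote.
$html = preg_replace_callback($comment, 'undo_comment_subst', $html);

echo $html, "\n";
```

Without the armoring step, the author's own "&lt;" inside the comment would be turned into "<" by the undo step, which is the clobbering the description refers to.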
HTMLPurifier_Lexer_DOMLex::createEndNode ($node, &$tokens) [protected]
Parameters:
DOMNode $node
HTMLPurifier_Token[] $tokens
Definition at line 198 of file DOMLex.php.
Referenced by tokenizeDOM().
HTMLPurifier_Lexer_DOMLex::createStartNode ($node, &$tokens, $collect, $config) [protected]
Parameters:
DOMNode $node: DOMNode to be tokenized.
HTMLPurifier_Token[] $tokens: Array-list of already tokenized tokens.
bool $collect: Says whether or not the start and close tokens are collected; set to false at first recursion because it's the implicit DIV tag you're dealing with.
Definition at line 138 of file DOMLex.php.
References $config, $data, array, HTMLPurifier_Lexer\parseText(), and transformAttrToAssoc().
Referenced by tokenizeDOM().
HTMLPurifier_Lexer_DOMLex::muteErrorHandler ($errno, $errstr)
An error handler that mutes all errors.
Parameters:
int $errno
string $errstr
Definition at line 230 of file DOMLex.php.
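A handler of this kind is typically installed only for the duration of a parse, because the DOM extension reports malformed markup as PHP warnings. The sketch below illustrates that pattern with a free function; the name mute_error_handler and the sample markup are assumptions, and the snippet does not reproduce how the class itself wires the handler in.

```php
<?php
// Hypothetical stand-in for muteErrorHandler(): accept the error and
// do nothing with it.
function mute_error_handler($errno, $errstr)
{
    return true; // a non-false return tells PHP the error was handled
}

$doc = new DOMDocument();

// Install the muting handler only around the parse, then restore the
// previous handler so unrelated errors stay visible.
set_error_handler('mute_error_handler');
$loaded = $doc->loadHTML('<html><body><div><p>Unclosed <b>tag</p></div></body></html>');
restore_error_handler();

$div = $doc->getElementsByTagName('div')->item(0);
echo $div->nodeName, "\n";
```

Restoring the handler immediately after the call keeps the muting as narrow as possible.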
HTMLPurifier_Lexer_DOMLex::tokenizeDOM ($node, &$tokens, $config) [protected]
Iterative function that tokenizes a node, putting it into an accumulator.
"To iterate is human, to recurse divine" - L. Peter Deutsch
Parameters:
DOMNode $node: DOMNode to be tokenized.
HTMLPurifier_Token[] $tokens: Array-list of already tokenized tokens.
Definition at line 99 of file DOMLex.php.
References $config, array, createEndNode(), and createStartNode().
Referenced by HTMLPurifier_Lexer_PH5P\tokenizeHTML(), and tokenizeHTML().
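To show what tokenizing a DOM tree iteratively can look like, here is a minimal sketch that walks a subtree with an explicit stack instead of recursion. It is an illustration under stated assumptions, not the method's actual algorithm: the function name, the plain-array token shape, and the decision to emit tokens for the root element are all hypothetical (the real method produces HTMLPurifier_Token objects and treats the wrapping DIV specially via $collect).

```php
<?php
// Hypothetical helper: flattens a DOM subtree into simple token arrays
// such as ['start', name], ['text', data] and ['end', name].
function tokenize_iteratively(DOMNode $root)
{
    $tokens = array();
    // Each stack entry is a node plus a flag; true means "emit its end token".
    $stack = array(array($root, false));
    while ($stack) {
        list($node, $closing) = array_pop($stack);
        if ($closing) {
            $tokens[] = array('end', $node->nodeName);
            continue;
        }
        if ($node->nodeType === XML_TEXT_NODE) {
            $tokens[] = array('text', $node->nodeValue);
            continue;
        }
        if ($node->nodeType !== XML_ELEMENT_NODE) {
            continue; // comments and other node types are skipped in this sketch
        }
        $tokens[] = array('start', $node->nodeName);
        $stack[] = array($node, true); // popped once all children are done
        // Push children in reverse so that the first child is popped first.
        for ($i = $node->childNodes->length - 1; $i >= 0; $i--) {
            $stack[] = array($node->childNodes->item($i), false);
        }
    }
    return $tokens;
}

$doc = new DOMDocument();
$doc->loadXML('<div>a<b>c</b></div>');
$tokens = tokenize_iteratively($doc->documentElement);
foreach ($tokens as $token) {
    echo implode(':', $token), "\n";
}
```

An explicit stack keeps memory use independent of PHP's call depth, which matters for deeply nested input.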
HTMLPurifier_Lexer_DOMLex::tokenizeHTML ($html, $config, $context)
Parameters:
string $html
HTMLPurifier_Config $config
HTMLPurifier_Context $context
Definition at line 48 of file DOMLex.php.
References $comment, $config, $html, $old, array, HTMLPurifier_Lexer\normalize(), tokenizeDOM(), and wrapHTML().
HTMLPurifier_Lexer_DOMLex::transformAttrToAssoc ($node_map) [protected]
Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array.
Parameters:
DOMNamedNodeMap $node_map: DOMNamedNodeMap of DOMAttr objects.
Definition at line 210 of file DOMLex.php.
References array.
Referenced by createStartNode().
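The conversion described here can be sketched with the DOM extension alone. The function name attrs_to_assoc and the sample element are assumptions for illustration; the snippet shows the general idea of the mapping rather than the method's exact body.

```php
<?php
// Hypothetical stand-alone version of the conversion: attribute names
// become array keys, attribute values become array values.
function attrs_to_assoc($node_map)
{
    // Guard against nodes that expose no attribute map, or an empty one.
    if ($node_map === null || $node_map->length === 0) {
        return array();
    }
    $assoc = array();
    foreach ($node_map as $attr) {
        $assoc[$attr->name] = $attr->value;
    }
    return $assoc;
}

$doc = new DOMDocument();
$doc->loadXML('<a href="http://example.com/" title="Example">link</a>');
$attrs = attrs_to_assoc($doc->documentElement->attributes);
print_r($attrs);
```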
HTMLPurifier_Lexer_DOMLex::wrapHTML ($html, $config, $context, $use_div=true) [protected]
Wraps an HTML fragment in the necessary HTML.
Parameters:
string $html
HTMLPurifier_Config $config
HTMLPurifier_Context $context
Definition at line 263 of file DOMLex.php.
References $config, $def, $html, and $ret.
Referenced by HTMLPurifier_Lexer_PH5P\tokenizeHTML(), and tokenizeHTML().
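As a rough illustration of what wrapping a fragment involves, the sketch below surrounds the input with a minimal document skeleton and, optionally, a DIV. Everything in it is an assumption made for the example: the function name, the charset declaration, and the omission of $config and $context (and of any doctype handling) are simplifications, not a description of the method's real output.

```php
<?php
// Hypothetical simplified wrapper; it ignores configuration entirely.
function wrap_fragment($html, $use_div = true)
{
    $ret = '<html><head>';
    // Declare the encoding so the parser does not have to guess it.
    $ret .= '<meta http-equiv="Content-Type" content="text/html; charset=utf-8" />';
    $ret .= '</head><body>';
    // The optional DIV gives the fragment a single, known parent element.
    $ret .= $use_div ? '<div>' . $html . '</div>' : $html;
    $ret .= '</body></html>';
    return $ret;
}

echo wrap_fragment('<p>Hi</p>'), "\n";
```

A known wrapper element makes it easy to locate the fragment again in the parsed tree and to leave the wrapper itself out of the token stream.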
HTMLPurifier_Lexer_DOMLex::$factory [private]
HTMLPurifier_TokenFactory
Definition at line 33 of file DOMLex.php.