ILIAS
Release_4_2_x_branch Revision 61807
|
Experimental HTML5-based parser using Jeroen van der Meer's PH5P library. More...
Public Member Functions | |
tokenizeHTML ($html, $config, $context) | |
Lexes an HTML string into tokens. | |
Public Member Functions inherited from HTMLPurifier_Lexer_DOMLex | |
__construct () | |
muteErrorHandler ($errno, $errstr) | |
An error handler that mutes all errors. | |
callbackUndoCommentSubst ($matches) | |
Callback function for undoing escaping of stray angled brackets in comments. | |
callbackArmorCommentEntities ($matches) | |
Callback function that entity-izes ampersands in comments so that callbackUndoCommentSubst doesn't clobber them. | |
Public Member Functions inherited from HTMLPurifier_Lexer | |
parseData ($string) | |
Parses special entities into the proper characters. | |
normalize ($html, $config, $context) | |
Takes a piece of HTML and normalizes it by converting entities, fixing encoding, extracting bits, and other good stuff. | |
extractBody ($html) | |
Takes a string of HTML (fragment or document) and returns the content. |
Additional Inherited Members | |
Static Public Member Functions inherited from HTMLPurifier_Lexer | |
static | create ($config) |
Retrieves or sets the default Lexer as a Prototype Factory. | |
Data Fields inherited from HTMLPurifier_Lexer | |
$tracksLineNumbers = false | |
Whether or not this lexer implements line-number/column-number tracking. | |
Protected Member Functions inherited from HTMLPurifier_Lexer_DOMLex | |
tokenizeDOM ($node, &$tokens) | |
Iterative function that tokenizes a node, putting it into an accumulator. | |
createStartNode ($node, &$tokens, $collect) | |
createEndNode ($node, &$tokens) | |
transformAttrToAssoc ($node_map) | |
Converts a DOMNamedNodeMap of DOMAttr objects into an assoc array. | |
wrapHTML ($html, $config, $context) | |
Wraps an HTML fragment in the necessary HTML. | |
Static Protected Member Functions inherited from HTMLPurifier_Lexer | |
static | escapeCDATA ($string) |
Translates CDATA sections into regular sections (through escaping). | |
static | escapeCommentedCDATA ($string) |
Special CDATA case that is especially convoluted for <script> | |
static | removeIEConditional ($string) |
Special Internet Explorer conditional comments should be removed. | |
static | CDATACallback ($matches) |
Callback function for escapeCDATA() that does the work. | |
Protected Attributes inherited from HTMLPurifier_Lexer | |
$_special_entity2str | |
Most common entity to raw value conversion table for special entities. |
Experimental HTML5-based parser using Jeroen van der Meer's PH5P library.
Occupies space in the HTML5 pseudo-namespace, which may cause conflicts.
HTMLPurifier_Lexer_PH5P::tokenizeHTML | ( | $string, | |
$config, | |||
$context | |||
) |
Lexes an HTML string into tokens.
$string | String HTML. |
Reimplemented from HTMLPurifier_Lexer_DOMLex.
Definition at line 15 of file PH5P.php.
References $config, HTMLPurifier_Lexer\normalize(), HTMLPurifier_Lexer_DOMLex\tokenizeDOM(), and HTMLPurifier_Lexer_DOMLex\wrapHTML().