Parsoid
A bidirectional parser between wikitext and HTML5
Loading...
Searching...
No Matches
Wikimedia\Parsoid\Utils\DOMCompat Class Reference

Helper class that provides missing DOM level 3 methods for the PHP DOM classes. More...

Static Public Member Functions

static newDocument (bool $isHtml)
 Create a new empty document.
 
static nodeName (Node $node)
 Return the lower-case version of the node name.
 
static getTitle ( $document)
 Get document title.
 
static setTitle ( $document, string $title)
 Set document title.
 
static getElementById ( $node, string $id)
 Return the descendant with the specified ID.
 
static setIdAttribute ( $element, string $id)
 Workaround bug in PHP's Document::getElementById() which doesn't actually index the 'id' attribute unless you use the non-standard Element::setIdAttribute method after the attribute is set; see https://www.php.net/manual/en/domdocument.getelementbyid.php for more details.
 
static getElementsByTagName ( $node, string $tagName)
 Return all descendants with the specified tag name.
 
static getFirstElementChild ( $node)
 Return the first child of the node that is an Element, or null otherwise.
 
static getLastElementChild ( $node)
 Return the last child of the node that is an Element, or null otherwise.
 
static querySelector ( $node, string $selector)
 
static querySelectorAll ( $node, string $selector)
 
static getPreviousElementSibling ( $node)
 Return the last preceding sibling of the node that is an element, or null otherwise.
 
static getNextElementSibling ( $node)
 Return the first following sibling of the node that is an element, or null otherwise.
 
static append ( $parentNode,... $nodes)
 Append the node to the parent node.
 
static remove ( $node)
 Removes the node from the document.
 
static getInnerHTML ( $element)
 Get innerHTML.
 
static setInnerHTML ( $element, string $html)
 Set innerHTML.
 
static getOuterHTML ( $element)
 Get outerHTML.
 
static getAttribute ( $element, string $attributeName)
 Return the value of an element attribute.
 
static getClassList ( $node)
 Return the class list of this element.
 
static normalize ( $elt)
 
static replaceChildren ( $parentNode,... $nodes)
 ParentNode.replaceChildren() https://developer.mozilla.org/en-US/docs/Web/API/ParentNode/replaceChildren.
 
static getTemplateElementContent ( $node)
 Return HTMLTemplateElement::content.
 

Detailed Description

Helper class that provides missing DOM level 3 methods for the PHP DOM classes.

For a DOM method $node->foo( $bar) the equivalent helper is DOMCompat::foo( $node, $bar ). For a DOM property $node->foo there is a DOMCompat::getFoo( $node ) and DOMCompat::setFoo( $node, $value ).

Only implements the methods that are actually used by Parsoid.

Because this class may be used by code outside Parsoid, it tries to be relatively tolerant of object types: you can call it either with PHP's DOM* types or with a "proper" DOM implementation, and it will attempt to Do The Right Thing regardless. As a result, there are generally not parameter type hints for DOM object types, and the return types will be broad enough to accomodate the value a "real" DOM implementation would return, as well as the values our thunk will return. (For instance, we can't create a "real" NodeList in our compatibility thunk.)

Exception to the above: ::nodeName method is not so much a DOM compatibility method in the sense above, but a proxy to let us support multiple DOM libraries against the Parsoid codebase that expects lower-case names. In this specific instance the default behavior is tailored for performance vs. being HTML-standards-compliant.

Member Function Documentation

◆ append()

static Wikimedia\Parsoid\Utils\DOMCompat::append ( $parentNode,
$nodes )
static

Append the node to the parent node.

Parameters
Document | DocumentFragment | Element$parentNode
Node|string...$nodes
Note
This method was added in PHP 8.0.0

◆ getAttribute()

static Wikimedia\Parsoid\Utils\DOMCompat::getAttribute ( $element,
string $attributeName )
static

Return the value of an element attribute.

Unlike PHP's version, this is spec-compliant and returns null if the attribute is not present, allowing the caller to distinguish between "the attribute exists but has the empty string as its value" and "the attribute does not exist".

Parameters
Element$element
string$attributeName
Returns
?string The attribute value, or null if the attribute does not exist on the element.
See also
https://dom.spec.whatwg.org/#dom-element-getattribute

◆ getClassList()

static Wikimedia\Parsoid\Utils\DOMCompat::getClassList ( $node)
static

Return the class list of this element.

Parameters
Element$node
Returns
TokenList
See also
https://dom.spec.whatwg.org/#dom-element-classlist

◆ getElementById()

static Wikimedia\Parsoid\Utils\DOMCompat::getElementById ( $node,
string $id )
static

Return the descendant with the specified ID.

Workaround for https://bugs.php.net/bug.php?id=77686 and other issues related to inconsistent indexing behavior. XXX: 77686 is fixed in php 8.1.21

Parameters
Document | DocumentFragment$node
string$id
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-nonelementparentnode-getelementbyid

◆ getElementsByTagName()

static Wikimedia\Parsoid\Utils\DOMCompat::getElementsByTagName ( $node,
string $tagName )
static

Return all descendants with the specified tag name.

Workaround for PHP's getElementsByTagName being inexplicably slow in some situations and the lack of Element::getElementsByTagName().

Parameters
Document | Element$node
string$tagName
Returns
(iterable<Element>&\Countable)|array<Element> Either an array or an HTMLCollection object
See also
https://dom.spec.whatwg.org/#dom-document-getelementsbytagname
https://dom.spec.whatwg.org/#dom-element-getelementsbytagname
Note
Note that unlike the spec this method is not guaranteed to return a NodeList (which cannot be freely constructed in PHP), just a traversable containing Elements.

◆ getFirstElementChild()

static Wikimedia\Parsoid\Utils\DOMCompat::getFirstElementChild ( $node)
static

Return the first child of the node that is an Element, or null otherwise.

Parameters
Document | DocumentFragment | Element$node
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-parentnode-firstelementchild
Note
This property was added to PHP in 8.0.0, and won't be needed once our minimum required version >= 8.0.0

◆ getInnerHTML()

static Wikimedia\Parsoid\Utils\DOMCompat::getInnerHTML ( $element)
static

Get innerHTML.

See also
DOMUtils::getFragmentInnerHTML() for the fragment version
Parameters
Element$element
Returns
string
See also
https://w3c.github.io/DOM-Parsing/#dom-innerhtml-innerhtml

◆ getLastElementChild()

static Wikimedia\Parsoid\Utils\DOMCompat::getLastElementChild ( $node)
static

Return the last child of the node that is an Element, or null otherwise.

Parameters
Document | DocumentFragment | Element$node
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-parentnode-lastelementchild
Note
This property was added to PHP in 8.0.0, and won't be needed once our minimum required version >= 8.0.0

◆ getNextElementSibling()

static Wikimedia\Parsoid\Utils\DOMCompat::getNextElementSibling ( $node)
static

Return the first following sibling of the node that is an element, or null otherwise.

Parameters
Node$node
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-nondocumenttypechildnode-nextelementsibling

◆ getOuterHTML()

static Wikimedia\Parsoid\Utils\DOMCompat::getOuterHTML ( $element)
static

Get outerHTML.

Parameters
Element$element
Returns
string
See also
https://w3c.github.io/DOM-Parsing/#dom-element-outerhtml

◆ getPreviousElementSibling()

static Wikimedia\Parsoid\Utils\DOMCompat::getPreviousElementSibling ( $node)
static

Return the last preceding sibling of the node that is an element, or null otherwise.

Parameters
Node$node
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-nondocumenttypechildnode-previouselementsibling

◆ getTemplateElementContent()

static Wikimedia\Parsoid\Utils\DOMCompat::getTemplateElementContent ( $node)
static

Return HTMLTemplateElement::content.

In the PHP DOM, <template> elements do not have a dedicated DocumentFragment and children are stored directly under the Element. In the HTML5 spec, the contents are stored in a DocumentFragment with a unique owner document.

Bridge this gap by returning the <template> element for PHP's DOM, or the DocumentFragment for an HTML5-compliant DOM.

Parameters
Element$nodeA <template> element
Returns
Element|DocumentFragment Either the element (for PHP compat) or the DocumentFragment which is the template's "content"

◆ getTitle()

static Wikimedia\Parsoid\Utils\DOMCompat::getTitle ( $document)
static

Get document title.

Parameters
Document$document
Returns
string
See also
https://html.spec.whatwg.org/multipage/dom.html#document.title

◆ newDocument()

static Wikimedia\Parsoid\Utils\DOMCompat::newDocument ( bool $isHtml)
static

Create a new empty document.

This is abstracted because the process is a little different depending on whether we're using Dodo or DOMDocument, and phan gets a little confused by this.

Parameters
bool$isHtml
Returns
Document

◆ nodeName()

static Wikimedia\Parsoid\Utils\DOMCompat::nodeName ( Node $node)
static

Return the lower-case version of the node name.

FIXME: HTML says this should be capitalized, but we are tailoring this to the DOM libraries that Parsoid uses that return lower-case names.

◆ normalize()

static Wikimedia\Parsoid\Utils\DOMCompat::normalize ( $elt)
static
Parameters
Element | DocumentFragment$eltroot of the DOM tree that needs to be normalized

◆ querySelector()

static Wikimedia\Parsoid\Utils\DOMCompat::querySelector ( $node,
string $selector )
static
Parameters
Document | DocumentFragment | Element$node
string$selector
Returns
Element|null
See also
https://dom.spec.whatwg.org/#dom-parentnode-queryselector

◆ querySelectorAll()

static Wikimedia\Parsoid\Utils\DOMCompat::querySelectorAll ( $node,
string $selector )
static
Parameters
Document | DocumentFragment | Element$node
string$selector
Returns
(iterable<Element>&\Countable)|array<Element> Either a NodeList or an array
See also
https://dom.spec.whatwg.org/#dom-parentnode-queryselectorall
Note
Note that unlike the spec this method is not guaranteed to return a NodeList (which cannot be freely constructed in PHP), just a traversable containing Elements.

◆ remove()

static Wikimedia\Parsoid\Utils\DOMCompat::remove ( $node)
static

Removes the node from the document.

Parameters
Element | CharacterData$node
See also
https://dom.spec.whatwg.org/#dom-childnode-remove

◆ replaceChildren()

static Wikimedia\Parsoid\Utils\DOMCompat::replaceChildren ( $parentNode,
$nodes )
static

ParentNode.replaceChildren() https://developer.mozilla.org/en-US/docs/Web/API/ParentNode/replaceChildren.

Parameters
Document | DocumentFragment | Element$parentNode
string|Node...$nodes

◆ setIdAttribute()

static Wikimedia\Parsoid\Utils\DOMCompat::setIdAttribute ( $element,
string $id )
static

Workaround bug in PHP's Document::getElementById() which doesn't actually index the 'id' attribute unless you use the non-standard Element::setIdAttribute method after the attribute is set; see https://www.php.net/manual/en/domdocument.getelementbyid.php for more details.

Parameters
Element$element
string$idThe desired value for the id attribute on $element.
See also
https://phabricator.wikimedia.org/T232390

◆ setInnerHTML()

static Wikimedia\Parsoid\Utils\DOMCompat::setInnerHTML ( $element,
string $html )
static

Set innerHTML.

See also
https://w3c.github.io/DOM-Parsing/#dom-innerhtml-innerhtml
DOMUtils::setFragmentInnerHTML() for the fragment version
Parameters
Element$element
string$html

◆ setTitle()

static Wikimedia\Parsoid\Utils\DOMCompat::setTitle ( $document,
string $title )
static

Set document title.

Parameters
Document$document
string$title
See also
https://html.spec.whatwg.org/multipage/dom.html#document.title

The documentation for this class was generated from the following file: