Parsoid
A bidirectional parser between wikitext and HTML5
Loading...
Searching...
No Matches
Wikimedia\Parsoid\Core Namespace Reference

General token sanitizer. More...

Classes

class  ClientError
 Exception thrown on invalid client requests. More...
 
interface  ContentMetadataCollector
 Interface for collecting the results of a parse. More...
 
class  ContentMetadataCollectorStringSets
 Registry of flags used with ContentMetadataCollector::appendOutputStrings() More...
 
class  ContentModelHandler
 
class  DomPageBundle
 A page bundle stores an HTML DOM with separated data-parsoid and data-mw content. More...
 
class  DomSourceRange
 Represents a DOM source range. More...
 
class  ElementRange
 A simple pair of DOM elements. More...
 
class  InternalException
 Parsoid internal error that we don't know how to recover from. More...
 
interface  LinkTarget
 Represents the target of a wiki link. More...
 
class  MediaStructure
 All media should have a fixed structure: More...
 
class  PageBundle
 A page bundle stores an HTML string with separated data-parsoid and (optionally) data-mw content. More...
 
class  ResourceLimitExceededException
 Parsoid resource limit exception. More...
 
class  Sanitizer
 
class  SectionMetadata
 Section metadata for generating TOC. More...
 
class  SelectiveUpdateData
 Data that's necessary for selective updates (whether html->wt or wt->html). More...
 
class  SelserData
 TODO: Kept around for backwards compatibilty with uses outside this repo. More...
 
class  TOCData
 Table of Contents data, including an array of section metadata. More...
 

Detailed Description

General token sanitizer.

Strips out (or encapsulates) unsafe and disallowed tag types and attributes. Should run last in the third, synchronous expansion stage.

FIXME: This code was originally ported from PHP to JS in 2012 and periodically updated before being back to PHP. This code should be (a) resynced with core sanitizer changes (b) updated to use HTML5 spec