MediaWiki master
WikiExporter Class Reference
Collaboration diagram for WikiExporter:

Public Member Functions

 __construct ( $db, CommentStore $commentStore, HookContainer $hookContainer, RevisionStore $revisionStore, TitleParser $titleParser, $history=self::CURRENT, $text=self::TEXT, $limitNamespaces=null)
 
 allLogs ()
 
 allPages ()
 Dumps a series of page and revision records for all pages in the database, either including complete history or only the most recent version.
 
 closeStream ()
 
 logsByRange ( $start, $end)
 
 openStream ()
 
 pageByName ( $name)
 
 pageByTitle (PageIdentity $page)
 
 pagesByName ( $names)
 
 pagesByRange ( $start, $end, $orderRevs)
 Dumps a series of page and revision records for those pages in the database falling within the page_id range given.
 
 revsByRange ( $start, $end)
 Dumps a series of page and revision records for those pages in the database with revisions falling within the rev_id range given.
 
 setOutputSink (&$sink)
 Set the DumpOutput or DumpFilter object which will receive various row objects and XML output for filtering.
 
 setSchemaVersion ( $schemaVersion)
 

Static Public Member Functions

static schemaVersion ()
 Returns the default export schema version, as defined by the XmlDumpSchemaVersion setting.
 

Public Attributes

string $author_list = ""
 
bool $dumpUploadFileContents = false
 
bool $dumpUploads = false
 
bool $list_authors = false
 Return distinct author list (when not returning full history)
 
DumpOutput $sink
 
int $text
 
const CURRENT = 2
 
const FULL = 1
 
const LOGS = 8
 
const RANGE = 16
 
const STABLE = 4
 
const STUB = XmlDumpWriter::WRITE_STUB
 
const TEXT = XmlDumpWriter::WRITE_CONTENT
 

Protected Member Functions

 do_list_authors ( $cond)
 Generates the distinct list of authors of an article Not called by default (depends on $this->list_authors) Can be set by Special:Export when not exporting whole history.
 
 dumpFrom ( $cond='', $orderRevs=false)
 
 dumpLogs ( $cond)
 
 dumpPages ( $cond, $orderRevs)
 
 finishPageStreamOutput ( $lastRow)
 Final page stream output, after all batches are complete.
 
 getSlotRowBatch ( $results, &$carry=null)
 Returns all slot rows for a revision.
 
 outputLogStream ( $resultset)
 
 outputPageStreamBatch ( $results, $lastRow)
 Runs through a query result set dumping page, revision, and slot records.
 

Protected Attributes

IReadableDatabase $db
 
array int $history
 
array null $limitNamespaces
 
const BATCH_SIZE = 10000
 

Detailed Description

Definition at line 48 of file WikiExporter.php.

Constructor & Destructor Documentation

◆ __construct()

WikiExporter::__construct ( $db,
CommentStore $commentStore,
HookContainer $hookContainer,
RevisionStore $revisionStore,
TitleParser $titleParser,
$history = self::CURRENT,
$text = self::TEXT,
$limitNamespaces = null )
Parameters
IReadableDatabase$db
CommentStore$commentStore
HookContainer$hookContainer
RevisionStore$revisionStore
TitleParser$titleParser
int | array$historyOne of WikiExporter::FULL, WikiExporter::CURRENT, WikiExporter::RANGE or WikiExporter::STABLE, or an associative array:
  • offset: non-inclusive offset at which to start the query
  • limit: maximum number of rows to return
  • dir: "asc" or "desc" timestamp order
int$textOne of WikiExporter::TEXT or WikiExporter::STUB
null | array$limitNamespacesList of namespace numbers to limit results

Definition at line 125 of file WikiExporter.php.

References $limitNamespaces.

Member Function Documentation

◆ allLogs()

WikiExporter::allLogs ( )

Definition at line 261 of file WikiExporter.php.

References dumpFrom().

◆ allPages()

WikiExporter::allPages ( )

Dumps a series of page and revision records for all pages in the database, either including complete history or only the most recent version.

Definition at line 187 of file WikiExporter.php.

References dumpFrom().

◆ closeStream()

WikiExporter::closeStream ( )

Definition at line 177 of file WikiExporter.php.

◆ do_list_authors()

WikiExporter::do_list_authors ( $cond)
protected

Generates the distinct list of authors of an article Not called by default (depends on $this->list_authors) Can be set by Special:Export when not exporting whole history.

Parameters
string$cond

Definition at line 284 of file WikiExporter.php.

Referenced by dumpPages().

◆ dumpFrom()

WikiExporter::dumpFrom ( $cond = '',
$orderRevs = false )
protected
Parameters
string$cond
bool$orderRevs

Definition at line 312 of file WikiExporter.php.

References dumpLogs(), and dumpPages().

Referenced by allLogs(), allPages(), logsByRange(), pageByName(), pageByTitle(), pagesByRange(), and revsByRange().

◆ dumpLogs()

WikiExporter::dumpLogs ( $cond)
protected
Parameters
string$cond

Definition at line 323 of file WikiExporter.php.

References outputLogStream().

Referenced by dumpFrom().

◆ dumpPages()

WikiExporter::dumpPages ( $cond,
$orderRevs )
protected
Parameters
string$cond
bool$orderRevs

Definition at line 368 of file WikiExporter.php.

References do_list_authors(), finishPageStreamOutput(), and outputPageStreamBatch().

Referenced by dumpFrom().

◆ finishPageStreamOutput()

WikiExporter::finishPageStreamOutput ( $lastRow)
protected

Final page stream output, after all batches are complete.

Parameters
stdClass$lastRowthe last row output from the last batch (or null if none)

Definition at line 596 of file WikiExporter.php.

Referenced by dumpPages().

◆ getSlotRowBatch()

WikiExporter::getSlotRowBatch ( $results,
& $carry = null )
protected

Returns all slot rows for a revision.

Takes and returns a carry row from the last batch;

Parameters
IResultWrapper | array$results
null | stdClass&$carryA row carried over from the last call to getSlotRowBatch()
Returns
stdClass[]

Definition at line 567 of file WikiExporter.php.

Referenced by outputPageStreamBatch().

◆ logsByRange()

WikiExporter::logsByRange ( $start,
$end )
Parameters
int$start
int$end

Definition at line 269 of file WikiExporter.php.

References dumpFrom().

◆ openStream()

WikiExporter::openStream ( )

Definition at line 172 of file WikiExporter.php.

◆ outputLogStream()

WikiExporter::outputLogStream ( $resultset)
protected
Parameters
IResultWrapper$resultset
Returns
int|null the log_id value of the last item output, or null if none

Definition at line 610 of file WikiExporter.php.

Referenced by dumpLogs().

◆ outputPageStreamBatch()

WikiExporter::outputPageStreamBatch ( $results,
$lastRow )
protected

Runs through a query result set dumping page, revision, and slot records.

The result set should join the page, revision, slots, and content tables, and be sorted/grouped by page and revision to avoid duplicate page records in the output.

Parameters
IResultWrapper$results
stdClass | null$lastRowthe last row output from the previous call (or null if none)
Returns
stdClass the last row processed

Definition at line 507 of file WikiExporter.php.

References getSlotRowBatch().

Referenced by dumpPages().

◆ pageByName()

WikiExporter::pageByName ( $name)
Parameters
string$name

Definition at line 241 of file WikiExporter.php.

References dumpFrom().

Referenced by pagesByName().

◆ pageByTitle()

WikiExporter::pageByTitle ( PageIdentity $page)
Parameters
PageIdentity$page

Definition at line 232 of file WikiExporter.php.

References dumpFrom(), MediaWiki\Page\PageReference\getDBkey(), and MediaWiki\Page\PageReference\getNamespace().

◆ pagesByName()

WikiExporter::pagesByName ( $names)
Parameters
string[]$names

Definition at line 255 of file WikiExporter.php.

References pageByName().

◆ pagesByRange()

WikiExporter::pagesByRange ( $start,
$end,
$orderRevs )

Dumps a series of page and revision records for those pages in the database falling within the page_id range given.

Parameters
int$startInclusive lower limit (this id is included)
int$endExclusive upper limit (this id is not included) If 0, no upper limit.
bool$orderRevsorder revisions within pages in ascending order

Definition at line 199 of file WikiExporter.php.

References dumpFrom().

◆ revsByRange()

WikiExporter::revsByRange ( $start,
$end )

Dumps a series of page and revision records for those pages in the database with revisions falling within the rev_id range given.

Parameters
int$startInclusive lower limit (this id is included)
int$endExclusive upper limit (this id is not included) If 0, no upper limit.

Definition at line 221 of file WikiExporter.php.

References dumpFrom().

◆ schemaVersion()

static WikiExporter::schemaVersion ( )
static

Returns the default export schema version, as defined by the XmlDumpSchemaVersion setting.

Returns
string

Definition at line 106 of file WikiExporter.php.

Referenced by MediaWiki\Api\ApiQuery\getAllowedParams().

◆ setOutputSink()

WikiExporter::setOutputSink ( & $sink)

Set the DumpOutput or DumpFilter object which will receive various row objects and XML output for filtering.

Filters can be chained or used as callbacks.

Parameters
DumpOutput | DumpFilter&$sink

Definition at line 168 of file WikiExporter.php.

◆ setSchemaVersion()

WikiExporter::setSchemaVersion ( $schemaVersion)
Parameters
string$schemaVersionwhich schema version the generated XML should comply to. One of the values from self::$supportedSchemas, using the XML_DUMP_SCHEMA_VERSION_XX constants.

Definition at line 157 of file WikiExporter.php.

Member Data Documentation

◆ $author_list

string WikiExporter::$author_list = ""

Definition at line 59 of file WikiExporter.php.

◆ $db

IReadableDatabase WikiExporter::$db
protected

Definition at line 82 of file WikiExporter.php.

◆ $dumpUploadFileContents

bool WikiExporter::$dumpUploadFileContents = false

Definition at line 56 of file WikiExporter.php.

◆ $dumpUploads

bool WikiExporter::$dumpUploads = false

Definition at line 53 of file WikiExporter.php.

◆ $history

array int WikiExporter::$history
protected

Definition at line 85 of file WikiExporter.php.

◆ $limitNamespaces

array null WikiExporter::$limitNamespaces
protected

Definition at line 88 of file WikiExporter.php.

Referenced by __construct().

◆ $list_authors

bool WikiExporter::$list_authors = false

Return distinct author list (when not returning full history)

Definition at line 50 of file WikiExporter.php.

◆ $sink

DumpOutput WikiExporter::$sink

Definition at line 76 of file WikiExporter.php.

◆ $text

int WikiExporter::$text

Definition at line 73 of file WikiExporter.php.

◆ BATCH_SIZE

const WikiExporter::BATCH_SIZE = 10000
protected

Definition at line 70 of file WikiExporter.php.

◆ CURRENT

const WikiExporter::CURRENT = 2

Definition at line 62 of file WikiExporter.php.

◆ FULL

const WikiExporter::FULL = 1

Definition at line 61 of file WikiExporter.php.

◆ LOGS

const WikiExporter::LOGS = 8

Definition at line 64 of file WikiExporter.php.

◆ RANGE

const WikiExporter::RANGE = 16

Definition at line 65 of file WikiExporter.php.

◆ STABLE

const WikiExporter::STABLE = 4

Definition at line 63 of file WikiExporter.php.

◆ STUB

const WikiExporter::STUB = XmlDumpWriter::WRITE_STUB

Definition at line 68 of file WikiExporter.php.

◆ TEXT

const WikiExporter::TEXT = XmlDumpWriter::WRITE_CONTENT

Definition at line 67 of file WikiExporter.php.


The documentation for this class was generated from the following file: