MediaWiki master
WikiExporter Class Reference
Collaboration diagram for WikiExporter:

Public Member Functions

 __construct ( $db, CommentStore $commentStore, HookContainer $hookContainer, RevisionStore $revisionStore, TitleParser $titleParser, $history=self::CURRENT, $text=self::TEXT, $limitNamespaces=null)
 
 allLogs ()
 
 allPages ()
 Dumps a series of page and revision records for all pages in the database, either including complete history or only the most recent version.
 
 closeStream ()
 
 logsByRange ( $start, $end)
 
 openStream ()
 
 pageByName ( $name)
 
 pageByTitle (PageIdentity $page)
 
 pagesByName ( $names)
 
 pagesByRange ( $start, $end, $orderRevs)
 Dumps a series of page and revision records for those pages in the database falling within the page_id range given.
 
 revsByRange ( $start, $end)
 Dumps a series of page and revision records for those pages in the database with revisions falling within the rev_id range given.
 
 setOutputSink (&$sink)
 Set the DumpOutput or DumpFilter object which will receive various row objects and XML output for filtering.
 
 setSchemaVersion ( $schemaVersion)
 

Static Public Member Functions

static schemaVersion ()
 Returns the default export schema version, as defined by the XmlDumpSchemaVersion setting.
 

Public Attributes

string $author_list = ""
 
bool $dumpUploadFileContents = false
 
bool $dumpUploads = false
 
bool $list_authors = false
 Return distinct author list (when not returning full history)
 
DumpOutput $sink
 
int $text
 
const CURRENT = 2
 
const FULL = 1
 
const LOGS = 8
 
const RANGE = 16
 
const STABLE = 4
 
const STUB = XmlDumpWriter::WRITE_STUB
 
const TEXT = XmlDumpWriter::WRITE_CONTENT
 

Protected Member Functions

 do_list_authors ( $cond)
 Generates the distinct list of authors of an article Not called by default (depends on $this->list_authors) Can be set by Special:Export when not exporting whole history.
 
 dumpFrom ( $cond='', $orderRevs=false)
 
 dumpLogs ( $cond)
 
 dumpPages ( $cond, $orderRevs)
 
 finishPageStreamOutput ( $lastRow)
 Final page stream output, after all batches are complete.
 
 getSlotRowBatch ( $results, &$carry=null)
 Returns all slot rows for a revision.
 
 outputLogStream ( $resultset)
 
 outputPageStreamBatch ( $results, $lastRow)
 Runs through a query result set dumping page, revision, and slot records.
 

Protected Attributes

IReadableDatabase $db
 
array int $history
 
array null $limitNamespaces
 
const BATCH_SIZE = 50000
 

Detailed Description

Definition at line 47 of file WikiExporter.php.

Constructor & Destructor Documentation

◆ __construct()

WikiExporter::__construct (   $db,
CommentStore  $commentStore,
HookContainer  $hookContainer,
RevisionStore  $revisionStore,
TitleParser  $titleParser,
  $history = self::CURRENT,
  $text = self::TEXT,
  $limitNamespaces = null 
)
Parameters
IReadableDatabase$db
CommentStore$commentStore
HookContainer$hookContainer
RevisionStore$revisionStore
TitleParser$titleParser
int | array$historyOne of WikiExporter::FULL, WikiExporter::CURRENT, WikiExporter::RANGE or WikiExporter::STABLE, or an associative array:
  • offset: non-inclusive offset at which to start the query
  • limit: maximum number of rows to return
  • dir: "asc" or "desc" timestamp order
int$textOne of WikiExporter::TEXT or WikiExporter::STUB
null | array$limitNamespacesList of namespace numbers to limit results

Definition at line 124 of file WikiExporter.php.

References $limitNamespaces.

Member Function Documentation

◆ allLogs()

WikiExporter::allLogs ( )

Definition at line 260 of file WikiExporter.php.

References dumpFrom().

◆ allPages()

WikiExporter::allPages ( )

Dumps a series of page and revision records for all pages in the database, either including complete history or only the most recent version.

Definition at line 186 of file WikiExporter.php.

References dumpFrom().

◆ closeStream()

WikiExporter::closeStream ( )

Definition at line 176 of file WikiExporter.php.

◆ do_list_authors()

WikiExporter::do_list_authors (   $cond)
protected

Generates the distinct list of authors of an article Not called by default (depends on $this->list_authors) Can be set by Special:Export when not exporting whole history.

Parameters
string$cond

Definition at line 283 of file WikiExporter.php.

Referenced by dumpPages().

◆ dumpFrom()

WikiExporter::dumpFrom (   $cond = '',
  $orderRevs = false 
)
protected
Parameters
string$cond
bool$orderRevs

Definition at line 311 of file WikiExporter.php.

References dumpLogs(), and dumpPages().

Referenced by allLogs(), allPages(), logsByRange(), pageByName(), pageByTitle(), pagesByRange(), and revsByRange().

◆ dumpLogs()

WikiExporter::dumpLogs (   $cond)
protected
Parameters
string$cond

Definition at line 322 of file WikiExporter.php.

References outputLogStream().

Referenced by dumpFrom().

◆ dumpPages()

WikiExporter::dumpPages (   $cond,
  $orderRevs 
)
protected
Parameters
string$cond
bool$orderRevs

Definition at line 376 of file WikiExporter.php.

References do_list_authors(), finishPageStreamOutput(), and outputPageStreamBatch().

Referenced by dumpFrom().

◆ finishPageStreamOutput()

WikiExporter::finishPageStreamOutput (   $lastRow)
protected

Final page stream output, after all batches are complete.

Parameters
stdClass$lastRowthe last row output from the last batch (or null if none)

Definition at line 603 of file WikiExporter.php.

Referenced by dumpPages().

◆ getSlotRowBatch()

WikiExporter::getSlotRowBatch (   $results,
$carry = null 
)
protected

Returns all slot rows for a revision.

Takes and returns a carry row from the last batch;

Parameters
IResultWrapper | array$results
null | stdClass&$carryA row carried over from the last call to getSlotRowBatch()
Returns
stdClass[]

Definition at line 576 of file WikiExporter.php.

Referenced by outputPageStreamBatch().

◆ logsByRange()

WikiExporter::logsByRange (   $start,
  $end 
)
Parameters
int$start
int$end

Definition at line 268 of file WikiExporter.php.

References dumpFrom().

◆ openStream()

WikiExporter::openStream ( )

Definition at line 171 of file WikiExporter.php.

◆ outputLogStream()

WikiExporter::outputLogStream (   $resultset)
protected
Parameters
IResultWrapper$resultset
Returns
int|null the log_id value of the last item output, or null if none

Definition at line 617 of file WikiExporter.php.

Referenced by dumpLogs().

◆ outputPageStreamBatch()

WikiExporter::outputPageStreamBatch (   $results,
  $lastRow 
)
protected

Runs through a query result set dumping page, revision, and slot records.

The result set should join the page, revision, slots, and content tables, and be sorted/grouped by page and revision to avoid duplicate page records in the output.

Parameters
IResultWrapper$results
stdClass | null$lastRowthe last row output from the previous call (or null if none)
Returns
stdClass the last row processed

Definition at line 516 of file WikiExporter.php.

References getSlotRowBatch().

Referenced by dumpPages().

◆ pageByName()

WikiExporter::pageByName (   $name)
Parameters
string$name

Definition at line 240 of file WikiExporter.php.

References dumpFrom().

Referenced by pagesByName().

◆ pageByTitle()

WikiExporter::pageByTitle ( PageIdentity  $page)
Parameters
PageIdentity$page

Definition at line 231 of file WikiExporter.php.

References dumpFrom(), MediaWiki\Page\PageReference\getDBkey(), and MediaWiki\Page\PageReference\getNamespace().

◆ pagesByName()

WikiExporter::pagesByName (   $names)
Parameters
string[]$names

Definition at line 254 of file WikiExporter.php.

References pageByName().

◆ pagesByRange()

WikiExporter::pagesByRange (   $start,
  $end,
  $orderRevs 
)

Dumps a series of page and revision records for those pages in the database falling within the page_id range given.

Parameters
int$startInclusive lower limit (this id is included)
int$endExclusive upper limit (this id is not included) If 0, no upper limit.
bool$orderRevsorder revisions within pages in ascending order

Definition at line 198 of file WikiExporter.php.

References dumpFrom().

◆ revsByRange()

WikiExporter::revsByRange (   $start,
  $end 
)

Dumps a series of page and revision records for those pages in the database with revisions falling within the rev_id range given.

Parameters
int$startInclusive lower limit (this id is included)
int$endExclusive upper limit (this id is not included) If 0, no upper limit.

Definition at line 220 of file WikiExporter.php.

References dumpFrom().

◆ schemaVersion()

static WikiExporter::schemaVersion ( )
static

Returns the default export schema version, as defined by the XmlDumpSchemaVersion setting.

Returns
string

Definition at line 105 of file WikiExporter.php.

◆ setOutputSink()

WikiExporter::setOutputSink ( $sink)

Set the DumpOutput or DumpFilter object which will receive various row objects and XML output for filtering.

Filters can be chained or used as callbacks.

Parameters
DumpOutput | DumpFilter&$sink

Definition at line 167 of file WikiExporter.php.

◆ setSchemaVersion()

WikiExporter::setSchemaVersion (   $schemaVersion)
Parameters
string$schemaVersionwhich schema version the generated XML should comply to. One of the values from self::$supportedSchemas, using the XML_DUMP_SCHEMA_VERSION_XX constants.

Definition at line 156 of file WikiExporter.php.

Member Data Documentation

◆ $author_list

string WikiExporter::$author_list = ""

Definition at line 58 of file WikiExporter.php.

◆ $db

IReadableDatabase WikiExporter::$db
protected

Definition at line 81 of file WikiExporter.php.

◆ $dumpUploadFileContents

bool WikiExporter::$dumpUploadFileContents = false

Definition at line 55 of file WikiExporter.php.

◆ $dumpUploads

bool WikiExporter::$dumpUploads = false

Definition at line 52 of file WikiExporter.php.

◆ $history

array int WikiExporter::$history
protected

Definition at line 84 of file WikiExporter.php.

◆ $limitNamespaces

array null WikiExporter::$limitNamespaces
protected

Definition at line 87 of file WikiExporter.php.

Referenced by __construct().

◆ $list_authors

bool WikiExporter::$list_authors = false

Return distinct author list (when not returning full history)

Definition at line 49 of file WikiExporter.php.

◆ $sink

DumpOutput WikiExporter::$sink

Definition at line 75 of file WikiExporter.php.

◆ $text

int WikiExporter::$text

Definition at line 72 of file WikiExporter.php.

◆ BATCH_SIZE

const WikiExporter::BATCH_SIZE = 50000
protected

Definition at line 69 of file WikiExporter.php.

◆ CURRENT

const WikiExporter::CURRENT = 2

Definition at line 61 of file WikiExporter.php.

◆ FULL

const WikiExporter::FULL = 1

Definition at line 60 of file WikiExporter.php.

◆ LOGS

const WikiExporter::LOGS = 8

Definition at line 63 of file WikiExporter.php.

◆ RANGE

const WikiExporter::RANGE = 16

Definition at line 64 of file WikiExporter.php.

◆ STABLE

const WikiExporter::STABLE = 4

Definition at line 62 of file WikiExporter.php.

◆ STUB

const WikiExporter::STUB = XmlDumpWriter::WRITE_STUB

Definition at line 67 of file WikiExporter.php.

◆ TEXT

const WikiExporter::TEXT = XmlDumpWriter::WRITE_CONTENT

Definition at line 66 of file WikiExporter.php.


The documentation for this class was generated from the following file: