MediaWiki
1.34.0
|
Maintenance script to provide RDF representation of the recent changes in category tree. More...
Public Member Functions | |
__construct () | |
Default constructor. More... | |
execute () | |
Do the actual work. More... | |
getRdf () | |
Get accumulated RDF. More... | |
handleAdds (IDatabase $dbr, $output) | |
handleCategorization (IDatabase $dbr, $output) | |
Handles categorization changes. More... | |
handleDeletes (IDatabase $dbr, $output) | |
Handle category deletes. More... | |
handleEdits (IDatabase $dbr, $output) | |
Handle edits for category texts. More... | |
handleMoves (IDatabase $dbr, $output) | |
handleRestores (IDatabase $dbr, $output) | |
initialize () | |
Initialize external service classes. More... | |
updateTS ( $timestamp) | |
Generate SPARQL Update code for updating dump timestamp. More... | |
Public Member Functions inherited from Maintenance | |
checkRequiredExtensions () | |
Verify that the required extensions are installed. More... | |
cleanupChanneled () | |
Clean up channeled output. More... | |
clearParamsAndArgs () | |
Clear all params and arguments. More... | |
finalSetup () | |
Handle some last-minute setup here. More... | |
getConfig () | |
getDbType () | |
Does the script need different DB access? By default, we give Maintenance scripts normal rights to the DB. More... | |
getName () | |
Get the script's name. More... | |
globals () | |
Potentially debug globals. More... | |
isQuiet () | |
loadParamsAndArgs ( $self=null, $opts=null, $args=null) | |
Process command line arguments $mOptions becomes an array with keys set to the option names $mArgs becomes a zero-based array containing the non-option arguments. More... | |
loadSettings () | |
Generic setup for most installs. More... | |
loadWithArgv ( $argv) | |
Load params and arguments from a given array of command-line arguments. More... | |
memoryLimit () | |
Normally we disable the memory_limit when running admin scripts. More... | |
outputChanneled ( $msg, $channel=null) | |
Message outputter with channeled message support. More... | |
purgeRedundantText ( $delete=true) | |
Support function for cleaning up redundant text records. More... | |
runChild ( $maintClass, $classFile=null) | |
Run a child maintenance script. More... | |
setAgentAndTriggers () | |
Set triggers like when to try to run deferred updates. More... | |
setConfig (Config $config) | |
setDB (IMaintainableDatabase $db) | |
Sets database object to be returned by getDB(). More... | |
setup () | |
Do some sanity checking and basic setup. More... | |
updateSearchIndex ( $maxLockTime, $callback, $dbw, $results) | |
Perform a search index update with locking. More... | |
updateSearchIndexForPage ( $dbw, $pageId) | |
Update the searchindex table for a given pageid. More... | |
validateParamsAndArgs () | |
Run some validation checks on the params, etc. More... | |
Public Attributes | |
const | SPARQL_DELETE = |
Delete query. More... | |
const | SPARQL_DELETE_INSERT = |
Delete/Insert query. More... | |
const | SPARQL_INSERT = |
Insert query. More... | |
Public Attributes inherited from Maintenance | |
resource | $fileHandle |
Used when creating separate schema files. More... | |
$mDbPass | |
array | $orderedOptions = [] |
Used to read the options in the order they were passed. More... | |
const | DB_ADMIN = 2 |
const | DB_NONE = 0 |
Constants for DB access type. More... | |
const | DB_STD = 1 |
const | STDIN_ALL = 'all' |
Protected Member Functions | |
getCategoryLinksIterator (IDatabase $dbr, array $ids) | |
Get iterator for links for categories. More... | |
getChangedCatsIterator (IDatabase $dbr, $type) | |
Fetch categorization changes or edits. More... | |
getDeletedCatsIterator (IDatabase $dbr) | |
Fetch deleted categories. More... | |
getMovedCatsIterator (IDatabase $dbr) | |
Fetch moved categories. More... | |
getNewCatsIterator (IDatabase $dbr) | |
Fetch newly created categories. More... | |
getRestoredCatsIterator (IDatabase $dbr) | |
Fetch restored categories. More... | |
Protected Member Functions inherited from Maintenance | |
activateProfiler () | |
Activate the profiler (assuming $wgProfiler is set) More... | |
addArg ( $arg, $description, $required=true) | |
Add some args that are needed. More... | |
addDefaultParams () | |
Add the default parameters to the scripts. More... | |
addDescription ( $text) | |
Set the description text. More... | |
addOption ( $name, $description, $required=false, $withArg=false, $shortName=false, $multiOccurrence=false) | |
Add a parameter to the script. More... | |
adjustMemoryLimit () | |
Adjusts PHP's memory limit to better suit our needs, if needed. More... | |
afterFinalSetup () | |
Execute a callback function at the end of initialisation. More... | |
beginTransaction (IDatabase $dbw, $fname) | |
Begin a transcation on a DB. More... | |
commitTransaction (IDatabase $dbw, $fname) | |
Commit the transcation on a DB handle and wait for replica DBs to catch up. More... | |
countDown ( $seconds) | |
Count down from $seconds to zero on the terminal, with a one-second pause between showing each number. More... | |
deleteOption ( $name) | |
Remove an option. More... | |
error ( $err, $die=0) | |
Throw an error to the user. More... | |
fatalError ( $msg, $exitCode=1) | |
Output a message and terminate the current script. More... | |
getArg ( $argId=0, $default=null) | |
Get an argument. More... | |
getBatchSize () | |
Returns batch size. More... | |
getDB ( $db, $groups=[], $dbDomain=false) | |
Returns a database to be used by current maintenance script. More... | |
getDir () | |
Get the maintenance directory. More... | |
getOption ( $name, $default=null) | |
Get an option, or return the default. More... | |
getStdin ( $len=null) | |
Return input from stdin. More... | |
hasArg ( $argId=0) | |
Does a given argument exist? More... | |
hasOption ( $name) | |
Checks to see if a particular option exists. More... | |
loadSpecialVars () | |
Handle the special variables that are global to all scripts. More... | |
maybeHelp ( $force=false) | |
Maybe show the help. More... | |
output ( $out, $channel=null) | |
Throw some output to the user. More... | |
requireExtension ( $name) | |
Indicate that the specified extension must be loaded before the script can run. More... | |
rollbackTransaction (IDatabase $dbw, $fname) | |
Rollback the transcation on a DB handle. More... | |
setAllowUnregisteredOptions ( $allow) | |
Sets whether to allow unregistered options, which are options passed to a script that do not match an expected parameter. More... | |
setBatchSize ( $s=0) | |
Set the batch size. More... | |
supportsOption ( $name) | |
Checks to see if a particular option in supported. More... | |
Protected Attributes | |
int[] | $processed = [] |
List of processed page IDs, so we don't try to process same thing twice. More... | |
Protected Attributes inherited from Maintenance | |
$mAllowUnregisteredOptions = false | |
$mArgList = [] | |
$mArgs = [] | |
int | $mBatchSize = null |
Batch size. More... | |
$mDbUser | |
$mDescription = '' | |
$mInputLoaded = false | |
$mOptions = [] | |
array[] | $mParams = [] |
Array of desired/allowed params. More... | |
$mQuiet = false | |
$mSelf | |
$mShortParamsMap = [] | |
Private Member Functions | |
addIndex (BatchRowIterator $it) | |
Need to force index, somehow on terbium the optimizer chooses wrong one. More... | |
addTimestampConditions (BatchRowIterator $it, IDatabase $dbr) | |
Add timestamp limits to iterator. More... | |
getCategoriesUpdate (IDatabase $dbr, $deleteUrls, $pages, $mark) | |
Get SPARQL for updating set of categories. More... | |
getInsertRdf () | |
Get the text of SPARQL INSERT DATA clause. More... | |
setupChangesIterator (IDatabase $dbr, array $columns=[], array $extra_tables=[]) | |
Set up standard iterator for retrieving category changes. More... | |
writeCategoryData ( $row) | |
Write category data to RDF. More... | |
writeParentCategories (IDatabase $dbr, $pages) | |
Write parent data for a set of categories. More... | |
Private Attributes | |
CategoriesRdf | $categoriesRdf |
Categories RDF helper. More... | |
$endTS | |
RdfWriter | $rdfWriter |
$startTS | |
Additional Inherited Members | |
Static Public Member Functions inherited from Maintenance | |
static | getTermSize () |
Get the terminal size as a two-element array where the first element is the width (number of columns) and the second element is the height (number of rows). More... | |
static | posix_isatty ( $fd) |
Wrapper for posix_isatty() We default as considering stdin a tty (for nice readline methods) but treating stout as not a tty to avoid color codes. More... | |
static | readconsole ( $prompt='> ') |
Prompt the console for input. More... | |
static | requireTestsAutoloader () |
Call this to set up the autoloader to allow classes to be used from the tests directory. More... | |
static | setLBFactoryTriggers (LBFactory $LBFactory, Config $config) |
static | shouldExecute () |
Should we execute the maintenance script, or just allow it to be included as a standalone class? It checks that the call stack only includes this function and "requires" (meaning was called from the file scope) More... | |
Maintenance script to provide RDF representation of the recent changes in category tree.
Definition at line 31 of file categoryChangesAsRdf.php.
CategoryChangesAsRdf::__construct | ( | ) |
Default constructor.
Children should call this first if implementing their own constructors
Reimplemented from Maintenance.
Definition at line 94 of file categoryChangesAsRdf.php.
References Maintenance\addDescription(), Maintenance\addOption(), and Maintenance\setBatchSize().
|
private |
Need to force index, somehow on terbium the optimizer chooses wrong one.
BatchRowIterator | $it |
Definition at line 397 of file categoryChangesAsRdf.php.
References BatchRowIterator\addOptions().
Referenced by getChangedCatsIterator(), getDeletedCatsIterator(), getMovedCatsIterator(), and getRestoredCatsIterator().
|
private |
Add timestamp limits to iterator.
BatchRowIterator | $it | Iterator |
IDatabase | $dbr |
Definition at line 386 of file categoryChangesAsRdf.php.
References $dbr, and BatchRowIterator\addConditions().
Referenced by getDeletedCatsIterator(), and setupChangesIterator().
CategoryChangesAsRdf::execute | ( | ) |
Do the actual work.
All child classes will need to implement this
Reimplemented from Maintenance.
Definition at line 117 of file categoryChangesAsRdf.php.
References $dbr, $endTS, $output, $startTS, DB_REPLICA, Maintenance\error(), Maintenance\getConfig(), Maintenance\getDB(), Maintenance\getOption(), getRdf(), handleAdds(), handleCategorization(), handleDeletes(), handleEdits(), handleMoves(), handleRestores(), initialize(), and updateTS().
|
private |
Get SPARQL for updating set of categories.
IDatabase | $dbr | |
string[] | $deleteUrls | List of URIs to be deleted, with <> |
string[] | $pages | List of categories: id => title |
string | $mark | Marks which operation requests the query |
Definition at line 192 of file categoryChangesAsRdf.php.
References getInsertRdf(), and writeParentCategories().
Referenced by handleDeletes(), handleEdits(), and handleMoves().
|
protected |
Get iterator for links for categories.
IDatabase | $dbr | |
int[] | $ids | List of page IDs |
Definition at line 409 of file categoryChangesAsRdf.php.
References $dbr.
Referenced by writeParentCategories().
|
protected |
Fetch categorization changes or edits.
IDatabase | $dbr |
Definition at line 369 of file categoryChangesAsRdf.php.
References $type, addIndex(), NS_CATEGORY, and setupChangesIterator().
Referenced by handleCategorization(), and handleEdits().
|
protected |
Fetch deleted categories.
IDatabase | $dbr |
Definition at line 322 of file categoryChangesAsRdf.php.
References $dbr, addIndex(), addTimestampConditions(), NS_CATEGORY, and RC_LOG.
Referenced by handleDeletes().
|
private |
Get the text of SPARQL INSERT DATA clause.
Definition at line 176 of file categoryChangesAsRdf.php.
References getRdf().
Referenced by getCategoriesUpdate(), handleAdds(), and handleRestores().
|
protected |
Fetch moved categories.
IDatabase | $dbr |
Definition at line 302 of file categoryChangesAsRdf.php.
References addIndex(), NS_CATEGORY, RC_LOG, and setupChangesIterator().
Referenced by handleMoves().
|
protected |
Fetch newly created categories.
IDatabase | $dbr |
Definition at line 288 of file categoryChangesAsRdf.php.
References NS_CATEGORY, and setupChangesIterator().
Referenced by handleAdds().
CategoryChangesAsRdf::getRdf | ( | ) |
Get accumulated RDF.
Definition at line 428 of file categoryChangesAsRdf.php.
Referenced by execute(), and getInsertRdf().
|
protected |
Fetch restored categories.
IDatabase | $dbr |
Definition at line 349 of file categoryChangesAsRdf.php.
References addIndex(), NS_CATEGORY, RC_LOG, and setupChangesIterator().
Referenced by handleRestores().
CategoryChangesAsRdf::handleAdds | ( | IDatabase | $dbr, |
$output | |||
) |
IDatabase | $dbr | |
resource | $output |
Definition at line 526 of file categoryChangesAsRdf.php.
References $output, getInsertRdf(), getNewCatsIterator(), writeCategoryData(), and writeParentCategories().
Referenced by execute().
CategoryChangesAsRdf::handleCategorization | ( | IDatabase | $dbr, |
$output | |||
) |
Handles categorization changes.
IDatabase | $dbr | |
resource | $output |
Definition at line 584 of file categoryChangesAsRdf.php.
References $dbr, getChangedCatsIterator(), NS_CATEGORY, RC_CATEGORIZE, and writeCategoryData().
Referenced by execute().
CategoryChangesAsRdf::handleDeletes | ( | IDatabase | $dbr, |
$output | |||
) |
Handle category deletes.
IDatabase | $dbr | |
resource | $output | File to write the output |
Definition at line 437 of file categoryChangesAsRdf.php.
References $output, getCategoriesUpdate(), and getDeletedCatsIterator().
Referenced by execute().
CategoryChangesAsRdf::handleEdits | ( | IDatabase | $dbr, |
$output | |||
) |
Handle edits for category texts.
IDatabase | $dbr | |
resource | $output |
Definition at line 554 of file categoryChangesAsRdf.php.
References $output, getCategoriesUpdate(), getChangedCatsIterator(), RC_EDIT, and writeCategoryData().
Referenced by execute().
CategoryChangesAsRdf::handleMoves | ( | IDatabase | $dbr, |
$output | |||
) |
IDatabase | $dbr | |
resource | $output |
Definition at line 467 of file categoryChangesAsRdf.php.
References $output, getCategoriesUpdate(), getMovedCatsIterator(), NS_CATEGORY, and writeCategoryData().
Referenced by execute().
CategoryChangesAsRdf::handleRestores | ( | IDatabase | $dbr, |
$output | |||
) |
IDatabase | $dbr | |
resource | $output |
Definition at line 497 of file categoryChangesAsRdf.php.
References $output, getInsertRdf(), getRestoredCatsIterator(), writeCategoryData(), and writeParentCategories().
Referenced by execute().
CategoryChangesAsRdf::initialize | ( | ) |
Initialize external service classes.
Definition at line 111 of file categoryChangesAsRdf.php.
Referenced by execute().
|
private |
Set up standard iterator for retrieving category changes.
IDatabase | $dbr | |
string[] | $columns | List of additional fields to get |
string[] | $extra_tables | List of additional tables to join |
Definition at line 247 of file categoryChangesAsRdf.php.
References $dbr, and addTimestampConditions().
Referenced by getChangedCatsIterator(), getMovedCatsIterator(), getNewCatsIterator(), and getRestoredCatsIterator().
CategoryChangesAsRdf::updateTS | ( | $timestamp | ) |
Generate SPARQL Update code for updating dump timestamp.
string | int | $timestamp | Timestamp for last change |
Definition at line 222 of file categoryChangesAsRdf.php.
References wfTimestamp().
Referenced by execute().
|
private |
Write category data to RDF.
stdclass | $row | Database row |
Definition at line 454 of file categoryChangesAsRdf.php.
Referenced by handleAdds(), handleCategorization(), handleEdits(), handleMoves(), and handleRestores().
|
private |
Write parent data for a set of categories.
The list has the child categories.
IDatabase | $dbr | |
string[] | $pages | List of child categories: id => title |
Definition at line 211 of file categoryChangesAsRdf.php.
References getCategoryLinksIterator().
Referenced by getCategoriesUpdate(), handleAdds(), and handleRestores().
|
private |
Categories RDF helper.
Definition at line 82 of file categoryChangesAsRdf.php.
|
private |
Definition at line 85 of file categoryChangesAsRdf.php.
Referenced by execute().
|
protected |
List of processed page IDs, so we don't try to process same thing twice.
Definition at line 92 of file categoryChangesAsRdf.php.
|
private |
Definition at line 77 of file categoryChangesAsRdf.php.
|
private |
Definition at line 84 of file categoryChangesAsRdf.php.
Referenced by execute().
const CategoryChangesAsRdf::SPARQL_DELETE = |
Delete query.
Definition at line 45 of file categoryChangesAsRdf.php.
const CategoryChangesAsRdf::SPARQL_DELETE_INSERT = |
Delete/Insert query.
Definition at line 60 of file categoryChangesAsRdf.php.
const CategoryChangesAsRdf::SPARQL_INSERT = |
Insert query.
Definition at line 35 of file categoryChangesAsRdf.php.