MediaWiki REL1_35
FindBadBlobs Class Reference

Maintenance script for finding and marking bad content blobs.

Public Member Functions

 __construct ()
 Default constructor.
 
 execute ()
 Do the actual work. All child classes will need to implement this.
Returns
bool|null|void True for success, false for failure. Not returning a value, or returning null, is also interpreted as success. Returning false for failure will cause doMaintenance.php to exit the process with a non-zero exit status.

 
 initializeServices (?RevisionStore $revisionStore=null, ?BlobStore $blobStore=null, ?LoadBalancer $loadBalancer=null, ?LBFactory $lbFactory=null)
 
- Public Member Functions inherited from Maintenance
 checkRequiredExtensions ()
 Verify that the required extensions are installed.
 
 cleanupChanneled ()
 Clean up channeled output.
 
 clearParamsAndArgs ()
 Clear all params and arguments.
 
 finalSetup ()
 Handle some last-minute setup here.
 
 getConfig ()
 
 getDbType ()
 Does the script need different DB access? By default, we give Maintenance scripts normal rights to the DB.
 
 getName ()
 Get the script's name.
 
 globals ()
 Potentially debug globals.
 
 isQuiet ()
 
 loadParamsAndArgs ( $self=null, $opts=null, $args=null)
 Process command line arguments. $mOptions becomes an array with keys set to the option names; $mArgs becomes a zero-based array containing the non-option arguments.
 
 loadSettings ()
 Generic setup for most installs.
 
 loadWithArgv ( $argv)
 Load params and arguments from a given array of command-line arguments.
 
 memoryLimit ()
 Normally we disable the memory_limit when running admin scripts.
 
 outputChanneled ( $msg, $channel=null)
 Message outputter with channeled message support.
 
 purgeRedundantText ( $delete=true)
 Support function for cleaning up redundant text records.
 
 runChild ( $maintClass, $classFile=null)
 Run a child maintenance script.
 
 setAgentAndTriggers ()
 Set triggers like when to try to run deferred updates.
 
 setConfig (Config $config)
 
 setDB (IMaintainableDatabase $db)
 Sets database object to be returned by getDB().
 
 setup ()
 Do some sanity checking and basic setup.
 
 updateSearchIndex ( $maxLockTime, $callback, $dbw, $results)
 Perform a search index update with locking.
 
 updateSearchIndexForPage (int $pageId)
 Update the searchindex table for a given pageid.
 
 validateParamsAndArgs ()
 Run some validation checks on the params, etc. Stable for overriding.
 

Private Member Functions

 checkRevision (RevisionRecord $rev)
 
 checkSlot (RevisionRecord $rev, SlotRecord $slot)
 
 getNextRevision (int $revId, string $comp, string $dir)
 Returns the revision ID next to $revId, according to $comp and $dir.
 
 getRevisionIds ()
 
 getStartTimestamp ()
 
 handleStatus (StatusValue $status)
 
 loadArchiveByRevisionId (int $afterId, int $uptoId, $batchSize)
 
 loadRevisionsById (array $ids)
 
 loadRevisionsByTimestamp (int $afterId, string $fromTimestamp, $batchSize)
 
 markBlob (RevisionRecord $rev, SlotRecord $slot, string $error=null)
 
 scanRevisionsById (array $ids)
 
 scanRevisionsByTimestamp ( $fromTimestamp, $total)
 
 waitForReplication ()
 

Private Attributes

BlobStore|null $blobStore
 
LBFactory $lbFactory
 
LoadBalancer|null $loadBalancer
 
RevisionStore|null $revisionStore
 

Additional Inherited Members

- Static Public Member Functions inherited from Maintenance
static getTermSize ()
 Get the terminal size as a two-element array where the first element is the width (number of columns) and the second element is the height (number of rows).
 
static posix_isatty ( $fd)
 Wrapper for posix_isatty(). By default, stdin is considered a tty (for nice readline methods), but stdout is treated as not a tty to avoid color codes.
 
static readconsole ( $prompt='> ')
 Prompt the console for input.
 
static requireTestsAutoloader ()
 Call this to set up the autoloader to allow classes to be used from the tests directory.
 
static setLBFactoryTriggers (LBFactory $LBFactory, Config $config)
 
static shouldExecute ()
 Should we execute the maintenance script, or just allow it to be included as a standalone class? It checks that the call stack only includes this function and "requires" (meaning it was called from the file scope).
 
- Public Attributes inherited from Maintenance
resource $fileHandle
 Used when creating separate schema files.
 
array $orderedOptions = []
 Used to read the options in the order they were passed.
 
const DB_ADMIN = 2
 
const DB_NONE = 0
 Constants for DB access type.
 
const DB_STD = 1
 
const STDIN_ALL = 'all'
 
- Protected Member Functions inherited from Maintenance
 activateProfiler ()
 Activate the profiler (assuming $wgProfiler is set).
 
 addArg ( $arg, $description, $required=true)
 Add some args that are needed.
 
 addDefaultParams ()
 Add the default parameters to the scripts.
 
 addDescription ( $text)
 Set the description text.
 
 addOption ( $name, $description, $required=false, $withArg=false, $shortName=false, $multiOccurrence=false)
 Add a parameter to the script.
 
 adjustMemoryLimit ()
 Adjusts PHP's memory limit to better suit our needs, if needed.
 
 afterFinalSetup ()
 Execute a callback function at the end of initialisation. Stable for overriding.
 
 beginTransaction (IDatabase $dbw, $fname)
 Begin a transaction on a DB.
 
 commitTransaction (IDatabase $dbw, $fname)
 Commit the transaction on a DB handle and wait for replica DBs to catch up.
 
 countDown ( $seconds)
 Count down from $seconds to zero on the terminal, with a one-second pause between showing each number.
 
 deleteOption ( $name)
 Remove an option.
 
 error ( $err, $die=0)
 Throw an error to the user.
 
 fatalError ( $msg, $exitCode=1)
 Output a message and terminate the current script.
 
 getArg ( $argId=0, $default=null)
 Get an argument.
 
 getBatchSize ()
 Returns batch size.
 
 getDB ( $db, $groups=[], $dbDomain=false)
 Returns a database to be used by current maintenance script.
 
 getDir ()
 Get the maintenance directory.
 
 getHookContainer ()
 Get a HookContainer, for running extension hooks or for hook metadata.
 
 getHookRunner ()
 Get a HookRunner for running core hooks.
 
 getOption ( $name, $default=null)
 Get an option, or return the default.
 
 getStdin ( $len=null)
 Return input from stdin.
 
 hasArg ( $argId=0)
 Does a given argument exist?
 
 hasOption ( $name)
 Checks to see if a particular option was set.
 
 loadSpecialVars ()
 Handle the special variables that are global to all scripts. Stable for overriding.
 
 maybeHelp ( $force=false)
 Maybe show the help.
 
 output ( $out, $channel=null)
 Throw some output to the user.
 
 parseIntList ( $text)
 Utility function to parse a string (perhaps from a command line option) into a list of integers (perhaps some kind of numeric IDs).
 
 requireExtension ( $name)
 Indicate that the specified extension must be loaded before the script can run.
 
 rollbackTransaction (IDatabase $dbw, $fname)
 Roll back the transaction on a DB handle.
 
 setAllowUnregisteredOptions ( $allow)
 Sets whether to allow unregistered options, which are options passed to a script that do not match an expected parameter.
 
 setBatchSize ( $s=0)
 Set the batch size.
 
 showHelp ()
 Definitely show the help.
 
 supportsOption ( $name)
 Checks to see if a particular option is supported.
 
- Protected Attributes inherited from Maintenance
 $mAllowUnregisteredOptions = false
 
 $mArgList = []
 
 $mArgs = []
 
int $mBatchSize = null
 Batch size.
 
 $mDbPass
 
 $mDbUser
 
 $mDescription = ''
 
 $mInputLoaded = false
 
 $mOptions = []
 
array[] $mParams = []
 Array of desired/allowed params.
 
 $mQuiet = false
 
 $mSelf
 
 $mShortParamsMap = []
 

Detailed Description

Maintenance script for finding and marking bad content blobs.

Definition at line 39 of file findBadBlobs.php.

Constructor & Destructor Documentation

◆ __construct()

FindBadBlobs::__construct ( )

Default constructor.

Children should call this first if implementing their own constructors.

Stable for calling

Reimplemented from Maintenance.

Definition at line 61 of file findBadBlobs.php.

References Maintenance\addDescription(), Maintenance\addOption(), and Maintenance\setBatchSize().

Member Function Documentation

◆ checkRevision()

FindBadBlobs::checkRevision ( RevisionRecord  $rev)
private
Parameters
RevisionRecord $rev
Returns
int

Definition at line 430 of file findBadBlobs.php.

References checkSlot(), MediaWiki\Revision\RevisionRecord\getSlots(), Maintenance\hasOption(), and Maintenance\output().

Referenced by scanRevisionsById(), and scanRevisionsByTimestamp().

◆ checkSlot()

FindBadBlobs::checkSlot ( RevisionRecord  $rev,
SlotRecord  $slot 
)
private
Parameters
RevisionRecord $rev
SlotRecord $slot
Returns
int

Definition at line 449 of file findBadBlobs.php.

References $type, MediaWiki\Revision\SlotRecord\getAddress(), Maintenance\hasOption(), markBlob(), and Maintenance\output().

Referenced by checkRevision().

◆ execute()

FindBadBlobs::execute ( )

Do the actual work. All child classes will need to implement this.

Returns
bool|null|void True for success, false for failure. Not returning a value, or returning null, is also interpreted as success. Returning false for failure will cause doMaintenance.php to exit the process with a non-zero exit status.

Reimplemented from Maintenance.

Definition at line 132 of file findBadBlobs.php.

References Maintenance\fatalError(), Maintenance\getOption(), getRevisionIds(), getStartTimestamp(), Maintenance\hasOption(), initializeServices(), Maintenance\output(), scanRevisionsById(), and scanRevisionsByTimestamp().
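
The return-value rule described above can be illustrated with a minimal sketch. The helper `exitStatusFor()` is hypothetical and is not part of MediaWiki; the concrete failure status 1 is also an assumption, since the documentation only promises a non-zero exit status.

```php
<?php
// Hypothetical helper, not part of MediaWiki: maps an execute() return
// value to a process exit status following the documented rule.
function exitStatusFor( $result ): int {
	// Only a strict boolean false signals failure. true, null, and a
	// missing return value (which PHP evaluates to null) mean success.
	// The value 1 is an assumed stand-in for "non-zero".
	return $result === false ? 1 : 0;
}

// A function without a return statement evaluates to null when called.
function returnsNothing() {
}

var_dump( exitStatusFor( true ) );
var_dump( exitStatusFor( returnsNothing() ) );
var_dump( exitStatusFor( false ) );
```

The strict comparison matters here: a loose check such as `!$result` would wrongly treat null as failure.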

◆ getNextRevision()

FindBadBlobs::getNextRevision ( int  $revId,
string  $comp,
string  $dir 
)
private

Returns the revision ID next to $revId, according to $comp and $dir.

Parameters
int $revId
string $comp: the comparator, either '<' or '>', to go with $dir
string $dir: the sort direction to go with $comp, either 'ASC' or 'DESC'
Returns
int

Definition at line 326 of file findBadBlobs.php.

References DB_REPLICA.

Referenced by scanRevisionsByTimestamp().
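
How the comparator and the sort direction work together can be sketched over an in-memory list of IDs. The function `nextRevisionId()` is hypothetical: the real method reads from a replica database (it references DB_REPLICA), and returning 0 when no neighbouring revision exists is an assumption made for this sketch.

```php
<?php
// Hypothetical in-memory stand-in for the lookup; the real method queries
// a replica database instead of scanning an array.
function nextRevisionId( array $ids, int $revId, string $comp, string $dir ): int {
	// Keep only the IDs on the requested side of $revId.
	$candidates = array_filter(
		$ids,
		static function ( int $id ) use ( $revId, $comp ): bool {
			return $comp === '>' ? $id > $revId : $id < $revId;
		}
	);
	if ( $candidates === [] ) {
		// Assumption: 0 signals that there is no neighbouring revision.
		return 0;
	}
	// 'ASC' picks the smallest candidate, anything else the largest.
	return $dir === 'ASC' ? min( $candidates ) : max( $candidates );
}

$ids = [ 10, 12, 15, 20 ];
var_dump( nextRevisionId( $ids, 12, '>', 'ASC' ) );
var_dump( nextRevisionId( $ids, 12, '<', 'DESC' ) );
```

Pairing '>' with 'ASC' yields the closest following ID, and pairing '<' with 'DESC' yields the closest preceding one, which is why the two parameters are documented as going together.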

◆ getRevisionIds()

FindBadBlobs::getRevisionIds ( )
private
Returns
int[]

Definition at line 115 of file findBadBlobs.php.

References Maintenance\getOption(), and Maintenance\parseIntList().

Referenced by execute().
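
Since this method references getOption() and parseIntList() and returns int[], it presumably turns an option string into a list of revision IDs. A minimal sketch of that parsing step follows; `parseIdList()` is a hypothetical stand-in, and the accepted separators (commas and whitespace) are an assumption, not the documented behaviour of parseIntList().

```php
<?php
// Hypothetical stand-in for the parsing step; the separators accepted
// here (commas and whitespace) are an assumption for illustration.
function parseIdList( string $text ): array {
	// Split on runs of commas and/or whitespace, dropping empty pieces.
	$parts = preg_split( '/[\s,]+/', $text, -1, PREG_SPLIT_NO_EMPTY );
	// Convert each remaining piece to an integer.
	return array_map( 'intval', $parts );
}

var_dump( parseIdList( '12, 15 20' ) );
```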

◆ getStartTimestamp()

FindBadBlobs::getStartTimestamp ( )
private
Returns
string

Definition at line 97 of file findBadBlobs.php.

References Maintenance\fatalError(), Maintenance\getOption(), and wfTimestamp().

Referenced by execute().

◆ handleStatus()

FindBadBlobs::handleStatus ( StatusValue  $status)
private

◆ initializeServices()

FindBadBlobs::initializeServices ( ?RevisionStore  $revisionStore = null,
?BlobStore  $blobStore = null,
?LoadBalancer  $loadBalancer = null,
?LBFactory  $lbFactory = null 
)

Definition at line 80 of file findBadBlobs.php.

References $blobStore, $lbFactory, $loadBalancer, and $revisionStore.

Referenced by execute().

◆ loadArchiveByRevisionId()

FindBadBlobs::loadArchiveByRevisionId ( int  $afterId,
int  $uptoId,
  $batchSize 
)
private
Parameters
int $afterId
int $uptoId
int $batchSize
Returns
RevisionArchiveRecord[]

Definition at line 294 of file findBadBlobs.php.

References DB_REPLICA, and handleStatus().

Referenced by scanRevisionsByTimestamp().

◆ loadRevisionsById()

FindBadBlobs::loadRevisionsById ( array  $ids)
private
Parameters
int[] $ids
Returns
RevisionRecord[]

Definition at line 373 of file findBadBlobs.php.

References DB_REPLICA, and handleStatus().

Referenced by scanRevisionsById().

◆ loadRevisionsByTimestamp()

FindBadBlobs::loadRevisionsByTimestamp ( int  $afterId,
string  $fromTimestamp,
  $batchSize 
)
private
Parameters
int $afterId
string $fromTimestamp
int $batchSize
Returns
RevisionStoreRecord[]

Definition at line 261 of file findBadBlobs.php.

References DB_REPLICA, and handleStatus().

Referenced by scanRevisionsByTimestamp().

◆ markBlob()

FindBadBlobs::markBlob ( RevisionRecord  $rev,
SlotRecord  $slot,
string  $error = null 
)
private
Parameters
RevisionRecord $rev
SlotRecord $slot
string|null $error
Returns
false|string

Definition at line 484 of file findBadBlobs.php.

References $args, DB_MASTER, MediaWiki\Revision\SlotRecord\getAddress(), MediaWiki\Revision\SlotRecord\getContentId(), Maintenance\getOption(), Maintenance\hasOption(), and wfArrayToCgi().

Referenced by checkSlot().

◆ scanRevisionsById()

FindBadBlobs::scanRevisionsById ( array  $ids)
private
Parameters
array $ids
Returns
int

Definition at line 343 of file findBadBlobs.php.

References checkRevision(), Maintenance\getBatchSize(), loadRevisionsById(), and Maintenance\output().

Referenced by execute().

◆ scanRevisionsByTimestamp()

FindBadBlobs::scanRevisionsByTimestamp (   $fromTimestamp,
  $total 
)
private
Parameters
string $fromTimestamp
int $total
Returns
int

Definition at line 183 of file findBadBlobs.php.

References checkRevision(), Maintenance\getBatchSize(), getNextRevision(), loadArchiveByRevisionId(), loadRevisionsByTimestamp(), Maintenance\output(), and waitForReplication().

Referenced by execute().

◆ waitForReplication()

FindBadBlobs::waitForReplication ( )
private

Definition at line 515 of file findBadBlobs.php.

Referenced by scanRevisionsByTimestamp().

Member Data Documentation

◆ $blobStore

BlobStore|null FindBadBlobs::$blobStore
private

Definition at line 49 of file findBadBlobs.php.

Referenced by initializeServices().

◆ $lbFactory

LBFactory FindBadBlobs::$lbFactory
private

Definition at line 59 of file findBadBlobs.php.

Referenced by initializeServices().

◆ $loadBalancer

LoadBalancer|null FindBadBlobs::$loadBalancer
private

Definition at line 54 of file findBadBlobs.php.

Referenced by initializeServices().

◆ $revisionStore

RevisionStore|null FindBadBlobs::$revisionStore
private

Definition at line 44 of file findBadBlobs.php.

Referenced by initializeServices().


The documentation for this class was generated from the following file: