MediaWiki REL1_31
CustomUppercaseCollation.php
<?php

class CustomUppercaseCollation extends NumericUppercaseCollation {

	// Uppercased alphabet entries, reordered longest-first by the constructor
	private $alphabet;

	// One 4-byte PUA code per alphabet entry, kept aligned with $alphabet
	private $puaSubset;

	public function __construct( array $alphabet, Language $lang ) {
		if ( count( $alphabet ) < 1 || count( $alphabet ) >= 4096 ) {
			throw new UnexpectedValueException( "Alphabet must be < 4096 items" );
		}
		$this->firstLetters = $alphabet;
		// For digraphs, only the first letter is capitalized in input
		$this->alphabet = array_map( [ $lang, 'uc' ], $alphabet );

		// Entry $i is assigned U+F3000 + $i, written out as UTF-8 bytes
		$this->puaSubset = [];
		$len = count( $alphabet );
		for ( $i = 0; $i < $len; $i++ ) {
			$this->puaSubset[] = "\xF3\xB3" . chr( floor( $i / 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
		}

		// Sort these arrays so that any trigraphs, digraphs etc. are first
		// (and they get replaced first in convertToPua()).
		$lengths = array_map( 'mb_strlen', $this->alphabet );
		array_multisort( $lengths, SORT_DESC, $this->firstLetters, $this->alphabet, $this->puaSubset );

		parent::__construct( $lang );
	}

	private function convertToPua( $string ) {
		return str_replace( $this->alphabet, $this->puaSubset, $string );
	}

	public function getSortKey( $string ) {
		return $this->convertToPua( parent::getSortKey( $string ) );
	}

	public function getFirstLetter( $string ) {
		$sortkey = $this->getSortKey( $string );

		// In case a title begins with a character from our alphabet, return the corresponding
		// first-letter. (This also happens if the title has a corresponding PUA code in it, to avoid
		// inconsistent behaviour. This class mostly assumes that people will not use PUA codes.)
		$index = array_search( substr( $sortkey, 0, 4 ), $this->puaSubset );
		if ( $index !== false ) {
			return $this->firstLetters[ $index ];
		}

		// String begins with a character outside of our alphabet, fall back
		return parent::getFirstLetter( $string );
	}
}
CustomUppercaseCollation\$alphabet
$alphabet
Definition: CustomUppercaseCollation.php:43
CustomUppercaseCollation\convertToPua
convertToPua( $string)
Definition: CustomUppercaseCollation.php:76
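The constructor sorts the alphabet longest-first because str_replace() applies its search entries in array order. The following standalone sketch illustrates that effect. The three-entry alphabet and the readable `<n>` stand-ins for PUA codes are assumptions made for the example, and strlen() is used in place of mb_strlen() because the sample is pure ASCII.

```php
<?php
// Hypothetical alphabet containing one digraph, with readable stand-ins
// for the PUA codes (index 0, 1, 2 in alphabet order).
$alphabet = [ 'C', 'CH', 'D' ];
$codes = [ '<0>', '<1>', '<2>' ];

// Unsorted: 'C' is replaced first, so the digraph 'CH' can no longer match.
$naive = str_replace( $alphabet, $codes, 'CHAD' );

// Longest-first, mirroring the constructor: sort by length descending and
// keep $alphabet and $codes aligned with each other.
$lengths = array_map( 'strlen', $alphabet );
array_multisort( $lengths, SORT_DESC, $alphabet, $codes );
$sorted = str_replace( $alphabet, $codes, 'CHAD' );

echo $naive, "\n";  // <0>HA<2>
echo $sorted, "\n"; // <1>A<2>
```

In the unsorted run the digraph is lost, while the sorted run maps 'CH' to its own code before the single letter 'C' is considered.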
CustomUppercaseCollation
Re-sorts the normal UTF-8 order by replacing each letter of a custom alphabet with a code in the Unicode Private Use Area (PUA).
Definition: CustomUppercaseCollation.php:40
CustomUppercaseCollation\getFirstLetter
getFirstLetter( $string)
Given a string, return the logical "first letter" to be used for grouping on category pages and so on...
Definition: CustomUppercaseCollation.php:84
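The lookup in getFirstLetter() relies on every PUA code being exactly four bytes long, so a byte-wise substr() isolates the first code of the sort key. A minimal sketch of that lookup, outside MediaWiki, might look like this. The function name sketchFirstLetter(), the sample alphabet, and the null return (standing in for the fallback to the parent class) are assumptions for illustration, and the search here is strict whereas the original call is not.

```php
<?php
// Hypothetical alphabet; entry 2 is a digraph. The codes are U+F3000..U+F3002.
$firstLetters = [ 'a', 'b', 'ch' ];
$pua = [ "\xF3\xB3\x80\x80", "\xF3\xB3\x80\x81", "\xF3\xB3\x80\x82" ];

// Stand-in for getFirstLetter(): returns null where the real method would
// defer to parent::getFirstLetter().
function sketchFirstLetter( string $sortkey, array $pua, array $firstLetters ): ?string {
	// The first 4 bytes are either one complete PUA code or ordinary text.
	$index = array_search( substr( $sortkey, 0, 4 ), $pua, true );
	return $index !== false ? $firstLetters[$index] : null;
}

var_dump( sketchFirstLetter( $pua[2] . 'EESE', $pua, $firstLetters ) ); // string(2) "ch"
var_dump( sketchFirstLetter( '42', $pua, $firstLetters ) );             // NULL
```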
CustomUppercaseCollation\$puaSubset
$puaSubset
Definition: CustomUppercaseCollation.php:46
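Each $puaSubset entry is built from two fixed lead bytes plus two bytes derived from the alphabet index. The sketch below reproduces that byte construction and decodes the result, to show that index $i lands on code point U+F3000 + $i. The helper names puaForIndex() and codePointOf() are hypothetical, and intdiv() is used here in place of the original floor() call.

```php
<?php
// puaForIndex() and codePointOf() are hypothetical helpers, not MediaWiki API.

// Same byte layout as the constructor's loop: F3 B3 selects the block
// U+F3000..U+F3FFF, and the two continuation bytes carry 6 bits of $i each.
function puaForIndex( int $i ): string {
	return "\xF3\xB3" . chr( intdiv( $i, 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
}

// Decode a 4-byte UTF-8 sequence back to its code point.
function codePointOf( string $bytes ): int {
	return ( ( ord( $bytes[0] ) & 0x07 ) << 18 )
		| ( ( ord( $bytes[1] ) & 0x3F ) << 12 )
		| ( ( ord( $bytes[2] ) & 0x3F ) << 6 )
		| ( ord( $bytes[3] ) & 0x3F );
}

foreach ( [ 0, 1, 64, 4095 ] as $i ) {
	printf( "%4d -> %s = U+%X\n", $i, bin2hex( puaForIndex( $i ) ), codePointOf( puaForIndex( $i ) ) );
}
```

Two continuation bytes hold 12 bits in total, which gives 64 * 64 = 4096 distinct codes and matches the constructor's rejection of alphabets with 4096 or more entries.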
CustomUppercaseCollation\getSortKey
getSortKey( $string)
Given a string, convert it to a (hopefully short) key that can be used for efficient sorting.
Definition: CustomUppercaseCollation.php:80
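Because the PUA codes are assigned in alphabet order, a plain byte comparison of the resulting sort keys follows the custom alphabet. The following sketch shows this under simplifying assumptions: sketchSortKey() is a hypothetical stand-in for getSortKey(), strtoupper() replaces the key produced by the parent collation, and the alphabet (with Z ahead of A and B) is invented for the example.

```php
<?php
// Hypothetical alphabet in which Z sorts before A and B.
$alphabet = [ 'Z', 'A', 'B' ];
$pua = [];
foreach ( array_keys( $alphabet ) as $i ) {
	$pua[] = "\xF3\xB3" . chr( intdiv( $i, 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
}

// Stand-in for getSortKey(): uppercase the title, then swap alphabet
// letters for their PUA codes.
function sketchSortKey( string $title, array $alphabet, array $pua ): string {
	return str_replace( $alphabet, $pua, strtoupper( $title ) );
}

$titles = [ 'apple', 'zebra', 'banana' ];
usort( $titles, function ( $a, $b ) use ( $alphabet, $pua ) {
	return strcmp( sketchSortKey( $a, $alphabet, $pua ), sketchSortKey( $b, $alphabet, $pua ) );
} );

print_r( $titles ); // zebra, apple, banana
```

Letters that are not in the alphabet stay as ordinary bytes. In this sketch those bytes are ASCII, so they compare lower than any PUA code.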
NumericUppercaseCollation
Collation that orders text with numbers "naturally", so that 'Foo 1' < 'Foo 2' < 'Foo 12'.
Definition: NumericUppercaseCollation.php:35
CustomUppercaseCollation\__construct
__construct(array $alphabet, Language $lang)
Definition: CustomUppercaseCollation.php:54
UppercaseCollation\$lang
$lang
Definition: UppercaseCollation.php:25
Language
Internationalisation code.
Definition: Language.php:35