MediaWiki REL1_32
CustomUppercaseCollation.php
<?php

class CustomUppercaseCollation extends NumericUppercaseCollation {

	/** @var array Letters of the alphabet, as passed to the constructor */
	private $firstLetters;

	/** @var array Letters of the alphabet in upper case */
	private $alphabet;

	/** @var array Private Use Area (PUA) sequences, one per letter */
	private $puaSubset;

	public function __construct( array $alphabet, Language $lang ) {
		// Empty alphabets are rejected as well; 4096 is the number of PUA
		// sequences that the two trailing bytes below can encode (64 * 64).
		if ( count( $alphabet ) < 1 || count( $alphabet ) >= 4096 ) {
			throw new UnexpectedValueException( "Alphabet must be < 4096 items" );
		}
		$this->firstLetters = $alphabet;
		// For digraphs, only the first letter is capitalized in input
		$this->alphabet = array_map( [ $lang, 'uc' ], $alphabet );

		// Assign the letter at position $i the 4-byte UTF-8 sequence for U+F3000 + $i.
		$this->puaSubset = [];
		$len = count( $alphabet );
		for ( $i = 0; $i < $len; $i++ ) {
			$this->puaSubset[] = "\xF3\xB3" . chr( floor( $i / 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
		}

		// Sort these arrays so that any trigraphs, digraphs etc. are first
		// (and they get replaced first in convertToPua()).
		$lengths = array_map( 'mb_strlen', $this->alphabet );
		array_multisort( $lengths, SORT_DESC, $this->firstLetters, $this->alphabet, $this->puaSubset );

		parent::__construct( $lang );
	}

	private function convertToPua( $string ) {
		return str_replace( $this->alphabet, $this->puaSubset, $string );
	}

	public function getSortKey( $string ) {
		return $this->convertToPua( parent::getSortKey( $string ) );
	}

	public function getFirstLetter( $string ) {
		$sortkey = $this->getSortKey( $string );

		// In case a title begins with a character from our alphabet, return the corresponding
		// first-letter. (This also happens if the title has a corresponding PUA code in it, to avoid
		// inconsistent behaviour. This class mostly assumes that people will not use PUA codes.)
		// Every PUA sequence is exactly 4 bytes long, hence substr( ..., 0, 4 ).
		$index = array_search( substr( $sortkey, 0, 4 ), $this->puaSubset );
		if ( $index !== false ) {
			return $this->firstLetters[ $index ];
		}

		// String begins with a character outside of our alphabet, fall back
		return parent::getFirstLetter( $string );
	}
}
Resort normal UTF-8 order by putting a bunch of stuff in PUA.
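The byte arithmetic behind "putting a bunch of stuff in PUA" can be checked in isolation. The following is a minimal sketch, not part of MediaWiki: puaForIndex() and codePointOf() are hypothetical helper names, and intdiv() stands in for the floor() division used in the constructor. It shows that alphabet position $i maps to the code point U+F3000 + $i, which lies in Supplementary Private Use Area-A, and that byte order follows index order.

```php
<?php
// Hypothetical helper (not in MediaWiki): builds the same 4-byte UTF-8
// sequence that the constructor generates for alphabet position $i.
function puaForIndex( int $i ): string {
	if ( $i < 0 || $i >= 4096 ) {
		throw new InvalidArgumentException( 'Index must be in 0..4095' );
	}
	// 0xF3 0xB3 are the lead bytes shared by U+F3000..U+F3FFF; the 12 bits
	// of the index are split across the two continuation bytes.
	return "\xF3\xB3" . chr( intdiv( $i, 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
}

// Hypothetical helper: decodes a 4-byte UTF-8 sequence to its code point.
function codePointOf( string $bytes ): int {
	return ( ( ord( $bytes[0] ) & 0x07 ) << 18 )
		| ( ( ord( $bytes[1] ) & 0x3F ) << 12 )
		| ( ( ord( $bytes[2] ) & 0x3F ) << 6 )
		| ( ord( $bytes[3] ) & 0x3F );
}

printf( "U+%X\n", codePointOf( puaForIndex( 0 ) ) );    // U+F3000
printf( "U+%X\n", codePointOf( puaForIndex( 4095 ) ) ); // U+F3FFF
```

Because the sequences grow byte-wise with the index, comparing sort keys as plain byte strings reproduces the order in which the letters were listed in the alphabet.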
getSortKey( $string )
Given a string, convert it to a (hopefully short) key that can be used for efficient sorting.
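To see why the longest alphabet entries have to be replaced first, here is a standalone sketch of the sort-key construction. It makes several assumptions: buildSortKey() is a hypothetical function, mb_strtoupper() stands in for the Language object's uc() method, the numeric handling of the parent class is left out, and the alphabet is purely illustrative (a digraph "ch" that sorts after "h").

```php
<?php
// Hypothetical stand-in for the collation: no Language object, no numeric
// handling, only the upper-casing and PUA replacement steps.
function buildSortKey( string $title, array $alphabet ): string {
	$upper = array_map( 'mb_strtoupper', $alphabet );
	$pua = [];
	$count = count( $alphabet );
	for ( $i = 0; $i < $count; $i++ ) {
		$pua[] = "\xF3\xB3" . chr( intdiv( $i, 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
	}
	// Longest entries first, so "CH" is replaced before "C" and "H" are.
	$lengths = array_map( 'mb_strlen', $upper );
	array_multisort( $lengths, SORT_DESC, $upper, $pua );
	return str_replace( $upper, $pua, mb_strtoupper( $title ) );
}

// Illustrative alphabet in which the digraph "ch" sorts after "h".
$alphabet = [ 'a', 'b', 'c', 'd', 'h', 'ch', 'i' ];
$titles = [ 'Chad', 'Ida', 'Cab', 'Had' ];
usort( $titles, function ( $x, $y ) use ( $alphabet ) {
	return strcmp( buildSortKey( $x, $alphabet ), buildSortKey( $y, $alphabet ) );
} );
print_r( $titles ); // Cab, Had, Chad, Ida
```

Without the length-based ordering, "C" would be replaced first and "Chad" would sort next to "Cab" instead of after "Had".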
__construct( array $alphabet, Language $lang )
getFirstLetter( $string )
Given a string, return the logical "first letter" to be used for grouping on category pages and so on...
Language
Internationalisation code.
Definition Language.php:35
NumericUppercaseCollation
Collation that orders text with numbers "naturally", so that 'Foo 1' < 'Foo 2' < 'Foo 12'.
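The natural ordering described above can be obtained with plain byte comparison if every run of digits is prefixed with its length. The sketch below illustrates that general technique only; it is an assumption for illustration and is not taken from the parent class, whose code is not part of this listing. naturalKey() is a hypothetical name, and the two-digit prefix limits this sketch to runs of at most 99 digits.

```php
<?php
// Hypothetical helper: prefix every digit run with its length so that
// byte-wise comparison orders the numbers by value.
function naturalKey( string $s ): string {
	return preg_replace_callback( '/\d+/', function ( array $m ): string {
		// Leading zeros do not change the value, so drop them first.
		$number = ltrim( $m[0], '0' );
		// Two-digit length prefix: shorter numbers sort before longer ones.
		return sprintf( '%02d', strlen( $number ) ) . $number;
	}, strtoupper( $s ) );
}

$titles = [ 'Foo 12', 'Foo 2', 'Foo 1' ];
usort( $titles, function ( $x, $y ) {
	return strcmp( naturalKey( $x ), naturalKey( $y ) );
} );
print_r( $titles ); // Foo 1, Foo 2, Foo 12
```

A plain strcmp() on the raw titles would place 'Foo 12' before 'Foo 2', because '1' sorts before '2' at the first differing byte.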