MediaWiki REL1_33
CustomUppercaseCollation.php
Go to the documentation of this file.
1<?php
41
43 private $alphabet;
44
46 private $puaSubset;
47
55 if ( count( $alphabet ) < 1 || count( $alphabet ) >= 4096 ) {
56 throw new UnexpectedValueException( "Alphabet must be < 4096 items" );
57 }
58 $this->firstLetters = $alphabet;
59 // For digraphs, only the first letter is capitalized in input
60 $this->alphabet = array_map( [ $lang, 'uc' ], $alphabet );
61
62 $this->puaSubset = [];
63 $len = count( $alphabet );
64 for ( $i = 0; $i < $len; $i++ ) {
65 $this->puaSubset[] = "\xF3\xB3" . chr( floor( $i / 64 ) + 128 ) . chr( ( $i % 64 ) + 128 );
66 }
67
68 // Sort these arrays so that any trigraphs, digraphs etc. are first
69 // (and they get replaced first in convertToPua()).
70 $lengths = array_map( 'mb_strlen', $this->alphabet );
71 array_multisort( $lengths, SORT_DESC, $this->firstLetters, $this->alphabet, $this->puaSubset );
72
73 parent::__construct( $lang );
74 }
75
76 private function convertToPua( $string ) {
77 return str_replace( $this->alphabet, $this->puaSubset, $string );
78 }
79
80 public function getSortKey( $string ) {
81 return $this->convertToPua( parent::getSortKey( $string ) );
82 }
83
84 public function getFirstLetter( $string ) {
85 $sortkey = $this->getSortKey( $string );
86
87 // In case a title begins with a character from our alphabet, return the corresponding
88 // first-letter. (This also happens if the title has a corresponding PUA code in it, to avoid
89 // inconsistent behaviour. This class mostly assumes that people will not use PUA codes.)
90 $index = array_search( substr( $sortkey, 0, 4 ), $this->puaSubset );
91 if ( $index !== false ) {
92 return $this->firstLetters[ $index ];
93 }
94
95 // String begins with a character outside of our alphabet, fall back
96 return parent::getFirstLetter( $string );
97 }
98}
and that you know you can do these things To protect your we need to make restrictions that forbid anyone to deny you these rights or to ask you to surrender the rights These restrictions translate to certain responsibilities for you if you distribute copies of the or if you modify it For if you distribute copies of such a whether gratis or for a you must give the recipients all the rights that you have You must make sure that receive or can get the source code And you must show them these terms so they know their rights We protect your rights with two and(2) offer you this license which gives you legal permission to copy
Resort normal UTF-8 order by putting a bunch of stuff in PUA.
getSortKey( $string)
Given a string, convert it to a (hopefully short) key that can be used for efficient sorting.
__construct(array $alphabet, Language $lang)
getFirstLetter( $string)
Given a string, return the logical "first letter" to be used for grouping on category pages and so on...
Internationalisation code.
Definition Language.php:36
Collation that orders text with numbers "naturally", so that 'Foo 1' < 'Foo 2' < 'Foo 12'.
The wiki should then use memcached to cache various data To use multiple just add more items to the array To increase the weight of a make its entry a array("192.168.0.1:11211", 2))