MediaWiki
1.28.0
|
Some of these functions are adapted from places in MediaWiki. More...
Go to the source code of this file.
Functions | |
codepointToUtf8 ($codepoint) | |
Return UTF-8 sequence for a given Unicode code point. More... | |
escapeSingleString ($string) | |
Escape a string for inclusion in a PHP single-quoted string literal. More... | |
hexSequenceToUtf8 ($sequence) | |
Take a series of space-separated hexadecimal numbers representing Unicode code points and return a UTF-8 string composed of those characters. More... | |
utf8ToCodepoint ($char) | |
Determine the Unicode codepoint of a single-character UTF-8 sequence. More... | |
utf8ToHexSequence ($str) | |
Take a UTF-8 string and return a space-separated series of hex numbers representing Unicode code points. More... | |
Some of these functions are adapted from places in MediaWiki.
Should probably merge them for consistency.
Copyright © 2004 Brion Vibber brion @pob ox.co mhttps://www.mediawiki.org/
This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.
You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA. http://www.gnu.org/copyleft/gpl.html
Definition in file UtfNormalUtil.php.
codepointToUtf8 | ( | $codepoint | ) |
Return UTF-8 sequence for a given Unicode code point.
int | $codepoint |
InvalidArgumentException | if fed out of range data. |
Definition at line 39 of file UtfNormalUtil.php.
Referenced by Sanitizer\cssDecodeCallback(), Sanitizer\decodeChar(), Sanitizer\decodeEntity(), GenerateCollationData\generateFirstChars(), and EditPage\unmakeSafe().
escapeSingleString | ( | $string | ) |
Escape a string for inclusion in a PHP single-quoted string literal.
string | $string | string to be escaped. |
Definition at line 97 of file UtfNormalUtil.php.
hexSequenceToUtf8 | ( | $sequence | ) |
Take a series of space-separated hexadecimal numbers representing Unicode code points and return a UTF-8 string composed of those characters.
Used by UTF-8 data generation and testing routines.
string | $sequence |
InvalidArgumentException | if fed out of range data. |
Definition at line 54 of file UtfNormalUtil.php.
Referenced by GenerateNormalizerDataMl\execute(), and GenerateNormalizerDataAr\execute().
utf8ToCodepoint | ( | $char | ) |
Determine the Unicode codepoint of a single-character UTF-8 sequence.
Does not check for invalid input data.
string | $char |
Definition at line 85 of file UtfNormalUtil.php.
Referenced by Language\firstChar(), IcuCollation\getFirstLetter(), ApiFormatXml\mangleName(), Sanitizer\normalizeCss(), and utf8ToHexSequence().
utf8ToHexSequence | ( | $str | ) |
Take a UTF-8 string and return a space-separated series of hex numbers representing Unicode code points.
For debugging.
this is private but extensions + maint scripts are using it
string | $str | UTF-8 string. |
Definition at line 67 of file UtfNormalUtil.php.
References as, and utf8ToCodepoint().
Referenced by Digit2Html\execute().