PHP - Manual: mb_strtolower

2026-08-02

mb_strtoupper »

« mb_strstr

mb_strtolower

(PHP 4 >= 4.3.0, PHP 5, PHP 7, PHP 8)

mb_strtolower — 使字符串小写

说明

mb_strtolower(string $string, ?string $encoding = null): string

返回所有字母字符转换成小写的 string。

参数

string: 要被小写的 string。
encoding: encoding 参数为字符编码。如果省略或是 null，则使用内部字符编码。

返回值

所有字母字符已被转换成小写的 string。

更新日志

版本	说明
8.3.0	为希腊字母 sigma 实现了条件性大小写规则。

示例

示例 #1 mb_strtolower() 示例

<?php
$str = "Mary Had A Little Lamb and She LOVED It So";
$str = mb_strtolower($str);
echo $str; // 输出： mary had a little lamb and she loved it so
?>

示例 #2 非拉丁 UTF-8 文本的 mb_strtolower() 例子

<?php
$str = "Τάχιστη αλώπηξ βαφής ψημένη γη, δρασκελίζει υπέρ νωθρού κυνός";
$str = mb_strtolower($str, 'UTF-8');
echo $str; // 输出 τάχιστη αλώπηξ βαφής ψημένη γη, δρασκελίζει υπέρ νωθρού κυνός
?>

注释

和 strtolower() 不同的是，“字母”字符的检测是根据字符的 Unicode 属性。因此函数的行为不会受语言设置的影响，能偶转换任意具有“字母”属性的字符，例如元音变音 A（ä）。

更多关于 Unicode 属性的信息，请参见 » http://www.unicode.org/reports/tr21/。

参见

mb_strtoupper() - 使字符串大写
mb_convert_case() - 对字符串进行大小写转换
strtolower() - 将字符串转化为小写

发现了问题？

了解如何改进此页面 • 提交拉取请求 • 报告一个错误

＋添加备注

用户贡献的备注 6 notes

down

akniep at linklift dot net ¶

13 years ago

Please, note that when using with UTF-8 mb_strtolower will only convert upper case characters to lower case which are marked with the Unicode property "Upper case letter" ("Lu"). However, there are also letters such as "Letter numbers" (Unicode property "Nl") that also have lower case and upper case variants. These characters will not be converted be mb_strtolower!


Example:

The Roman letters Ⅰ, Ⅱ, Ⅲ, ..., Ⅿ (UTF-8 code points 8544 through 8559) also exist in their respective lower case variants ⅰ, ⅱ, ⅲ, ..., ⅿ (UTF-8 code points 8560 through 8575) and should, in my opinion, also be converted by mb_strtolower, but they are not!


Big internet-companies (like Google) do match both variants as semantically equal (since the representations only differ in case).


Since I was not finding any proper solution in the internet on how to map all UTF8-strings to their lowercase counterpart in PHP, I offer the following hard-coded extended mb_strtolower function for UTF-8 strings:


The function wraps the existing function mb_strtolower() and additionally replaces uppercase UTF8-characters for which there is a lowercase representation. Since there is no proper Unicode uppercase and lowercase character-table in the internet that I was able to find, I checked the first million UTF8-characters against the Google-search and -KeywordTool and identified the following 78 characters as uppercase-characters, not being replaced by mb_strtolower, but having a UTF8 lowercase counterpart.


<?php


//the numbers in the in-line-comments display the characters' Unicode code-points (CP).

function strtolower_utf8_extended( $utf8_string )

{

$additional_replacements    = array

        ( "ǅ"    => "ǆ"        //   453 ->   454

, "ǈ"    => "ǉ"        //   456 ->   457

, "ǋ"    => "ǌ"        //   459 ->   460

, "ǲ"    => "ǳ"        //   498 ->   499

, "Ϸ"    => "ϸ"        //  1015 ->  1016

, "Ϲ"    => "ϲ"        //  1017 ->  1010

, "Ϻ"    => "ϻ"        //  1018 ->  1019

, "ᾈ"    => "ᾀ"        //  8072 ->  8064

, "ᾉ"    => "ᾁ"        //  8073 ->  8065

, "ᾊ"    => "ᾂ"        //  8074 ->  8066

, "ᾋ"    => "ᾃ"        //  8075 ->  8067

, "ᾌ"    => "ᾄ"        //  8076 ->  8068

, "ᾍ"    => "ᾅ"        //  8077 ->  8069

, "ᾎ"    => "ᾆ"        //  8078 ->  8070

, "ᾏ"    => "ᾇ"        //  8079 ->  8071

, "ᾘ"    => "ᾐ"        //  8088 ->  8080

, "ᾙ"    => "ᾑ"        //  8089 ->  8081

, "ᾚ"    => "ᾒ"        //  8090 ->  8082

, "ᾛ"    => "ᾓ"        //  8091 ->  8083

, "ᾜ"    => "ᾔ"        //  8092 ->  8084

, "ᾝ"    => "ᾕ"        //  8093 ->  8085

, "ᾞ"    => "ᾖ"        //  8094 ->  8086

, "ᾟ"    => "ᾗ"        //  8095 ->  8087

, "ᾨ"    => "ᾠ"        //  8104 ->  8096

, "ᾩ"    => "ᾡ"        //  8105 ->  8097

, "ᾪ"    => "ᾢ"        //  8106 ->  8098

, "ᾫ"    => "ᾣ"        //  8107 ->  8099

, "ᾬ"    => "ᾤ"        //  8108 ->  8100

, "ᾭ"    => "ᾥ"        //  8109 ->  8101

, "ᾮ"    => "ᾦ"        //  8110 ->  8102

, "ᾯ"    => "ᾧ"        //  8111 ->  8103

, "ᾼ"    => "ᾳ"        //  8124 ->  8115

, "ῌ"    => "ῃ"        //  8140 ->  8131

, "ῼ"    => "ῳ"        //  8188 ->  8179

, "Ⅰ"    => "ⅰ"        //  8544 ->  8560

, "Ⅱ"    => "ⅱ"        //  8545 ->  8561

, "Ⅲ"    => "ⅲ"        //  8546 ->  8562

, "Ⅳ"    => "ⅳ"        //  8547 ->  8563

, "Ⅴ"    => "ⅴ"        //  8548 ->  8564

, "Ⅵ"    => "ⅵ"        //  8549 ->  8565

, "Ⅶ"    => "ⅶ"        //  8550 ->  8566

, "Ⅷ"    => "ⅷ"        //  8551 ->  8567

, "Ⅸ"    => "ⅸ"        //  8552 ->  8568

, "Ⅹ"    => "ⅹ"        //  8553 ->  8569

, "Ⅺ"    => "ⅺ"        //  8554 ->  8570

, "Ⅻ"    => "ⅻ"        //  8555 ->  8571

, "Ⅼ"    => "ⅼ"        //  8556 ->  8572

, "Ⅽ"    => "ⅽ"        //  8557 ->  8573

, "Ⅾ"    => "ⅾ"        //  8558 ->  8574

, "Ⅿ"    => "ⅿ"        //  8559 ->  8575

, "Ⓐ"    => "ⓐ"        //  9398 ->  9424

, "Ⓑ"    => "ⓑ"        //  9399 ->  9425

, "Ⓒ"    => "ⓒ"        //  9400 ->  9426

, "Ⓓ"    => "ⓓ"        //  9401 ->  9427

, "Ⓔ"    => "ⓔ"        //  9402 ->  9428

, "Ⓕ"    => "ⓕ"        //  9403 ->  9429

, "Ⓖ"    => "ⓖ"        //  9404 ->  9430

, "Ⓗ"    => "ⓗ"        //  9405 ->  9431

, "Ⓘ"    => "ⓘ"        //  9406 ->  9432

, "Ⓙ"    => "ⓙ"        //  9407 ->  9433

, "Ⓚ"    => "ⓚ"        //  9408 ->  9434

, "Ⓛ"    => "ⓛ"        //  9409 ->  9435

, "Ⓜ"    => "ⓜ"        //  9410 ->  9436

, "Ⓝ"    => "ⓝ"        //  9411 ->  9437

, "Ⓞ"    => "ⓞ"        //  9412 ->  9438

, "Ⓟ"    => "ⓟ"        //  9413 ->  9439

, "Ⓠ"    => "ⓠ"        //  9414 ->  9440

, "Ⓡ"    => "ⓡ"        //  9415 ->  9441

, "Ⓢ"    => "ⓢ"        //  9416 ->  9442

, "Ⓣ"    => "ⓣ"        //  9417 ->  9443

, "Ⓤ"    => "ⓤ"        //  9418 ->  9444

, "Ⓥ"    => "ⓥ"        //  9419 ->  9445

, "Ⓦ"    => "ⓦ"        //  9420 ->  9446

, "Ⓧ"    => "ⓧ"        //  9421 ->  9447

, "Ⓨ"    => "ⓨ"        //  9422 ->  9448

, "Ⓩ"    => "ⓩ"        //  9423 ->  9449

, "𐐦"    => "𐑎"        // 66598 -> 66638

, "𐐧"    => "𐑏"        // 66599 -> 66639

);


$utf8_string    = mb_strtolower( $utf8_string, "UTF-8");


$utf8_string    = strtr( $utf8_string, $additional_replacements );


    return $utf8_string;

} //strtolower_utf8_extended()


?>

down

Philipp H ¶

17 years ago

Note that mb_strtolower() is very SLOW, if you have a database connection, you may want to use it to convert your strings to lower case. Even latin1/9 (iso-8859-1/15) and other encodings are possible.

Have a look at my simple benchmark:

<?php

$text = "Lörem ipßüm dölör ßit ämet, cönßectetüer ädipißcing elit. Sed ligülä. Präeßent jüßtö tellüß, grävidä eü, tempüß ä, mättiß nön, örci. Näm qüiß lörem. Näm äliqüet elit ßed elit. Phäßellüß venenätiß jüßtö eget enim. Dönec nißl. Pröin mättiß venenätiß jüßtö. Sed äliqüäm pörtä örci. Cräß elit nißl, cönvälliß qüiß, tincidünt ät, vehicülä äccümßän, ödiö. Sed möleßtie. Etiäm mölliß feügiät elit. Veßtibülüm änte ipßüm primiß in fäücibüß örci lüctüß et ültriceß pößüere cübiliä Cüräe; Mäecenäß nön nüllä.";

// mb_strtolower()
$timeMB = microtime(true);     

    for($i=0;$i<30000;$i++) 
$lower = mb_strtolower("$text/no-cache-$i");

$timeMB = microtime(true) - $timeMB;

// MySQL lower()
$timeSQL = microtime(true);    

mysql_query("set names latin1");               
    for($i=0;$i<30000;$i++) { 
$r = mysql_fetch_row(mysql_query("select lower('$text/no-cache-$i')"));
$lower = $r[0];
    }

$timeSQL = microtime(true) - $timeSQL;

echo "mb: ".sprintf("%.5f",$timeMB)." sek.<br />";
echo "sql: ".sprintf("%.5f",$timeSQL)." sek.<br />";

// Result on my notebook:
// mb: 11.50642 sek.
// sql: 5.44143 sek.

?>

down

fisharebest at gmail dot com ¶

12 years ago

There is not a one-to-one correspondence between upper and lower case letters.

Turkish is a good example of this.  In Turkish, the letter I/i has a dotted-upper-case form (İ) and a dotless-lower-case form (ı).

https://en.wikipedia.org/wiki/Dotted_and_dotless_I

This means that you cannot correctly convert between upper-case and lower-case without also knowing the locale of the data.

Since the function does not let you specify a locale, you should only use this function for text written in languages that follow the same orthography as English.

Although it does handle some digraphs, such as the Dutch ij (ĳ), it does not handle others, such as the Polish dz (ʣ).

down

btherl at yahoo dot com dot au ¶

19 years ago

If you use this function on a unicode string without telling PHP that it is unicode, then you will corrupt your string.  In particular, the uppercase 'A' with tilde, common in 2-byte UTF-8 characters, is converted to lowercase 'a' with tilde.

This can be handled correctly by:
$str = mb_strtolower($str, mb_detect_encoding($str));

Or if you know your data is UTF-8, just use the string "UTF-8" as the second argument.

You should check also that mb_detect_encoding() is checking the encodings you want it to check, and is detecting the correct encodings.

down

Ken Shiro ¶

14 years ago

[If you get this error:]

Fatal error: Call to undefined function: mb_strtolower() in ????.php on line ??


The PHP mbstring extension, which is required to handle international character sets, is not available on your server. Check your PHP configuration and make sure that PHP has been compiled with --enable-mbstring.


It's also apply to

Call to undefined function mb_eregi() / mb_strtolower()

down

Ukio ¶

9 years ago

Maybe it help someone.
Make up case with first char, low case for other.

<?php
function str_split_unicode($str, $l = 0) {
    if ($l > 0) {
$ret = array();
$len = mb_strlen($str, "UTF-8");
        for ($i = 0; $i < $len; $i += $l) {
$ret[] = mb_substr($str, $i, $l, "UTF-8");
        }
        return $ret;
    }
    return preg_split("//u", $str, -1, PREG_SPLIT_NO_EMPTY);
}

function ToCorrectCase($str){

$str = mb_strtolower($str);
$str_array = str_split_unicode($str);
$str_array[0] = mb_strtoupper($str_array[0]);
$str = '';
    foreach ($str_array as $key){
$str = $str.$key;
    }
    return $str;
}
?>

＋添加备注

官方地址：https://www.php.net/manual/en/function.mb-strtolower.php

有任何技术问题请点击这里网站运营推广招聘

IT PHP 编程语言开发编程 Linux 科技 Elasticsearch 数据库面试 HTML/CSS/XML 网络 JAVA NoSQL 操作系统 C/C++ Golang Git 算法正则表达式 Redis 互联网 MySql 软件运维 JavaScript 国际商业架构设计 Mac OS TCP/IP Excel Windows Oracle Socket VR Vim MongoDB 运营 Python MemCache 硬件电子娱乐设计摄影 nginx 游戏 WordPress HTTP 团建数码电器 Docker 大模型

mysql8切换用户密码验证方式 sha256_password警告 php7.3 使用 PDO_DM 扩展连接 DM8 中文乱码 laravel查看orm生成的sql 使用PHPWord将docx文件转换为html格式 docker-compose启动nginx与php-fpm PhpStorm中PHP注释的规范指南 composer install参数 PHPStorm ESC 会退出命令行 laravel orm中DB::insert方法导致内存泄漏的问题解决方法 Composer的Packagist资源在命令行直接运行 PHP 代码 PHP json解析（json_decode）页面工具 adodb连接mysql多个数据库的问题 PHP历史版本下载 adodb手册利用php soap实现web service PHP:GD 和图像处理函数 ADORecordSet对象 PHP的Socket通信之UDP篇 PHP从命令行参数列表中获取选项

略微加速

PHP官方手册 - 互联网笔记

mb_strtolower

说明

参数

返回值

更新日志

示例

注释

参见

发现了问题？

用户贡献的备注 6 notes