RegularExpressions.php

Simple Machines Forum (SMF)

Tags
author

Simple Machines https://www.simplemachines.org

copyright

2025 Simple Machines and individual contributors

license

https://www.simplemachines.org/about/smf/license.php BSD

version

3.0 Alpha 2

Table of Contents

Functions

utf8_regex_properties()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles and utf8_convert_case.
utf8_regex_variation_selectors()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.
utf8_regex_joining_type()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.
utf8_regex_indic()  : array<string|int, mixed>
Helper function for utf8_sanitize_invisibles.

Functions

utf8_regex_properties()

Helper function for utf8_sanitize_invisibles and utf8_convert_case.

utf8_regex_properties() : array<string|int, mixed>

Character class lists compiled from:
https://www.unicode.org/Public/UCD/latest/ucd/DerivedCoreProperties.txt
https://www.unicode.org/Public/UCD/latest/ucd/PropList.txt
https://www.unicode.org/Public/UCD/latest/ucd/emoji/emoji-data.txt
https://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedGeneralCategory.txt
https://www.unicode.org/Public/UCD/latest/ucd/auxiliary/WordBreakProperty.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for various Unicode properties.
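As a rough illustration of how character classes like these are typically consumed, the sketch below places one class inside a PCRE bracket expression and strips the matching characters. The functions example_regex_properties() and example_strip_bidi_controls(), and the 'Bidi_Control' key, are hypothetical stand-ins for this example only. The real array is generated from the Unicode data files, and its keys and values may differ.

```php
<?php

// Hypothetical stand-in for utf8_regex_properties(); the key name and the
// value shown here are assumptions made for illustration only.
function example_regex_properties(): array
{
	return [
		// Bidirectional control characters, written as PCRE \x{...} escapes
		// so the string can be dropped straight into a bracket expression.
		'Bidi_Control' => '\x{061C}\x{200E}-\x{200F}\x{202A}-\x{202E}\x{2066}-\x{2069}',
	];
}

// Removes every character in the assumed Bidi_Control class from $string.
function example_strip_bidi_controls(string $string): string
{
	$prop_classes = example_regex_properties();

	// The /u modifier makes PCRE treat both pattern and subject as UTF-8.
	$result = preg_replace('/[' . $prop_classes['Bidi_Control'] . ']/u', '', $string);

	// preg_replace() returns null on failure (e.g. invalid UTF-8 input);
	// fall back to the unmodified string in that case.
	return $result ?? $string;
}

// U+202E (RIGHT-TO-LEFT OVERRIDE) is removed from the middle of the string.
$clean = example_strip_bidi_controls("abc\u{202E}def");
```

Storing each class as a plain string of escapes keeps the data independent of any single pattern, so the same entry can be reused in positive classes, negated classes, or lookarounds.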

utf8_regex_variation_selectors()

Helper function for utf8_sanitize_invisibles.

utf8_regex_variation_selectors() : array<string|int, mixed>

Character class lists compiled from:
https://www.unicode.org/Public/UCD/latest/ucd/StandardizedVariants.txt
https://www.unicode.org/Public/UCD/latest/ucd/emoji/emoji-variation-sequences.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for filtering variation selectors.
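One way such data could be used for filtering is sketched below. It assumes, purely for illustration, that each array key is a class of variation selectors and each value is a class of base characters those selectors may follow. The function names and the small data subset are hypothetical, and the shape of the real return value may differ.

```php
<?php

// Hypothetical stand-in for utf8_regex_variation_selectors(); the array
// shape (selector class => allowed base class) is an assumption.
function example_variation_selectors(): array
{
	return [
		// Text/emoji presentation selectors => a small subset of base
		// characters ("#", "*", digits, copyright and registered signs).
		'\x{FE0E}\x{FE0F}' => '\x{0023}\x{002A}\x{0030}-\x{0039}\x{00A9}\x{00AE}',
	];
}

// Removes variation selectors that do not directly follow an allowed base.
function example_strip_stray_selectors(string $string): string
{
	foreach (example_variation_selectors() as $selectors => $bases) {
		// The negative lookbehind succeeds when the preceding character is
		// not an allowed base, or when there is no preceding character.
		$pattern = '/(?<![' . $bases . '])[' . $selectors . ']/u';

		// Keep the previous value if preg_replace() fails and returns null.
		$string = preg_replace($pattern, '', $string) ?? $string;
	}

	return $string;
}

// Selector follows an allowed base (U+00A9), so it is kept.
$kept = example_strip_stray_selectors("\u{00A9}\u{FE0F}");

// Selector follows "a", which is not in the allowed class, so it is removed.
$stripped = example_strip_stray_selectors("a\u{FE0F}");
```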

utf8_regex_joining_type()

Helper function for utf8_sanitize_invisibles.

utf8_regex_joining_type() : array<string|int, mixed>

Character class lists compiled from: https://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedJoiningType.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for joining characters in certain scripts.

utf8_regex_indic()

Helper function for utf8_sanitize_invisibles.

utf8_regex_indic() : array<string|int, mixed>

Character class lists compiled from:
https://www.unicode.org/Public/UCD/latest/ucd/extracted/DerivedCombiningClass.txt
https://www.unicode.org/Public/UCD/latest/ucd/IndicSyllabicCategory.txt

Developers: Do not update the data in this function manually. Instead, run "php -f other/update_unicode_data.php" on the command line.

Return values
array<string|int, mixed>

Character classes for Indic scripts that use viramas.


        