Perl Unicode Cookbook: Extract by Grapheme Instead of Codepoint (regex)
℞ 30: Extract by grapheme instead of by codepoint (regex) Remember that Unicode defines a grapheme as “what a user thinks of as a character”. A codepoint is an integer value in the Unicode codespace. While ASCII conflates the two,…