aboutsummaryrefslogtreecommitdiff
path: root/textproc/utf8proc/pkg-descr
blob: 9f4bf6db7a1833a54cf2b68d2700d1bdc683c12c (plain) (blame)
1
2
3
4
5
6
7
utf8proc is a library for processing UTF-8 encoded Unicode strings.
Some features are Unicode normalization, stripping of default ignorable
characters, case folding and detection of grapheme cluster boundaries.
A special character mapping is available, which converts for example the
characters "Hyphen" (U+2010), "Minus" (U+2212) and
"Hyphen-Minus" (U+002D, ASCII Minus) all into the ASCII minus sign, to
make them equal for comparisons.