Java library for text normalization

I'm looking for a Java library that allows text normalization Similar to normalizer, but wider (something like utf8proc lump)

It should replace all types of special characters with ASCII equivalents, if possible Space code 32 for all variants, minuses (long, short, thin, etc.) to code 45 for all variants, etc

Solution

You should look at cldr Latin ASCII deformation, which will be in ICU 4.6

The content of this article comes from the network collection of netizens. It is used as a learning reference. The copyright belongs to the original author.
THE END
分享
二维码
< <上一篇
下一篇>>