NOTES.md

Unicode Resources

Unicode-related Haskell packages

Casemapping and Casefolding

The text package already provides proper Unicode case mapping and case folding operations.
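As a minimal sketch of what this covers, the following uses `toUpper` and `toCaseFold` from `Data.Text`. The helper name `equalIgnoringCase` is hypothetical and only for illustration; it compares two strings after case folding, which is the usual approach for caseless matching.

```haskell
{-# LANGUAGE OverloadedStrings #-}
module Main (main) where

import Data.Text (Text)
import qualified Data.Text as T

-- Hypothetical helper: caseless comparison via case folding.
-- Case folding is intended for comparison, not for display.
equalIgnoringCase :: Text -> Text -> Bool
equalIgnoringCase a b = T.toCaseFold a == T.toCaseFold b

main :: IO ()
main = do
  -- Full case mapping can change the length: the German eszett
  -- uppercases to the two-letter sequence "SS".
  print (T.toUpper "straße" == "STRASSE")
  -- Folding both sides makes the comparison insensitive to case.
  print (equalIgnoringCase "Straße" "STRASSE")
```

Note that these operations are not locale-sensitive; language-specific rules (for example Turkish dotted and dotless i) need a locale-aware library such as text-icu.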

Additional features in text-icu

The Haskell package text-icu is a full-featured implementation of Unicode operations via bindings to the ICU C/C++ libraries.

text-icu provides the following additional features:

  • Normalization checks
  • FCD normalization for collation
  • String collation
  • Iteration
  • Regular expressions
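Two of these features, normalization and collation, can be sketched as follows. This assumes the text-icu package and the system ICU libraries are installed, and uses `normalize`, `isNormalized`, `collator`, and `collate` as exported from `Data.Text.ICU`; newer text-icu releases may offer alternative normalization entry points. The name `sortGerman` and the sample strings are hypothetical.

```haskell
{-# LANGUAGE OverloadedStrings #-}
module Main (main) where

import Data.List (sortBy)
import Data.Text (Text)
import qualified Data.Text as T
import qualified Data.Text.ICU as ICU

-- "e" followed by U+0301 COMBINING ACUTE ACCENT (a decomposed "é").
decomposed :: Text
decomposed = "e\x0301"

-- NFC composes the pair into the single code point U+00E9.
composed :: Text
composed = ICU.normalize ICU.NFC decomposed

-- Hypothetical helper: sort using the collation rules of the German locale
-- instead of raw code point order.
sortGerman :: [Text] -> [Text]
sortGerman = sortBy (ICU.collate (ICU.collator (ICU.Locale "de")))

main :: IO ()
main = do
  -- Normalization check: the decomposed form is not in NFC.
  print (ICU.isNormalized ICU.NFC decomposed)
  -- Code point counts before and after composition.
  print (T.length decomposed, T.length composed)
  print (sortGerman ["zebra", "Apfel", "apfel"])
```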

Haskell Unicode Landscape

Unicode functionality in Haskell is fragmented across various packages. The most comprehensive functionality is provided by text-icu, which is based on the ICU C/C++ libraries. All related packages are listed here; they may or may not be up to date or useful.

Basic

  • base Data.Char module
  • charset Fast Unicode character sets

Unicode Character Database

Unicode Strings

ByteStrings (UTF8)

  • utf8-string Support for reading and writing UTF8 Strings
  • utf8-light Lightweight UTF8 handling
  • hxt-unicode Unicode en-/decoding functions for utf8, iso-latin-* and other encodings

Text (UTF-16 before text-2.0, UTF-8 since)

  • text An efficient packed Unicode text type
  • text-normal Data types for Unicode-normalized text - depends on text-icu