How to do Early Uniform Normalization
-
Working together with Unicode Technical Committee
-
Current status (Unicode
TR#15):
-
Base on cannonical equivalence
-
Use precomposed where available
-
Exceptions for scripts such as Hebrew
-
Cutoff at version 3.0 of Unicode (and the next edition of ISO/IEC 10646-1)
-
Decomposition for precomposed forms introduced after Unicode 3.0
-
Details in talk B7: Normalization, by Mark Davis