JEP 267: Unicode 8.0
| Owner | Yuka Kamiya |
| Type | Feature |
| Scope | SE |
| Status | Closed / Delivered |
| Release | 9 |
| Component | core-libs / java.lang |
| Discussion | i18n dash dev at openjdk dot java dot net |
| Effort | S |
| Duration | S |
| Reviewed by | Brian Goetz |
| Endorsed by | Brian Goetz |
| Created | 2015/03/13 04:55 |
| Updated | 2019/05/13 22:53 |
| Issue | 8075115 |
Summary
Upgrade existing platform APIs to support version 8.0 of the Unicode Standard.
Goals
Support the latest version of Unicode, with changes to the following classes:
CharacterandStringin thejava.langpackage,NumericShaperin thejava.awt.fontpackage, andBidi,BreakIterator, andNormalizerin thejava.textpackage.
Non-Goals
Two related Unicode specifications will not be implemented by this JEP:
- UTS #10, Unicode Collation Algorithm, and
- UTS #46, Unicode IDNA Compatibility Processing.
Motivation
Unicode is an evolving industry standard, so we must keep Java to date with the latest version.
Description
This is a follow-on to JEP 227, which introduced Unicode 7.0 in JDK 9. Unicode 8.0 adds an additional ~8,000 characters, 10 blocks, and 6 scripts.
Testing
We will need to verify that the latest Unicode data is correctly used by the relevant classes.
Risks and Assumptions
-
Unicode 8 was released in June 2015. While fairly late in JDK 9 development, it is important to always implement the latest Unicode standard. Deferring until JDK 10 would put us more than three years behind.
-
It's possible that a minor update to the Unicode Standard (e.g., 8.0.X) will be released before JDK 9 ships, in which case we may want to consider incorporating that version.
Dependences
This feature depends on the Unicode Standard of the Unicode Consortium.