What do U+AE40 김 and U+6A02 樂 have in common? 🤔
I enjoy working with standards.
Interestingly, standards that were published by the Koreas—South Korea (aka ROK) and North Korea (aka DPRK)—include characters that appear more than once.
In the case of South Korea, it is well known that 268 of the 4,888 ideographs (aka hanja) in the KS X 1001 standard are duplicates, which affects ideographs for which there are more than one reading. This, of course, means that there are 4,620 unique ideographs in that standard.
In the case of North Korea, their original KPS 9566 standard that is dated 1997 separately encodes the modern hangul syllables that represent the names of the previous (at the time) and current (again, at the time) leaders.
In exactly 10 days, Japan is expected to reveal the name of its next era that will begin on 2019-05-01.
This article will cover several important standards or events that are related to the two-kanji square ligature forms of the current era name, the previous three, and the forthcoming one.
Prompted by recent activity on Twitter, I assembled a new mapping file that correlates Adobe-Japan1-6 CIDs (aka glyphs) to Source Han ones, but only for the 6,879 characters in the JIS X 0208 standard. Because the Source Han Sans and Source Han Serif glyphs sets are different, they require separate columns in the mapping file. Also, for the 161 kanji in JIS X 0208 that have both JIS90 (aka JIS X 0208-1990) and JIS2004 (aka JIS X 0213:2004) forms, the CIDs that correspond to the JIS2004 forms are indicated, and those for the JIS90 forms are in brackets.
I spent part of last week preparing the Adobe-Japan1-7 character collection specification update, which will be released sometime in early April, shortly after Japan announces the name of their new era name. (Until the update is released next month, Adobe-Japan1-6 is still reflected in the open source project.) The announcement is expected to take place on 2019-04-01, and while this date represents the start of a new fiscal year in Japan, it is an unfortunate choice for elsewhere.
Anyway, while performing said update, I came across a reference to Adobe Tech Note #5094, Adobe CJKV Character Collections and CMaps for CID-Keyed Fonts, which I last updated a dozen years ago. I decided to use this as an opportunity to obsolete yet another Adobe Tech Note by incorporating its meaningful content into the open source CMap Resources project. I did precisely that last week, which involved updating its content in the process.