List the Chinese characters in Unicode? #605

xfq · 2024-02-13T09:13:03Z

It might be useful to list the Chinese characters in Unicode, like klreq and alreq:

The basic set (U+4E00-U+9FA5), i.e., ISO/IEC 10646:1993
CJK Unified Ideographs Extension A, i.e., U+3400-U+4DB5 in ISO/IEC 10646:1999
U+3400-U+9FFF (BMP Chinese characters)
U+20000-U+2FFFF, i.e., CJK Unified Ideographs Extension B to Extension F (Extension I in September 2023), commonly known as the Supplementary Ideographic Plane (SIP)
U+30000-U+3FFFF, i.e., CJK Unified Ideographs Extension G to Extension H, commonly known as the Tertiary Ideographic Plane (TIP)
CJK Compatibility Ideographs in the Basic Multilingual Plane (U+F900-U+FAFF)

yisibl · 2024-04-18T11:36:32Z

Should CJK Compatibility Ideographs be abandoned?

xfq · 2024-04-21T04:17:41Z

Should CJK Compatibility Ideographs be abandoned?

There seem to be some standard Chinese characters in CJK Compatibility Ideographs. @eisoch?

AmeroHan · 2024-08-31T08:13:39Z

U+3007 (〇) IDEOGRAPHIC NUMBER ZERO in CJK Symbols and Punctuation (U+3000..U+303F) is also considered a hanzi by
standards, dictionaries and UCS according to 「〇」算不算汉字？ - 知乎 (Is “〇” a hanzi? - Zhihu).

Additionally, outside the list @xfq provided above, there are some other characters with script property “Han” in UCD, such as U+3005 (々) IDEOGRAPHIC ITERATION MARK and Suzhou numerals (U+3021..U+3029). Should they be listed?

xfq · 2024-10-17T06:06:20Z

We should probably list the various character sets defined by each region too, like https://www.w3.org/TR/hani-lreq/#h_script_overview

r12a · 2024-10-17T16:46:29Z

You probably need to revisit this list now that Unicode 16.0 has been released.

Also, there are other Unicode blocks that may need mentioning if mention is made of compatibility block (which i suggest should be mentioned separately, if at all), such as CJK radicals, CJK strokes, kanbun, etc. See a list at https://www.unicode.org/charts/ under East Asian Scripts.

It's probably best to clearly define what types of character should go in the list, and to do that to first be clear about why we're listing characters (ie. who will use the list, and for what).

xfq added the i:encoding Characters & encoding label Apr 22, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

List the Chinese characters in Unicode? #605

List the Chinese characters in Unicode? #605

xfq commented Feb 13, 2024

yisibl commented Apr 18, 2024

xfq commented Apr 21, 2024

AmeroHan commented Aug 31, 2024 •

edited

Loading

xfq commented Oct 17, 2024

r12a commented Oct 17, 2024

List the Chinese characters in Unicode? #605

List the Chinese characters in Unicode? #605

Comments

xfq commented Feb 13, 2024

yisibl commented Apr 18, 2024

xfq commented Apr 21, 2024

AmeroHan commented Aug 31, 2024 • edited Loading

xfq commented Oct 17, 2024

r12a commented Oct 17, 2024

AmeroHan commented Aug 31, 2024 •

edited

Loading