List the Chinese characters in Unicode? #605
Comments
Should CJK Compatibility Ideographs be abandoned?
There seem to be some standard Chinese characters in CJK Compatibility Ideographs. @eisoch?
U+3007 (〇) IDEOGRAPHIC NUMBER ZERO in CJK Symbols and Punctuation (U+3000..U+303F) is also considered a hanzi by Additionally, outside the list @xfq provided above, there are some other characters with script property “Han” in UCD, such as U+3005 (々) IDEOGRAPHIC ITERATION MARK and Suzhou numerals (U+3021..U+3029). Should they be listed?
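To make the candidates in this comment concrete, here is a minimal Python sketch that prints each code point with its character name. The code points are hard-coded from the comment, because Python's standard `unicodedata` module does not expose the Script property. The names `EXTRA_HAN_CANDIDATES` and `describe` are hypothetical, chosen only for illustration.

```python
import unicodedata

# Code points named in the comment above. They are hard-coded here because
# the standard library's unicodedata module has no Script property lookup.
EXTRA_HAN_CANDIDATES = [0x3005, 0x3007, *range(0x3021, 0x302A)]


def describe(cp: int) -> str:
    """Return 'U+XXXX <char> <NAME>' for a single code point."""
    ch = chr(cp)
    return f"U+{cp:04X} {ch} {unicodedata.name(ch, '<unnamed>')}"


for cp in EXTRA_HAN_CANDIDATES:
    print(describe(cp))
```

Note that the formal character names for U+3021..U+3029 read HANGZHOU NUMERAL ONE through NINE, even though the characters are commonly called Suzhou numerals.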
We should probably list the various character sets defined by each region too, like https://www.w3.org/TR/hani-lreq/#h_script_overview
You probably need to revisit this list now that Unicode 16.0 has been released. Also, if the compatibility block is mentioned (which I suggest should be done separately, if at all), there are other Unicode blocks that may need mentioning, such as CJK radicals, CJK strokes, kanbun, etc. See the list at https://www.unicode.org/charts/ under East Asian Scripts. It's probably best to clearly define what types of character should go in the list and, to do that, to first be clear about why we're listing characters (i.e. who will use the list, and for what).
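One way to keep those character types apart is to tag each block with a category before deciding what goes in the list. The following is only a sketch: the block ranges are the published Unicode block ranges, but the table is deliberately partial (for example, Extension B and later are omitted), and the category labels and the names `BLOCKS` and `classify` are assumptions made for illustration, not anything defined by Unicode or by this document.

```python
from typing import Optional, Tuple

# (first code point, last code point, block name, category).
# The category labels are hypothetical; the table is intentionally partial.
BLOCKS = [
    (0x2E80, 0x2EFF, "CJK Radicals Supplement", "radical"),
    (0x2F00, 0x2FDF, "Kangxi Radicals", "radical"),
    (0x3190, 0x319F, "Kanbun", "annotation"),
    (0x31C0, 0x31EF, "CJK Strokes", "stroke"),
    (0x3400, 0x4DBF, "CJK Unified Ideographs Extension A", "unified"),
    (0x4E00, 0x9FFF, "CJK Unified Ideographs", "unified"),
    (0xF900, 0xFAFF, "CJK Compatibility Ideographs", "compatibility"),
]


def classify(cp: int) -> Optional[Tuple[str, str]]:
    """Return (block name, category) for cp, or None if it is not in the table."""
    for start, end, name, category in BLOCKS:
        if start <= cp <= end:
            return name, category
    return None


print(classify(0x4E2D))  # a code point in the main unified block
print(classify(0xF900))  # a code point in the compatibility block
```

A table like this makes the scoping question explicit: the list could then include only the "unified" rows and mention the others separately, or not at all.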
It might be useful to list the Chinese characters in Unicode, like klreq and alreq: