Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Errors in definition of orthographic syllable #76

Open
NorbertLindenberg opened this issue Apr 22, 2024 · 2 comments
Open

Errors in definition of orthographic syllable #76

NorbertLindenberg opened this issue Apr 22, 2024 · 2 comments
Labels
bug Something isn't working

Comments

@NorbertLindenberg
Copy link

The glossary entry for "orthographic syllable" describes it as a "typographic character unit", which in turn is described as a unit "that is indivisible with respect to a particular typographic operation" such as line breaking. Orthographic syllables are in fact not always indivisible. For example, in line breaking, both Batak and Tulu-Tigalari allow line breaks within orthographic syllables (see Line breaking at orthographic syllable boundaries).

The entry also describes an orthographic syllable as a "unit that includes more than one grapheme cluster". That's not correct – simple orthographic syllables may consist of a single grapheme cluster. "क" is both an orthographic syllable and a grapheme cluster. So, change to "one or more grapheme clusters".

The entry also states that "this term is used but not defined in the Unicode Standard". That's no longer true: Both section 6.1 Writing Systems of the Core Specification and the Unicode glossary now define the term.

@aphillips aphillips added the bug Something isn't working label Apr 22, 2024
@aphillips
Copy link
Collaborator

Need to add "O" to the alphabetic index at the top of the glossary.

Discussing in the 2024-04-25 teleconference.

@r12a
Copy link
Contributor

r12a commented Jun 19, 2024

hi @NorbertLindenberg. Thanks for your comments.

The glossary entry for "orthographic syllable" describes it as a "typographic character unit", which in turn is described as a unit "that is indivisible with respect to a particular typographic operation" such as line breaking. Orthographic syllables are in fact not always indivisible. For example, in line breaking, both Batak and Tulu-Tigalari allow line breaks within orthographic syllables (see Line breaking at orthographic syllable boundaries).

This is a quote from a CSS spec, so the comment should be made there. But note that the quoted text provides a Thai example where an orthographic syllable (and grapheme cluster!) is split for letter-spacing - which is a script-specific variant, too.

The entry also describes an orthographic syllable as a "unit that includes more than one grapheme cluster". That's not correct – simple orthographic syllables may consist of a single grapheme cluster. "क" is both an orthographic syllable and a grapheme cluster. So, change to "one or more grapheme clusters".

Fix proposed.

The entry also states that "this term is used but not defined in the Unicode Standard". That's no longer true: Both section 6.1 Writing Systems of the Core Specification and the Unicode glossary now define the term.

Fix proposed.

PR is at #77

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants