-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Overlapping Hebrew vowel signs #12
Comments
Thanks for at-including me on this, @dscorbett. Whenever I see a bug I always wonder whether and how it could have been caught automatically. Are the Noto fonts tested by fontbakery? If so, these bugs could be "formally" caught by passing the handy test string you made to a collision check done by @simoncozens' collidoscope. Simon and I are both using collidoscope under fontbakery on our own projects but I don't think there is any "official" use of collidoscope in fontbakery yet. On the substance of the bugs themselves, I'm not sure how important these bugs are, because I'm not sure how common it is to expect these special cases to be handled in fonts for non-Biblical Hebrew. (I'm guessing the Noto fonts only strive to support non-Biblical Hebrew.) These special cases do not include trope (aka cantillation or accent) marks. Thus one could say they are not Biblical Hebrew, according to one definition of Biblical Hebrew. These special cases contain only vowel marks. But the way that these special cases use the vowel marks is specific to a Biblical situation known as ketiv/qere. A ketiv/qere situation occurs when what is traditionally read aloud for a word (its qere) is incompatible with the consonants that are traditionally written for that word (its ketiv). Ketiv/qere situations are notated in a variety of ways, but in the cases we're concerned with here, the way they are notated is by putting the vowel marks of the (implied) qere on the consonants of the ketiv. So that's how you end up with the "illegal" situation of two vowels on a single consonant (lamed): in the (implied) qere, there's a yod consonant after the lamed that renders the situation at least sensible if not "legal." While fonts for non-Biblical Hebrew should probably have some degree of support for vowel marks (though even that is debatable), it may be unreasonable to expect such fonts to handle this weird, Biblically-specific use of vowel marks. |
@dscorbett : thank you -- keep these coming :-) no matter if they are non-Biblical Hebrew or Biblical Hebrew ... Sooner or later I'd like Noto to support Biblical Hebrew |
Note that the example string is given in normalized (e.g. NFD) order, which is an order not corresponding to the desired RTL visual mark (vowel) order. Some shapers (notably HarfBuzz) will transiently re-order the code points of this string to the desired order before the font ever sees it. But other shapers (notably MS DirectWrite (descended from MS UniScribe)) will leave the string alone. Thus, what behavior one should expect from the example string depends on the shaper in use. Thus, this bug could be improved in the following ways:
|
Just to clarify what @moshfrid reports above (assuming I understand it), it is referring to the code point sequence:
Or, hopefully equivalently, it refers to the code point sequence with the two "HEBREW POINT" elements reversed (this would be the normalized form (order), which most people find unintuitive but apparently not the authors of the Unicode standard, unless the combining classes they chose were not what they intended). And, to further clarify, the appearance is expected to be something like this: (Tastes may differ as to the details, for instance tastes may differ as to whether the qamats should float upward a bit compared to this example, but of course all agree that collision (overlap) between the qamats and the dagesh should not occur!) This construction appears about a hundred times in the Hebrew Bible; I don't have the exact number. An early example is the word אַרְאֶֽךָּ׃, shown as an image below: This example is from בראשית יב,א (Genesis 12:1). |
The thing is: Noto basically already supports Biblical Hebrew. Cantillation marks, unique letters such as the nun hafuḵa (U+5C6) have practically no use in regular Hebrew. |
Fonts
NotoRashiHebrew-Regular.otf
NotoSansHebrew-Regular.otf
NotoSerifHebrew-Regular.otf
Where the fonts came from, and when
Site: https://github.com/googlefonts/noto-fonts/blob/81b283b55b3e5b80ec0e410d4b246d3573e1c7de/unhinted/otf/NotoRashiHebrew/NotoRashiHebrew-Regular.otf
Site: https://github.com/googlefonts/noto-fonts/blob/81b283b55b3e5b80ec0e410d4b246d3573e1c7de/unhinted/otf/NotoSansHebrew/NotoSansHebrew-Regular.otf
Site: https://github.com/googlefonts/noto-fonts/blob/81b283b55b3e5b80ec0e410d4b246d3573e1c7de/unhinted/otf/NotoSerifHebrew/NotoSerifHebrew-Regular.otf
Date: 2021-04-15
Font versions
Noto Rashi Hebrew: Version 1.002
Noto Sans Hebrew: Version 3.000
Noto Serif Hebrew: Version 2.000
Issue
Various inflections of “Jerusalem” in the Tanakh include two vowel signs side by side after the lamed, but in Noto, the vowel signs overlap. See Firefox bug 662055 or ask @bdenckla for more information.
Character data
לְַמלְָמלִַםלִָם
U+05DC HEBREW LETTER LAMED
U+05B0 HEBREW POINT SHEVA
U+05B7 HEBREW POINT PATAH
U+05DE HEBREW LETTER MEM
U+05DC HEBREW LETTER LAMED
U+05B0 HEBREW POINT SHEVA
U+05B8 HEBREW POINT QAMATS
U+05DE HEBREW LETTER MEM
U+05DC HEBREW LETTER LAMED
U+05B4 HEBREW POINT HIRIQ
U+05B7 HEBREW POINT PATAH
U+05DD HEBREW LETTER FINAL MEM
U+05DC HEBREW LETTER LAMED
U+05B4 HEBREW POINT HIRIQ
U+05B8 HEBREW POINT QAMATS
U+05DD HEBREW LETTER FINAL MEM
Screenshots
The text was updated successfully, but these errors were encountered: