Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Change example 13. #15

Open
wants to merge 1 commit into
base: gh-pages
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
34 changes: 16 additions & 18 deletions index.html
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
<!DOCTYPE html>
<!DOCTYPE html>
<html dir="ltr" lang="en">
<head>
<meta charset="UTF-8">
Expand Down Expand Up @@ -701,35 +701,33 @@ <h4>Visually identical text that is not canonically equivalent</h4>
<thead>
<tr>
<th>Description</th>
<th colspan=4 style="text-align:center">Examples</th>
</tr>
<th style="text-align:center">Examples</th>
<th style="text-align:center">&nbsp;</th>
<th style="text-align:center">&nbsp;</th>
</tr>
</thead>
<tbody>
<tr>
<td><strong>Canonically equivalent alternatives</strong><br/>(differences resolved by Unicode Normalization)</td>
<td class="exampleChar">&#x625;</td>
<td><code class="uname" translate="no">U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW</code></td>
<td class="exampleChar">&#x627;&#x655;</td>
<td><code class="uname" translate="no">U+0627 ARABIC LETTER ALEF</code> + <code class="uname" translate="no">U+0655 ARABIC HAMZA BELOW</code></td>

<td class="exampleChar"><img src="https://r12a.github.io/c/Arabic/large/0625.png" alt="&#x065B;" style="height: 4rem; width: 4rem;"></td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Let's be sure to move the graphic to our repo and make the link relative?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, i didn't do that pending approval for the approach, but we will need to do that before we can publish the doc to TR.

<td><span class="codepoint" translate="no"><span class="uname">U+0625 ARABIC LETTER ALEF WITH HAMZA BELOW</span></span> Used in Kashmiri to indicate the sound <span class="ipa">ɨ</span> in word-initial position.</td>
<td><span class="codepoint" translate="no"><span class="uname">U+0627 ARABIC LETTER ALEF</span> + <span class="uname">U+0655 ARABIC HAMZA BELOW</span></span> Equally valid in Kashmiri, but the vowel is encoded separately here.</td>
</tr>
<tr>
<td><strong>Not canonically equivalent</strong><br/>(differences that <em>remain</em> after Unicode Normalization) Many of these are linked to user perception of whether the vowel is part of the base letter (<em lang="ar-Latn" translate="no">ijam</em>) vs. separable (<em lang="ar-Latn" translate="no">tashkil</em>)</td>
<td class="exampleChar">&#x6ce;</td>
<td><code class="uname" translate="no">U+06CE ARABIC LETTER YEH WITH SMALL V</code></td>
<td class="exampleChar">&#x6cc;&#x65a;</td>
<td><code class="uname" translate="no">U+06CC ARABIC LETTER FARSI YEH</code> + <code class="uname" translate="no">U+065A ARABIC VOWEL SIGN SMALL V ABOVE</code></td>
<td><strong>Not canonically equivalent</strong><br/>(differences that <em>remain</em> after Unicode Normalization) Many of these are linked to user perception of whether the vowel is part of the base letter (<em lang="ar-Latn" translate="no">ijam</em>) vs. separable (<em lang="ar-Latn" translate="no">tashkil</em> or <em lang="ar-Latn" translate="no">hamza</em>)</td>
<td class="exampleChar"><img src="https://r12a.github.io/c/Arabic/large/0681.png" alt="&#x065B;" style="height: 4rem; width: 4rem;"></td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

and here

<td><span class="codepoint" translate="no"><span class="uname">U+062D ARABIC LETTER HAH</span> + <span class="uname">U+0654 ARABIC HAMZA ABOVE</span></span> Preferred usage for Kashmiri to represent a consonant+vowel sequence <span class="ipa">hə</span>. The hamza represents the vowel sound.</td>
<td><span class="codepoint" translate="no"><span class="uname">U+0681 ARABIC LETTER HAH WITH HAMZA ABOVE</span></span> Used for Pashto consonant <span class="ipa">d͡z</span>. Can then be combined with other diacritics to indicate vowels.</td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Consider entitizing the IPA?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We should apply a style to the ipa that puts /../ around the marked up item and probably ensure that we apply a webfont that can show all Latin characters, such as Noto Sans or SIL Gentium Plus.

</tr>
<tr>
<td><strong>Confusables or spelling errors</strong><br/>these can be common in certain kinds of text due to gaps in keyboard support or due to a similarity in appearance</td>
<td class="exampleChar">&#x626;</td>
<td><code class="uname" translate="no">U+0626 ARABIC LETTER YEH WITH HAMZA ABOVE</code></td>
<td class="exampleChar">&#x6cc;&#x654;</td>
<td><code class="uname" translate="no">U+06CC ARABIC LETTER FARSI YEH</code> + <code class="uname" translate="no">U+0654 ARABIC HAMZA ABOVE</code></td>
<td class="exampleChar"><img src="https://r12a.github.io/c/Arabic/large/065B.png" alt="&#x065B;" style="height: 4rem; width: 4rem;"></td>
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

file location

<td><code class="uname" translate="no">U+0652 ARABIC SUKUN</code> In Kashmiri (as in Arabic) this indicates that no vowel follows the base consonant, but it looks like this when an appropriate Kashmiri (Nastaliq) font is used.</td>
<td><code class="uname" translate="no">U+065B ARABIC VOWEL SIGN INVERTED SMALL V ABOVE</code> Used for vowel sounds in African languages such as the Fulfulde vowel <span class="ipa">o</span>.</td>
</tr>
</tbody>
</table>
<p>(For more information, see Richard Ishida's doc <a href="https://r12a.github.io/scripts/arabic/ks.html#encoding">here</a>.)</p>
<p>(For more information, see Richard Ishida's doc <a href="https://r12a.github.io/scripts/arab/homographs#ijam_tashkil">here</a>.)</p>
</aside>

</section>
Expand Down