Wrap all acronyms of 3+ letters in abbr tags for CSS smallcap styling, excluding tt, code, and literal quotations.
n8willis committed Feb 26, 2023
1 parent 56a5d42 commit 34869e2
Showing 45 changed files with 324 additions and 321 deletions.
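
The transformation described in the commit message can be sketched in a few lines of Python. This is an illustrative sketch only, not the tooling that produced this commit: the function name `wrap_acronyms`, the regular expressions, and the choice to treat double-quoted text as a "literal quotation" are all assumptions.

```python
import re

# Spans that are left untouched (an assumed reading of the exclusions):
# fenced code, inline code, <tt>/<code> elements, text that is already
# inside <abbr>, and double-quoted literal sequences.
PROTECTED = re.compile(
    r"`{3}.*?`{3}"                        # fenced code blocks
    r"|`[^`\n]*`"                         # inline code spans
    r"|<(tt|code|abbr)\b[^>]*>.*?</\1>"   # tt, code, and existing abbr elements
    r'|"[^"\n]*"',                        # literal quotations
    re.DOTALL,
)

# A run of three or more capital letters standing as a whole word.
ACRONYM = re.compile(r"\b[A-Z]{3,}\b")


def _wrap(chunk: str) -> str:
    """Wrap every acronym in an unprotected chunk of text."""
    return ACRONYM.sub(lambda m: f"<abbr>{m.group(0)}</abbr>", chunk)


def wrap_acronyms(text: str) -> str:
    """Wrap acronyms in <abbr> tags, copying protected spans verbatim."""
    out = []
    pos = 0
    for match in PROTECTED.finditer(text):
        out.append(_wrap(text[pos:match.start()]))
        out.append(match.group(0))
        pos = match.end()
    out.append(_wrap(text[pos:]))
    return "".join(out)
```

Because already-wrapped acronyms count as protected spans, running the function twice gives the same result as running it once. Names such as `_220_MCM_` are skipped because the underscore is a word character, so there is no word boundary around the capital letters.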
4 changes: 2 additions & 2 deletions README.md
@@ -17,7 +17,7 @@ _<aside>Thanks also to the developers of HarfBuzz and AllSorts, plus many other
 At present, we are seeking comments and bugfixes on the Indic-script,
 Arabic-like, Hangul, Hebrew, Thai/Lao, Tibetan, Khmer, Myanmar,
-default, and USE documents. Interested readers and contributors can
+default, and <abbr>USE</abbr> documents. Interested readers and contributors can
 begin at the

 - [Indic General](opentype-shaping-indic-general.md)
@@ -31,7 +31,7 @@ begin at the
 - [Thai and Lao](opentype-shaping-thai-lao.md)
 - [Tibetan](opentype-shaping-tibetan.md)
 - [Myanmar](opentype-shaping-myanmar.md)
-- [Universal Shaping Engine (USE)](opentype-shaping-use.md)
+- [Universal Shaping Engine (<abbr>USE</abbr>)](opentype-shaping-use.md)
 - All complex scripts that are not handled by a dedicated
 script-specific shaping model
 - [Default](opentype-shaping-default.md)
10 changes: 5 additions & 5 deletions character-tables/character-tables-arabic.md
@@ -41,8 +41,8 @@ that sequences of adjacent marks can be reordered as required by the
 orthography.

 For Arabic, a subset of marks in the 220 and 230 classes are also
-designated _Modifier Combining Marks_ (MCM). These are denoted with
-_220_MCM_ and _230_MCM_ in the _Mark class_ column. The MCM marks are
+designated _Modifier Combining Marks_ (<abbr>MCM</abbr>). These are denoted with
+_220_MCM_ and _230_MCM_ in the _Mark class_ column. The <abbr>MCM</abbr> marks are
 treated differently during the mark-reordering stage.


@@ -643,7 +643,7 @@ fashion; shaping engines should cope with this situation gracefully.



-The combining grapheme joiner (CGJ) is primarily used to alter the
+The combining grapheme joiner (<abbr>CGJ</abbr>) is primarily used to alter the
 order in which adjacent marks are positioned during the
 mark-reordering stage, in order to adhere to the needs of a
 non-default language orthography.
@@ -661,12 +661,12 @@ sequence "ZWJ,_Letter_,ZWJ" would be used.

 <!--- Zero-Width Non Joiner explanation --->

-The right-to-left mark (RLM) and left-to-right mark (LRM) are used by
+The right-to-left mark (<abbr>RLM</abbr>) and left-to-right mark (<abbr>LRM</abbr>) are used by
 the Unicode bidirectionality algorithm (BiDi) to indicate the points
 in a text run at which the writing direction changes.


-<!--- How shaping is affected by the LTR and RTL markers explanation --->
+<!--- How shaping is affected by the <abbr>LTR</abbr> and <abbr>RTL</abbr> markers explanation --->


 The no-break space is primarily used to display those codepoints that
2 changes: 1 addition & 1 deletion character-tables/character-tables-bengali.md
@@ -298,7 +298,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-devanagari.md
@@ -331,7 +331,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-gujarati.md
@@ -299,7 +299,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-gurmukhi.md
@@ -299,7 +299,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-kannada.md
@@ -299,7 +299,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
13 changes: 7 additions & 6 deletions character-tables/character-tables-khmer.md
@@ -262,12 +262,13 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space is primarily used to display those codepoints that
-are defined as non-spacing (marks, dependent vowels (matras),
-below-base consonant forms, and post-base consonant forms) in an
-isolated context, as an alternative to displaying them superimposed on
-the dotted-circle placeholder. These sequences will match
-"NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display
+those codepoints that are defined as non-spacing (marks, dependent
+vowels (matras), below-base consonant forms, and post-base consonant
+forms) in an isolated context, as an alternative to displaying them
+superimposed on the dotted-circle placeholder. These sequences will
+match "NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or
+"NBSP,_matra_".

 In addition to general punctuation, runs of Khmer text often use the
 danda (`U+0964`) and double danda (`U+0965`) punctuation marks from
12 changes: 6 additions & 6 deletions character-tables/character-tables-malayalam.md
@@ -299,10 +299,10 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space is primarily used to display those codepoints that
-are defined as non-spacing (marks, dependent vowels (matras),
-below-base consonant forms, and post-base consonant forms) in an
-isolated context, as an alternative to displaying them superimposed on
-the dotted-circle placeholder. These sequences will match
-"NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display
+those codepoints that are defined as non-spacing (marks, dependent
+vowels (matras), below-base consonant forms, and post-base consonant
+forms) in an isolated context, as an alternative to displaying them
+superimposed on the dotted-circle placeholder. These sequences will
+match "NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
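
The three isolated-display sequences named in this paragraph can be checked by mapping each codepoint to a shaping category and comparing the result against the allowed patterns. The sketch below is illustrative only: `CATEGORY` is a hypothetical four-entry stand-in for the full Malayalam character table, and the function names are assumptions rather than part of the specification.

```python
NBSP = "\u00A0"  # no-break space
ZWJ = "\u200D"   # zero-width joiner

# Hypothetical, deliberately tiny category table. A real implementation
# would cover every codepoint listed in the script's character table.
CATEGORY = {
    "\u0D15": "consonant",  # MALAYALAM LETTER KA
    "\u0D4D": "halant",     # MALAYALAM SIGN VIRAMA
    "\u0D3E": "matra",      # MALAYALAM VOWEL SIGN AA
    "\u0D02": "mark",       # MALAYALAM SIGN ANUSVARA
}

# The three sequences named in the text, written as category tuples.
PATTERNS = {
    ("nbsp", "zwj", "halant", "consonant"),
    ("nbsp", "mark"),
    ("nbsp", "matra"),
}


def classify(ch: str) -> str:
    """Return the shaping category of a single codepoint."""
    if ch == NBSP:
        return "nbsp"
    if ch == ZWJ:
        return "zwj"
    return CATEGORY.get(ch, "other")


def is_isolated_display_sequence(text: str) -> bool:
    """True if the whole string is one of the NBSP-based sequences."""
    return tuple(classify(ch) for ch in text) in PATTERNS
```

For example, `is_isolated_display_sequence("\u00A0\u0D3E")` accepts a no-break space followed by a matra, while a no-break space followed directly by a consonant is rejected.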

4 changes: 2 additions & 2 deletions character-tables/character-tables-mongolian.md
@@ -38,8 +38,8 @@ that sequences of adjacent marks can be reordered as required by the
 orthography.

 For Mongolian, a subset of marks in the 220 and 230 classes are also
-designated _Modifier Combining Marks_ (MCM). These are denoted with
-_220_MCM_ and _230_MCM_ in the _Mark class_ column. The MCM marks are
+designated _Modifier Combining Marks_ (<abbr>MCM</abbr>). These are denoted with
+_220_MCM_ and _230_MCM_ in the _Mark class_ column. The <abbr>MCM</abbr> marks are
 treated differently during the mark-reordering stage.

12 changes: 6 additions & 6 deletions character-tables/character-tables-myanmar.md
@@ -399,10 +399,10 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space is primarily used to display those codepoints that
-are defined as non-spacing (marks, dependent vowels (matras),
-below-base consonant forms, and post-base consonant forms) in an
-isolated context, as an alternative to displaying them superimposed on
-the dotted-circle placeholder. These sequences will match
-"NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display
+those codepoints that are defined as non-spacing (marks, dependent
+vowels (matras), below-base consonant forms, and post-base consonant
+forms) in an isolated context, as an alternative to displaying them
+superimposed on the dotted-circle placeholder. These sequences will
+match "NBSP,ZWJ,Halant,_Consonant_", "NBSP,_mark_", or "NBSP,_matra_".

6 changes: 3 additions & 3 deletions character-tables/character-tables-nko.md
@@ -144,7 +144,7 @@ fashion; shaping engines should cope with this situation gracefully.



-The combining grapheme joiner (CGJ) is primarily used to alter the
+The combining grapheme joiner (<abbr>CGJ</abbr>) is primarily used to alter the
 order in which adjacent marks are positioned during the
 mark-reordering stage, in order to adhere to the needs of a
 non-default language orthography.
@@ -162,12 +162,12 @@ sequence "ZWJ,_Letter_,ZWJ" would be used.

 <!--- Zero-Width Non Joiner explanation --->

-The right-to-left mark (RLM) and left-to-right mark (LRM) are used by
+The right-to-left mark (<abbr>RLM</abbr>) and left-to-right mark (<abbr>LRM</abbr>) are used by
 the Unicode bidirectionality algorithm (BiDi) to indicate the points
 in a text run at which the writing direction changes.


-<!--- How shaping is affected by the LTR and RTL markers explanation --->
+<!--- How shaping is affected by the <abbr>LTR</abbr> and <abbr>RTL</abbr> markers explanation --->


 The no-break space is primarily used to display those codepoints that
2 changes: 1 addition & 1 deletion character-tables/character-tables-oriya.md
@@ -299,7 +299,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-sinhala.md
@@ -323,7 +323,7 @@ Sinhala text. The <abbr>ZWNJ</abbr> is referenced below in various regular
 expressions and shaping rules, however, because it is used by other
 Indic scripts.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
10 changes: 5 additions & 5 deletions character-tables/character-tables-syriac.md
@@ -38,8 +38,8 @@ that sequences of adjacent marks can be reordered as required by the
 orthography.

 For Syriac, a subset of marks in the 220 and 230 classes are also
-designated _Modifier Combining Marks_ (MCM). These are denoted with
-_220_MCM_ and _230_MCM_ in the _Mark class_ column. The MCM marks are
+designated _Modifier Combining Marks_ (<abbr>MCM</abbr>). These are denoted with
+_220_MCM_ and _230_MCM_ in the _Mark class_ column. The <abbr>MCM</abbr> marks are
 treated differently during the mark-reordering stage.


@@ -197,7 +197,7 @@ does not encode a separate kashida character.
 | | | | | | |


-The combining grapheme joiner (CGJ) is primarily used to alter the
+The combining grapheme joiner (<abbr>CGJ</abbr>) is primarily used to alter the
 order in which adjacent marks are positioned during the
 mark-reordering stage, in order to adhere to the needs of a
 non-default language orthography.
@@ -215,12 +215,12 @@ sequence "ZWJ,_Letter_,ZWJ" would be used.

 <!--- Zero-Width Non Joiner explanation --->

-The right-to-left mark (RLM) and left-to-right mark (LRM) are used by
+The right-to-left mark (<abbr>RLM</abbr>) and left-to-right mark (<abbr>LRM</abbr>) are used by
 the Unicode bidirectionality algorithm (BiDi) to indicate the points
 in a text run at which the writing direction changes.


-<!--- How shaping is affected by the LTR and RTL markers explanation --->
+<!--- How shaping is affected by the <abbr>LTR</abbr> and <abbr>RTL</abbr> markers explanation --->


 The no-break space is primarily used to display those codepoints that
2 changes: 1 addition & 1 deletion character-tables/character-tables-tamil.md
@@ -399,7 +399,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
2 changes: 1 addition & 1 deletion character-tables/character-tables-telugu.md
@@ -299,7 +299,7 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space (NBSP) is primarily used to display those
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display those
 codepoints that are defined as non-spacing (marks, dependent vowels
 (matras), below-base consonant forms, and post-base consonant forms)
 in an isolated context, as an alternative to displaying them
12 changes: 6 additions & 6 deletions character-tables/character-tables-tibetan.md
@@ -318,9 +318,9 @@ A secondary usage of the zero-width joiner is to prevent the formation of
 where an initial "Ra,Halant" sequence without the zero-width joiner
 otherwise would.

-The no-break space is primarily used to display those codepoints that
-are defined as non-spacing (marks, dependent vowels (matras),
-below-base consonant forms, and post-base consonant forms) in an
-isolated context, as an alternative to displaying them superimposed on
-the dotted-circle placeholder. These sequences will match
-"NBSP,ZWJ,Halant,_consonant_", "NBSP,_mark_", or "NBSP,_matra_".
+The no-break space (<abbr>NBSP</abbr>) is primarily used to display
+those codepoints that are defined as non-spacing (marks, dependent
+vowels (matras), below-base consonant forms, and post-base consonant
+forms) in an isolated context, as an alternative to displaying them
+superimposed on the dotted-circle placeholder. These sequences will
+match "NBSP,ZWJ,Halant,_consonant_", "NBSP,_mark_", or "NBSP,_matra_".
20 changes: 10 additions & 10 deletions errata.md
@@ -8,15 +8,15 @@ specification documents.
 **Table of Contents**

 - [Unicode](#unicode)
-- [ZWJ and ZWNJ](#zwj-and-zwnj)
-- [Scope of ZWJ and ZWNJ](#scope-of-zwj-and-zwnj)
-- [ZWJ in redundant ligature lookups](#zwj-in-redundant-ligature-lookups)
+- [<abbr>ZWJ</abbr> and <abbr>ZWNJ</abbr>](#zwj-and-zwnj)
+- [Scope of <abbr>ZWJ</abbr> and <abbr>ZWNJ</abbr>](#scope-of-zwj-and-zwnj)
+- [<abbr>ZWJ</abbr> in redundant ligature lookups](#zwj-in-redundant-ligature-lookups)
 - [Emoji](#emoji)
 - [Skin-tone permutations](#skin-tone-permutations)
 - [Gender permutations](#gender-permutations)
 - [OpenType](#opentype)
-- [Null offsets in GSUB and GPOS](#null-offsets-in-gsub-and-gpos)
-- [Sorting of GSUB and GPOS lookups](#sorting-of-gsub-and-gpos-lookups)
+- [Null offsets in <abbr>GSUB</abbr> and <abbr>GPOS</abbr>](#null-offsets-in-gsub-and-gpos)
+- [Sorting of <abbr>GSUB</abbr> and <abbr>GPOS</abbr> lookups](#sorting-of-gsub-and-gpos-lookups)
 - [Per-script applicability of feature tags](#per-script-applicability-of-feature-tags)
 - [Ordering of post-base and below-base consonants in Indic2 base-consonant determination](#ordering-of-post-base-and-below-base-consonants-in-indic2-base-consonant-determination)
 - [Lookup behavior](#lookup-behavior)
@@ -32,9 +32,9 @@ specification documents.

 This section lists errata pertaining to the Unicode Standard.

-### ZWJ and ZWNJ ###
+### <abbr>ZWJ</abbr> and <abbr>ZWNJ</abbr> ###

-#### Scope of ZWJ and ZWNJ ####
+#### Scope of <abbr>ZWJ</abbr> and <abbr>ZWNJ</abbr> ####

 Unicode provides the Zero Width Joiner (<abbr>ZWJ</abbr>) and Zero Width Non-Joiner
 (<abbr>ZWNJ</abbr>) control characters so that a text sequence can "request a
@@ -58,7 +58,7 @@ in the same "a,b,ZWNJ,c,d" sequence should inhibit the application of
 the ligature between "c" and "d".


-#### ZWJ in redundant ligature lookups ####
+#### <abbr>ZWJ</abbr> in redundant ligature lookups ####

 An "Implementation Notes" section in chapter 23.2 of the Unicode
 Standard says that font vendors should add <abbr>ZWJ</abbr> sequences to ligature
@@ -113,7 +113,7 @@ sequence "Woman,ZWJ,Man,ZWJ,Girl".

 This section lists errata pertaining to the OpenType specification.

-### Null offsets in GSUB and GPOS ###
+### Null offsets in <abbr>GSUB</abbr> and <abbr>GPOS</abbr> ###

 The headers of the <abbr>GSUB</abbr> and <abbr>GPOS</abbr> tables include fields that contain
 the offsets at which other structures within the font binary are
@@ -135,7 +135,7 @@ Furthermore, font-validation functions may overwrite a `NULL` into an
 offset field if the original value encountered was invalid.


-### Sorting of GSUB and GPOS lookups ###
+### Sorting of <abbr>GSUB</abbr> and <abbr>GPOS</abbr> lookups ###

 The OpenType specification requires that lookups in the <abbr>GSUB</abbr> table
 must be sorted into numeric order before they are applied.
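
As a rough illustration of the sorting requirement stated in this paragraph, the sketch below collects the lookup indices referenced by a set of active features and returns them in ascending numeric order. The feature-to-lookup mapping and the function name are hypothetical; a real shaper would read these indices from the font's feature and lookup lists, and the sketch covers only the ordering step stated here.

```python
def lookups_in_application_order(feature_lookups, active_features):
    """Return the de-duplicated lookup indices for the active features,
    sorted into ascending numeric order.

    feature_lookups: dict mapping a feature tag to a list of lookup indices
                     (hypothetical stand-in for data parsed from a font).
    active_features: iterable of feature tags to apply.
    """
    indices = set()
    for tag in active_features:
        # Features missing from the mapping contribute no lookups.
        indices.update(feature_lookups.get(tag, ()))
    return sorted(indices)


# Example with made-up lookup indices.
example = {"liga": [7, 2], "ccmp": [0, 2], "kern": [5]}
ordered = lookups_in_application_order(example, ["liga", "ccmp"])
```

A lookup shared by two features appears only once in the result, and the order in which the features were requested does not affect the order of the lookups.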