toki pona ASCII syllabary: Difference between revisions

no edit summary
Line 1:
{{Figure|{{tok|<code>j l bn e Oq Tw sn</code>}}|{{tp|jan li pana e moku tawa sina}}}}
The '''toki pona ASCII syllabary''' by {{tok|[[jan Misali]]}}, or '''{{tp|sitelen Aki}}'''{{citation needed|date=2024-01-10|reason=Origin of this name?}}, is a [[writing system]] that assigns each [[phonotactic]]ally allowed [[syllable]] in [[Toki Pona]] to a single {{w|ASCII}} character. It is designed so that many [[particle]]s and short, common [[content word]]s become recognizable [[Abbreviations|initials]], or turn into or incorporate meaningful symbols.
 
==Syllables==
The '''toki pona ASCII syllabary''' by {{tok|[[jan Misali]]}}, or '''{{tp|sitelen Aki}}'''{{citation needed|date=2024-01-10|reason=Origin of this name?}}, is a [[writing system]] that assigns each [[phonotactic]]ally allowed [[syllable]] in [[Toki Pona]] to a single {{w|ASCII}} character.
Highlighted cells correspond to syllables that do not occur in any {{tp|[[nimi pu]]}} (but that can appear in [[name]]s, and some of which would later appear in {{tp|[[nimi ku]]}}). This affected the allocation of the characters, with many non-alphanumeric symbols being given to these syllables.
 
The syllabary can be used to [[Text compression|compress text]], with each character being reducible to 7 {{w|bit}}s. Word lengths range from unchanged (for one-letter words) to a third as long, such as {{tok|<code>C%</code>}} for {{tp|[[sinpin]]}}. A major limiting factor for the {{w|Data compression ratio|compression ratio}} is the need to separate words, which is generally done with the ASCII {{w|Whitespace character|space}} (<code> </code>). {{tok|Misali}} also suggests using the syllabary as an {{w|input method}} for other writing systems.
 
The only unused printable, non-whitespace ASCII characters are the period (<code>.</code>), generally used as a sentence separator, and the colon (<code>:</code>). Any other [[punctuation]] would be lost due to conflicting with syllable characters or being out of the ASCII range. Similarly, due to the significant {{w|letter case}} and the use of all ASCII {{w|bracket}}s, there is no clear way to specify proper [[name]]s with this system. It has been suggested to use the colon for name marking where necessary, using the period as the only sentence separator.<ref>{{cite Discord|url=//discord.com/channels/301377942062366741/1156485500102778900/1194802490235953183|thread={{tok|sitelen Aki pi jan Misali · sE$ ak P j mcl}}|channel={{tok|toki-suli}}|server={{tp|ma pona pi toki pona}}|author={{tok|kulupu Menasewi}}|access-date=2024-01-15|quote=}}</ref>
 
==Syllables==
{|class="wikitable" style="text-align:center;"
!rowspan="2"|
Line 33 ⟶ 30:
|{{tok|<code>U</code>}}
|{{tok|<code>&</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">7</code>}}}}
|{{tok|<code>A</code>}}
|-
Line 42 ⟶ 39:
|{{tok|<code>O</code>}}
|{{tok|<code>M</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">8</code>}}}}
|{{tok|<code>3</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">9</code>}}}}
|{{tok|<code>@</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">`</code>}}}}
|-
!{{tp|n-}}
Line 54 ⟶ 51:
|{{tok|<code>*</code>}}
|{{tok|<code>n</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">(</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">)</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">-</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">_</code>}}}}
|{{tok|<code>#</code>}}
|-
Line 67 ⟶ 64:
|{{tok|<code>b</code>}}
|{{tok|<code>%</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">+</code>}}}}
|{{tok|<code>F</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">&equals;</code>}}}}
|{{tok|<code>6</code>}}
|-
Line 79 ⟶ 76:
|{{tok|<code>T</code>}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">d</code>}}}}
|{{tok|<code>x</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">D</code>}}}}
|{{tok|<code>X</code>}}
|-
Line 91 ⟶ 88:
|{{tok|<code>K</code>}}
|{{tok|<code>Q</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">{</code>}}}}
|{{tok|<code>g</code>}}
|{{tok|<code>h</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">}</code>}}}}
|-
!{{tp|s-}}
Line 104 ⟶ 101:
|{{tok|<code>C</code>}}
|{{tok|<code>4</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">[</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">\</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">]</code>}}}}
|-
!{{tp|w-}}
Line 114 ⟶ 111:
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>w</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">&VerticalLine;</code>}}}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>v</code>}}
Line 127 ⟶ 124:
|{{tok|<code>L</code>}}
|{{tok|<code>I</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">;</code>}}}}
|{{tok|<code>$</code>}}
|{{tok|<code>R</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">'</code>}}}}
|-
!{{tp|j-}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">,</code>}}}}
|{{tok|<code>y</code>}}
|{{tok|<code>J</code>}}
|{{tok|<code>Y</code>}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;"><</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">"</code>}}}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">></code>}}}}
|{{tok|<code>j</code>}}
|}
 
==Punctuation==
Only three printable ASCII characters are not assigned a syllable in the chart. In practice, they are almost always used as [[punctuation]], in the same way as in {{tp|[[sitelen Lasina]]}}.
 
The ASCII {{w|Whitespace character|space}} (<code> </code>) is used to separate words. The period (<code>.</code>) and the colon (<code>:</code>) are generally used as sentence separators. Any other [[punctuation]] would be lost due to conflicting with syllable characters or being out of the ASCII range.
 
The only unused printable, non-whitespace ASCII characters are the period (<code>.</code>), generally used as a sentence separator, and the colon (<code>:</code>). Any other [[punctuation]] would be lost due to conflicting with syllable characters or being out of the ASCII range. Similarly, due to the significant {{w|letter case}} and the use of all ASCII {{w|bracket}}s, there is no clear way to specify proper [[name]]s with this system. It has been suggested to instead use the colon for name marking where necessary, using the period as the only sentence separator, even after sentence-linking {{tp|[[ni]]}}.<ref>{{cite Discord|url=//discord.com/channels/301377942062366741/1156485500102778900/1194802490235953183|thread={{tok|sitelen Aki pi jan Misali · sE$ ak P j mcl}}|channel={{tok|toki-suli}}|server={{tp|ma pona pi toki pona}}|author={{tok|kulupu Menasewi}}|access-date=2024-01-15|quote=}}</ref>
 
==Usage==
The syllabary can be used to [[Text compression|compress text]], with each character being reducible to 7 {{w|bit}}s. Word lengths range from unchanged (for one-letter words) to a third as long, such as {{tok|<code>C%</code>}} for {{tp|[[sinpin]]}}. A major limiting factor for the {{w|Data compression ratio|compression ratio}} is the need to separate words, which is generally done with the ASCII {{w|Whitespace character|space}} (<code> </code>). {{tok|Misali}} also suggests using the syllabary as an {{w|input method}} for other writing systems.
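
The 7-bit claim can be illustrated with a minimal sketch. The function names <code>pack7</code> and <code>unpack7</code> are hypothetical and not part of any published tool; the sketch only assumes that the input is plain ASCII, and it packs the example sentence from the figure at the top of the article.

```python
def pack7(text: str) -> bytes:
    """Pack ASCII text into bytes using 7 bits per character (hypothetical helper)."""
    acc = 0    # bit accumulator
    nbits = 0  # number of pending bits held in acc
    out = bytearray()
    for ch in text:
        code = ord(ch)
        if code > 0x7F:
            raise ValueError(f"non-ASCII character: {ch!r}")
        acc = (acc << 7) | code
        nbits += 7
        while nbits >= 8:
            nbits -= 8
            out.append((acc >> nbits) & 0xFF)
        acc &= (1 << nbits) - 1  # keep only the bits not yet written
    if nbits:
        out.append((acc << (8 - nbits)) & 0xFF)  # pad the last byte with zero bits
    return bytes(out)


def unpack7(data: bytes, length: int) -> str:
    """Inverse of pack7; length is the original character count."""
    acc = 0
    nbits = 0
    chars = []
    for byte in data:
        acc = (acc << 8) | byte
        nbits += 8
        while nbits >= 7 and len(chars) < length:
            nbits -= 7
            chars.append(chr((acc >> nbits) & 0x7F))
        acc &= (1 << nbits) - 1
    return "".join(chars)


syllabary = "j l bn e Oq Tw sn"               # 17 characters, spaces included
lasina = "jan li pana e moku tawa sina"       # 28 characters in sitelen Lasina
packed = pack7(syllabary)
print(len(lasina), len(syllabary), len(packed))
```

In this example, 6 of the 17 characters are word-separating spaces, which shows how much of the packed output goes to word separation rather than syllables.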
 
{{tok|Misali}} also suggests using the syllabary as an {{w|input method}} for other writing systems.
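
As a rough illustration of the input-method idea, the sketch below expands syllabary text into {{tp|sitelen Lasina}}. The table is deliberately partial: it contains only the assignments that can be read off the two examples in this article (the figure sentence and <code>C%</code> for {{tp|sinpin}}), and the names <code>SYLLABLES</code> and <code>to_sitelen_lasina</code> are hypothetical. Passing the period and colon through unchanged is also an assumption of this sketch.

```python
# Partial table, inferred only from the examples in this article.
SYLLABLES = {
    "j": "jan", "l": "li", "b": "pa", "n": "na", "e": "e",
    "O": "mo", "q": "ku", "T": "ta", "w": "wa", "s": "si",
    "C": "sin", "%": "pin",
}
PUNCTUATION = ".:"  # assumption: copied to the output as-is


def to_sitelen_lasina(text: str) -> str:
    """Expand each syllabary character into its syllable, word by word."""
    words = []
    for word in text.split(" "):
        syllables = []
        for ch in word:
            if ch in PUNCTUATION:
                syllables.append(ch)
            elif ch in SYLLABLES:
                syllables.append(SYLLABLES[ch])
            else:
                raise KeyError(f"character {ch!r} is not in this partial table")
        words.append("".join(syllables))
    return " ".join(words)


print(to_sitelen_lasina("j l bn e Oq Tw sn"))
```

A complete converter would need the full chart from the Syllables section; unknown characters raise an error here rather than being guessed.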
 
The syllabary is rarely used as a writing system for general conversation.
 
==See also==