toki pona ASCII syllabary: Difference between revisions

From sona pona, the Toki Pona wiki
Content added Content deleted
No edit summary
No edit summary
Line 1: Line 1:
{{Figure|{{tok|<code>j l bn e Oq Tw sn</code>}}|{{tp|jan li pana e moku tawa sina}}}}
{{Figure|{{tok|<code>j l bn e Oq Tw sn</code>}}|{{tp|jan li pana e moku tawa sina}}}}
The '''toki pona ASCII syllabary''' by {{tok|[[jan Misali]]}}, or '''{{tp|sitelen Aki}}'''{{citation needed|date=2024-01-10|reason=Origin of this name?}}, is a [[writing system]] that assigns each [[phonotactic]]ally allowed [[syllable]] in [[Toki Pona]] to a single {{w|ASCII}} character. It is designed so that many [[particle]]s and short and common [[content word]]s become recognizable [[Abbreviations|initials]], or become or integrate meaningful symbols.


==Syllables==
The '''toki pona ASCII syllabary''' by {{tok|[[jan Misali]]}}, or '''{{tp|sitelen Aki}}'''{{citation needed|date=2024-01-10|reason=Origin of this name?}}, is a [[writing system]] that assigns each [[phonotactic]]ally allowed [[syllable]] in [[Toki Pona]] to a single {{w|ASCII}} character.
Highlighted cells correspond to syllables that do not occur in any {{tp|[[nimi pu]]}} (but that can appear in [[name]]s, and some of which would later appear in {{tp|[[nimi ku]]}}). This affected the allocation of the characters, with many non-alphanumeric symbols being given to these syllables.


The syllabary can be used to [[Text compression|compress text]], with each character being reducible to 7 {{w|bit}}s. Word lengths range from unchanged (for one-letter words) to a third as long, such as {{tok|<code>C%</code>}} for {{tp|[[sinpin]]}}. A major limiting factor for the {{w|Data compression ratio|compression ratio}} is the need to separate words, which is generally done with the ASCII {{w|Whitespace character|space}} (<code> </code>). {{tok|Misali}} also suggests using the syllabary as an {{w|input method}} for other writing systems.

The only unused printable, non-whitespace ASCII characters are the period (<code>.</code>), generally used as a sentence separator, and the colon (<code>:</code>). Any other [[punctuation]] would be lost due to conflicting with syllable characters or being out of the ASCII range. Similarly, due to the significant {{w|letter case}} and the use of all ASCII {{w|bracket}}s, there is no clear way to specify proper [[name]]s with this system. It has been suggested to use the colon for name marking where necessary, using the period as the only sentence separator.<ref>{{cite Discord|url=//discord.com/channels/301377942062366741/1156485500102778900/1194802490235953183|thread={{tok|sitelen Aki pi jan Misali · sE$ ak P j mcl}}|channel={{tok|toki-suli}}|server={{tp|ma pona pi toki pona}}|author={{tok|kulupu Menasewi}}|access-date=2024-01-15|quote=}}</ref>

==Syllables==
{|class="wikitable" style="text-align:center;"
{|class="wikitable" style="text-align:center;"
!rowspan="2"|
!rowspan="2"|
Line 33: Line 30:
|{{tok|<code>U</code>}}
|{{tok|<code>U</code>}}
|{{tok|<code>&</code>}}
|{{tok|<code>&</code>}}
|{{tok|<code>7</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">7</code>}}}}
|{{tok|<code>A</code>}}
|{{tok|<code>A</code>}}
|-
|-
Line 42: Line 39:
|{{tok|<code>O</code>}}
|{{tok|<code>O</code>}}
|{{tok|<code>M</code>}}
|{{tok|<code>M</code>}}
|{{tok|<code>8</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">8</code>}}}}
|{{tok|<code>3</code>}}
|{{tok|<code>3</code>}}
|{{tok|<code>9</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">9</code>}}}}
|{{tok|<code>@</code>}}
|{{tok|<code>@</code>}}
|{{tok|<code>`</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">`</code>}}}}
|-
|-
!{{tp|n-}}
!{{tp|n-}}
Line 54: Line 51:
|{{tok|<code>*</code>}}
|{{tok|<code>*</code>}}
|{{tok|<code>n</code>}}
|{{tok|<code>n</code>}}
|{{tok|<code>(</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">(</code>}}}}
|{{tok|<code>)</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">)</code>}}}}
|{{tok|<code>-</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">-</code>}}}}
|{{tok|<code>_</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">_</code>}}}}
|{{tok|<code>#</code>}}
|{{tok|<code>#</code>}}
|-
|-
Line 67: Line 64:
|{{tok|<code>b</code>}}
|{{tok|<code>b</code>}}
|{{tok|<code>%</code>}}
|{{tok|<code>%</code>}}
|{{tok|<code>+</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">+</code>}}}}
|{{tok|<code>F</code>}}
|{{tok|<code>F</code>}}
|{{tok|<code>&equals;</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">&equals;</code>}}}}
|{{tok|<code>6</code>}}
|{{tok|<code>6</code>}}
|-
|-
Line 79: Line 76:
|{{tok|<code>T</code>}}
|{{tok|<code>T</code>}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>d</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">d</code>}}}}
|{{tok|<code>x</code>}}
|{{tok|<code>x</code>}}
|{{tok|<code>D</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">D</code>}}}}
|{{tok|<code>X</code>}}
|{{tok|<code>X</code>}}
|-
|-
Line 91: Line 88:
|{{tok|<code>K</code>}}
|{{tok|<code>K</code>}}
|{{tok|<code>Q</code>}}
|{{tok|<code>Q</code>}}
|{{tok|<code>{</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">{</code>}}}}
|{{tok|<code>g</code>}}
|{{tok|<code>g</code>}}
|{{tok|<code>h</code>}}
|{{tok|<code>h</code>}}
|{{tok|<code>}</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">}</code>}}}}
|-
|-
!{{tp|s-}}
!{{tp|s-}}
Line 104: Line 101:
|{{tok|<code>C</code>}}
|{{tok|<code>C</code>}}
|{{tok|<code>4</code>}}
|{{tok|<code>4</code>}}
|{{tok|<code>[</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">[</code>}}}}
|{{tok|<code>\</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">\</code>}}}}
|{{tok|<code>]</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">]</code>}}}}
|-
|-
!{{tp|w-}}
!{{tp|w-}}
Line 114: Line 111:
|{{N/A|{{space}}|link=wuwojiti}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>w</code>}}
|{{tok|<code>w</code>}}
|{{tok|<code>&VerticalLine;</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">&VerticalLine;</code>}}}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>v</code>}}
|{{tok|<code>v</code>}}
Line 127: Line 124:
|{{tok|<code>L</code>}}
|{{tok|<code>L</code>}}
|{{tok|<code>I</code>}}
|{{tok|<code>I</code>}}
|{{tok|<code>;</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">;</code>}}}}
|{{tok|<code>$</code>}}
|{{tok|<code>$</code>}}
|{{tok|<code>R</code>}}
|{{tok|<code>R</code>}}
|{{tok|<code>'</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">'</code>}}}}
|-
|-
!{{tp|j-}}
!{{tp|j-}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code>,</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">,</code>}}}}
|{{tok|<code>y</code>}}
|{{tok|<code>y</code>}}
|{{tok|<code>J</code>}}
|{{tok|<code>J</code>}}
|{{tok|<code>Y</code>}}
|{{tok|<code>Y</code>}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{N/A|{{space}}|link=wuwojiti}}
|{{tok|<code><</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;"><</code>}}}}
|{{tok|<code>"</code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">"</code>}}}}
|{{tok|<code>></code>}}
|{{CellCategory|1|2={{tok|1=<code style="mix-blend-mode:darken;">></code>}}}}
|{{tok|<code>j</code>}}
|{{tok|<code>j</code>}}
|}
|}

==Punctuation==
Only three printable ASCII characters are not defined in the syllable chart, and are, ''de facto'', virtually always used as [[punctuation]] in the same way as in {{tp|[[sitelen Lasina]]}}.

The ASCII {{w|Whitespace character|space}} (<code> </code>) is used to separate words. The period (<code>.</code>) and the colon (<code>:</code>) are generally used as sentence separators. Any other [[punctuation]] would be lost due to conflicting with syllable characters or being out of the ASCII range.

Similarly, due to the significant {{w|letter case}} and the use of all ASCII {{w|bracket}}s, there is no clear way to specify proper [[name]]s with this system. It has been suggested to instead use the colon for name marking where necessary, using the period as the only sentence separator, even after sentence-linking {{tp|[[ni]]}}.<ref>{{cite Discord|url=//discord.com/channels/301377942062366741/1156485500102778900/1194802490235953183|thread={{tok|sitelen Aki pi jan Misali · sE$ ak P j mcl}}|channel={{tok|toki-suli}}|server={{tp|ma pona pi toki pona}}|author={{tok|kulupu Menasewi}}|access-date=2024-01-15|quote=}}</ref>

==Usage==
The syllabary can be used to [[Text compression|compress text]], with each character being reducible to 7 {{w|bit}}s. Word lengths range from unchanged (for one-letter words) to a third as long, such as {{tok|<code>C%</code>}} for {{tp|[[sinpin]]}}. A major limiting factor for the {{w|Data compression ratio|compression ratio}} is the need to separate words.

{{tok|Misali}} also suggests using the syllabary as an {{w|input method}} for other writing systems.

The syllabary is rarely used as a writing system for general conversation.


==See also==
==See also==

Revision as of 22:28, 17 January 2024

j l bn e Oq Tw sn
jan li pana e moku tawa sina

The toki pona ASCII syllabary by jan Misali, or sitelen Aki[citation needed…], is a writing system that assigns each phonotactically allowed syllable in Toki Pona to a single ASCII character. It is designed so that many particles and short and common content words become recognizable initials, or become or integrate meaningful symbols.

Syllables

Highlighted cells correspond to syllables that do not occur in any nimi pu (but that can appear in names, and some of which would later appear in nimi ku). This affected the allocation of the characters, with many non-alphanumeric symbols being given to these syllables.

Null coda Coda nasal
-i -u -e -o -a -in -un -en -on -an
∅- i 0 e o a ! U & 7 A
m- m u ? O M 8 3 9 @ `
n- N / ^ * n ( ) - _ #
p- P f B p b % + F = 6
t- 2 E t T d x D X
k- k q G H K Q { g h }
s- s z Z S c C 4 [ \ ]
w- W V w | v 1
l- l 5 r ~ L I ; $ R '
j- , y J Y < " > j

Punctuation

Only three printable ASCII characters are not defined in the syllable chart, and are, de facto, virtually always used as punctuation in the same way as in sitelen Lasina.

The ASCII space ( ) is used to separate words. The period (.) and the colon (:) are generally used as sentence separators. Any other punctuation would be lost due to conflicting with syllable characters or being out of the ASCII range.

Similarly, due to the significant letter case and the use of all ASCII brackets, there is no clear way to specify proper names with this system. It has been suggested to instead use the colon for name marking where necessary, using the period as the only sentence separator, even after sentence-linking ni.[1]

Usage

The syllabary can be used to compress text, with each character being reducible to 7 bits. Word lengths range from unchanged (for one-letter words) to a third as long, such as C% for sinpin. A major limiting factor for the compression ratio is the need to separate words.

Misali also suggests using the syllabary as an input method for other writing systems.

The syllabary is rarely used as a writing system for general conversation.

See also

External links

References