ASCII: Difference between revisions

lipu pona: This is a good article.
lipu pona: This is a good article.
From sona pona, the Toki Pona wiki
Content added Content deleted
(Prepared the page for translation)
(Marked this version for translation)
 
Line 4: Line 4:
|blurb='''[[Special:MyLanguage/ASCII|ASCII]]''' is a character encoding standard that features 95 printable characters, limited to the basic Latin alphabet and symbols on the United States keyboard layout. This character set is the basis of several [[Special:MyLanguage/Toki Pona|Toki Pona]] projects, due to its availability compared to specialized keyboards and [[Special:MyLanguage/input method|input method]]s.
|blurb='''[[Special:MyLanguage/ASCII|ASCII]]''' is a character encoding standard that features 95 printable characters, limited to the basic Latin alphabet and symbols on the United States keyboard layout. This character set is the basis of several [[Special:MyLanguage/Toki Pona|Toki Pona]] projects, due to its availability compared to specialized keyboards and [[Special:MyLanguage/input method|input method]]s.
}}
}}
<translate>{{Wikipedia}}
<translate><!--T:1-->
{{Wikipedia}}
'''ASCII''' ('''American Standard Code for Information Interchange''') is a character encoding standard updated from 1963 to 1986, and a precursor to [[Special:MyLanguage/Unicode|Unicode]].
'''ASCII''' ('''American Standard Code for Information Interchange''') is a character encoding standard updated from 1963 to 1986, and a precursor to [[Special:MyLanguage/Unicode|Unicode]].


<!--T:2-->
ASCII features only 128 codepoints, many of which are {{w|control character}}s, and its 94 {{w|printable character}}s are limited to the {{w|ISO basic Latin alphabet|basic Latin alphabet}} (the 26 letters also comprising the {{w|English alphabet}}) and {{w|Typographical symbol|symbols}} on the {{w|British and American keyboards|United States keyboard layout}}. This character set is the basis of several [[Special:MyLanguage/Toki Pona|Toki Pona]] projects, due to its availability compared to specialized keyboards and [[Special:MyLanguage/input method|input method]]s.
ASCII features only 128 codepoints, many of which are {{w|control character}}s, and its 94 {{w|printable character}}s are limited to the {{w|ISO basic Latin alphabet|basic Latin alphabet}} (the 26 letters also comprising the {{w|English alphabet}}) and {{w|Typographical symbol|symbols}} on the {{w|British and American keyboards|United States keyboard layout}}. This character set is the basis of several [[Special:MyLanguage/Toki Pona|Toki Pona]] projects, due to its availability compared to specialized keyboards and [[Special:MyLanguage/input method|input method]]s.


Line 13: Line 15:
<translate>
<translate>


==In writing systems==
==In writing systems== <!--T:3-->


<!--T:4-->
Multiple Toki Pona [[Special:MyLanguage/writing system|writing system]]s have been based on ASCII.
Multiple Toki Pona [[Special:MyLanguage/writing system|writing system]]s have been based on ASCII.


</translate>
</translate>
{{tp|[[Special:MyLanguage/sitelen akesi|sitelen akesi]]}}
{{tp|[[Special:MyLanguage/sitelen akesi|sitelen akesi]]}}
<translate> <!--T:5-->
<translate> is an ASCII-compatible adaptation of {{tp|[[Special:MyLanguage/sitelen pona|sitelen pona]]}}. Glyphs may be represented as one or more characters.
is an ASCII-compatible adaptation of {{tp|[[Special:MyLanguage/sitelen pona|sitelen pona]]}}. Glyphs may be represented as one or more characters.


</translate>
</translate>
{{tok|[[Special:MyLanguage/jan Misali|jan Misali]]}}
{{tok|[[Special:MyLanguage/jan Misali|jan Misali]]}}
<translate><!--T:6-->
<translate>'s [[Special:MyLanguage/toki pona ASCII syllabary|toki pona ASCII syllabary]] assigns each [[Special:MyLanguage/phonotactic|phonotactic]]ally allowed [[Special:MyLanguage/syllable|syllable]] in Toki Pona to a single ASCII character.
's [[Special:MyLanguage/toki pona ASCII syllabary|toki pona ASCII syllabary]] assigns each [[Special:MyLanguage/phonotactic|phonotactic]]ally allowed [[Special:MyLanguage/syllable|syllable]] in Toki Pona to a single ASCII character.


==In fonts==
==In fonts== <!--T:7-->


===ASCII transcription===
===ASCII transcription=== <!--T:8-->


<!--T:9-->
<div style="float:right;margin-left:1em;">
<div style="float:right;margin-left:1em;">
{|class="wikitable" style="width:180px;"
{|class="wikitable" style="width:180px;"
Line 52: Line 58:


</translate>
</translate>
[[File:Word default no ligatures.tiff|thumb|180px|<translate>''ASCII transcription'' works like automatic Latin ligatures, where multiple glyphs are visually substituted with one.</translate>]]
[[File:Word default no ligatures.tiff|thumb|180px|<translate><!--T:10--> ''ASCII transcription'' works like automatic Latin ligatures, where multiple glyphs are visually substituted with one.</translate>]]
<translate>
<translate>
<!--T:11-->
This feature uses {{w|OpenType}} {{w|Ligature (writing)|ligatures}}. It works in the same way that some Latin-script fonts convert the letters <code>ffi</code> into a ligature that <em>looks</em> like the single glyph <code>ffi</code>, but is actually still 3 underlying characters.
This feature uses {{w|OpenType}} {{w|Ligature (writing)|ligatures}}. It works in the same way that some Latin-script fonts convert the letters <code>ffi</code> into a ligature that <em>looks</em> like the single glyph <code>ffi</code>, but is actually still 3 underlying characters.


<!--T:12-->
As a result, when ASCII transcription is used, the underlying characters are still ASCII-compatible instead of being converted to other codepoints. This may or may not be desired. You can confirm the difference by copying the {{w|rich text}} below, and pasting it into a {{w|plain text}} environment such as {{w|Windows Notepad}} or a search bar.
As a result, when ASCII transcription is used, the underlying characters are still ASCII-compatible instead of being converted to other codepoints. This may or may not be desired. You can confirm the difference by copying the {{w|rich text}} below, and pasting it into a {{w|plain text}} environment such as {{w|Windows Notepad}} or a search bar.


<!--T:13-->
{|class="wikitable"
{|class="wikitable"
!
!
Line 81: Line 90:
|}
|}


<!--T:14-->
Fonts with ASCII transcription may help with memorizing words, as the substitution will only occur when the word is spelled correctly, and in logographies (such as {{tp|sitelen pona}}) the resulting glyph often relates to the word's meaning.
Fonts with ASCII transcription may help with memorizing words, as the substitution will only occur when the word is spelled correctly, and in logographies (such as {{tp|sitelen pona}}) the resulting glyph often relates to the word's meaning.


==References==
==References== <!--T:15-->


<!--T:16-->
<references />
<references />
</translate>
</translate>

Latest revision as of 08:27, 5 May 2024

Other languages:
English Wikipedia has an article on
ASCII.

ASCII (American Standard Code for Information Interchange) is a character encoding standard updated from 1963 to 1986, and a precursor to Unicode.

ASCII features only 128 codepoints, many of which are control characters, and its 94 printable characters are limited to the basic Latin alphabet (the 26 letters also comprising the English alphabet) and symbols on the United States keyboard layout. This character set is the basis of several Toki Pona projects, due to its availability compared to specialized keyboards and input methods.

Printable ASCII characters
0 1 2 3 4 5 6 7 8 9 A B C D E F
2x SP ! " # $ % & ' ( ) * + , - . /
3x 0 1 2 3 4 5 6 7 8 9 : ; < = > ?
4x @ A B C D E F G H I J K L M N O
5x P Q R S T U V W X Y Z [ \ ] ^ _
6x ` a b c d e f g h i j k l m n o
7x p q r s t u v w x y z { | } ~ DEL

In writing systems[edit | edit source]

Multiple Toki Pona writing systems have been based on ASCII.

sitelen akesi

is an ASCII-compatible adaptation of sitelen pona. Glyphs may be represented as one or more characters.

jan Misali 's toki pona ASCII syllabary assigns each phonotactically allowed syllable in Toki Pona to a single ASCII character.

In fonts[edit | edit source]

ASCII transcription[edit | edit source]

Sample output of a sitelen pona font with ASCII transcription
Rich text Plain text
m m
mu mu
mut mut
mute mute

Many fonts for original Toki Pona writing systems include a feature called ASCII transcription,[1] where a run of characters typed in sitelen Lasina is visually substituted with a single glyph. This simulates the effects of an IME without the need to install software beyond the font, but unlike these, it does not actually replace the underlying text. Other ASCII characters are commonly used for additional features in the writing system or font, such as the hyphen (-) being used for sitelen pona combined glyphs.

ASCII transcription works like automatic Latin ligatures, where multiple glyphs are visually substituted with one.

This feature uses OpenType ligatures. It works in the same way that some Latin-script fonts convert the letters ffi into a ligature that looks like the single glyph , but is actually still 3 underlying characters.

As a result, when ASCII transcription is used, the underlying characters are still ASCII-compatible instead of being converted to other codepoints. This may or may not be desired. You can confirm the difference by copying the rich text below, and pasting it into a plain text environment such as Windows Notepad or a search bar.

Rich text Plain text Comparison
ASCII transcription toki-pona toki-pona
  • Falls back to legible sitelen Lasina
  • Writing system not specified
  • Less standardization for font features
UCSUR encoding 󱥬‍󱥔 󱥬‍󱥔
  • Falls back to illegible small rectangles (colloquially "tofu")
  • Writing system specified as sitelen pona
  • More standardization for font features

Fonts with ASCII transcription may help with memorizing words, as the substitution will only occur when the word is spelled correctly, and in logographies (such as sitelen pona) the resulting glyph often relates to the word's meaning.

References[edit | edit source]

  1. "nimi Linku". Google Sheets. Retrieved 17 January 2024.