Usage categories
A usage category is a criterion for word usage based on the annual surveys by Linku. These are a more granular, more frequently updated replacement for the book presence categories.[1]
Table of categories
In the following tables, a bold line represents the cutoff for the categories that are selected by default. Numbers are rounded to the nearest percentage point. For words that were below these categories, Linku used the term non-notable; nimi.li used the term marginal,[2] which has been adopted on sona pona to avoid the interpretation that these words are not notable for the wiki.
On 22 February 2024, waso Keli brought up the idea of simplifying Linku categories to aid their adoption. Teachers on the kama sona community discussed and reached a consensus, reducing the number of categories from six to four. This change was rolled out alongside the lipu Linku redesign on 30 March 2024.[3]
|
|
|
Notes
- ↑ 1.0 1.1 The sandbox threshold was at 2% for a few days after being implemented, then raised to 5% by consensus.
- ↑ 2.0 2.1 On 12 February 2024, a message was added clarifying that most speakers don't use uncommon or rare words.
- ↑ In the 2023 results post, the obscure category is split into a high end [5%, 10%) and low end [2%, 5%) purely for readability.
- ↑ New words below 2% usage are considered not notable for inclusion in the dictionary. Words below this threshold that are already included were planned to be moved into a separate sandbox resource. This was completed on 9 April 2024, shortly after the redesign was launched.
- ↑ On 12 February 2024, a message was added clarifying that most speakers don't use or understand obscure words.
Survey results
- 2023 survey results (868 responses)
- 2022 survey results (345 responses) The 2022 survey changed the methodology. It asks "do you use this word". Previous years asked "do you consider this word real". Because of this, results from 2022 and after cannot be directly compared to 2021 and before.
- 2021 survey results (152 responses)
- 2020 survey results (86 responses)
Correlations
According to a June 2024 study by jan Kekan San, less-used words tend to be discussed in other languages (such as English) more often than being used in Toki Pona.[4] Almost every sub-common word (below 60% usage) is used in other languages at least 30% of the time, and usually far more often; and the trend becomes "more pronounced" for sandboxed words (below 5% usage).[5]
References
- ↑ kala Asi. (7 August 2023). "wile sona nimi". kala Asi [@kala_asi]. YouTube. Archived from the original on 18 October 2023. Retrieved 18 October 2023.
- ↑ jan Tani. "about". nimi.li. Retrieved 17 January 2024.
- ↑ (30 March 2024). "lipu Linku". lipu Linku. Archived from the original on 30 March 2024. Retrieved 30 March 2024.
- ↑ jan Kekan San [@gregdan3]. (12 June 2024). Message in the
#Word frequency in Toki Pona
thread in#toki-suli
. ma pona pi toki pona. Discord. Retrieved 14 June 2024. - ↑ jan Kekan San [@gregdan3]. (12 June 2024). Message in the
#Word frequency in Toki Pona
thread in#toki-suli
. ma pona pi toki pona. Discord. Retrieved 14 June 2024.