Usage categories: Difference between revisions

From sona pona, the Toki Pona wiki
Content deleted Content added
No edit summary
No edit summary
Line 1: Line 1:
{{tok|[[Linku]]}} assigns words to broad '''usage categories''' based on annual surveys from 2022 onwards. These are a more granular, more frequently updated replacement for the [[book presence]] categories.<ref>{{tok|kala Asi}} (7 August 2023). [https://www.youtube.com/watch?v=wrFB1ETL1Hg {{tp|wile sona nimi}}]. [https://web.archive.org/web/20231018173305/https://www.youtube.com/watch?v=wrFB1ETL1Hg Archived] from the original on 18 October 2023. ''YouTube''. Retrieved 18 October 2023.</ref>
A '''usage category''' is a criteria for [[word usage]] based on the annual surveys by {{tok|[[Linku]]}}. These are a more granular, more frequently updated replacement for the [[book presence]] categories.<ref>{{cite YouTube|url=https://www.youtube.com/watch?v=wrFB1ETL1Hg|title={{tp|wile sona nimi}}|name={{tok|kala Asi}}|channel={{tok|kala Asi}}|handle=kala_asi|date=2023-08-07|archive-url=https://web.archive.org/web/20231018173305/https://www.youtube.com/watch?v=wrFB1ETL1Hg|archive-date=2023-10-18|access-date=2023-10-18}}</ref>

{{tok|[//nimi.li nimi.li]}}, a fork of {{tok|Linku}}, uses the term '''marginal''' to refer to words that are below these categories and thus deemed not notable for the dictionary project.


==Table of categories==
==Table of categories==
In the following tables, a bold line represents the cutoff for the categories that are selected by default. Numbers are rounded to the nearest percentage point (0.5% rounds to 1%).
In the following tables, a bold line represents the cutoff for the categories that are selected by default. Numbers are rounded to the nearest percentage point. {{tok|[[nimi.li]]}}, a fork of {{tok|Linku}}, uses the term ''marginal'' to refer to words that are below these categories and thus deemed not notable for the dictionary project.<ref>{{cite web|url=https://nimi.li/about|title=about|website=nimi.li|author={{tok|jan Tani}}|access-date=2024-01-17}}</ref>


{| style="vertical-align: top;"
{| style="vertical-align: top;"

Revision as of 00:30, 17 January 2024

A usage category is a criteria for word usage based on the annual surveys by Linku. These are a more granular, more frequently updated replacement for the book presence categories.[1]

Table of categories

In the following tables, a bold line represents the cutoff for the categories that are selected by default. Numbers are rounded to the nearest percentage point. nimi.li, a fork of Linku, uses the term marginal to refer to words that are below these categories and thus deemed not notable for the dictionary project.[2]

2023
Category Users
n = 868
Core [90%, 100%]
Widespread [70%, 90%)
Common [50%, 70%)
Uncommon [20%, 50%)
Rare [10%, 20%)
Obscure[a][b] [2%, 10%)
2022
Category Users
n = 345
Core [90%, 100%]
Widespread [70%, 90%)
Common [50%, 70%)
Uncommon [20%, 50%)
Rare [10%, 20%)
Obscure [1%, 10%)

Survey results

Notes

  1. In the 2023 results post, the obscure category is split into a high end [5%, 10%) and low end [2%, 5%) purely for readability.
  2. New words below 2% usage are considered not notable for inclusion in the dictionary. Words below this threshold that are already included are planned to be moved into a separate sandbox resource. As of the publication of the 2023 results, this is yet to be done.

References

  1. kala Asi. (7 August 2023). "wile sona nimi". kala Asi [@kala_asi]. YouTube. Archived from the original on 18 October 2023. Retrieved 18 October 2023.
  2. jan Tani. "about". nimi.li. Retrieved 17 January 2024.