Linku: Difference between revisions

From sona pona, the Toki Pona wiki
Line 1: Line 1:
{{tok title}}
{{tok title}}
'''{{tok|ijo Linku}}''' are the collection of data and tools around the [//lipu-linku.github.io/about {{tok|Linku}} dictionary project].
'''{{tok|ijo Linku}}''' are the collection of data and tools around the {{tok|Linku}} dictionary project.<ref>[https://linku.la/about/ "About Linku"]. {{tp|lipu Linku}}. Retrieved 16 October 2023.</ref>


==Tools using {{tok|Linku}} data==
==Tools==
{{tok|Linku}} provides the data set as a public json file, [//lipu-linku.github.io/about/jasima {{tok|jasima Linku}}]
{{tok|Linku}} provides the data set as a public JSON file, called [//lipu-linku.github.io/about/jasima {{tok|jasima Linku}}].


* [//linku.la {{tok|lipu Linku}}], the main dictionary page
* [//linku.la {{tok|lipu Linku}}], the main dictionary page
Line 9: Line 9:


==Word usage surveys==
==Word usage surveys==
The {{tok|Linku}} team puts out annual word usage surveys, to update the database with the best information, and to allow users to filter the dictionary by their preferred usage cutoff point. {{tok|kala Asi}} [//youtu.be/wrFB1ETL1Hg discussed surveying words] in a segment for {{tok|[[suno pi toki pona]]}} 2023.
The {{tok|Linku}} team puts out annual word usage surveys, to update the database with the best information, and to allow users to filter the dictionary by their preferred usage cutoff point. {{tok|kala Asi}} discussed surveying words in a segment for {{tok|[[suno pi toki pona]]}} 2023.<ref>{{tok|kala Asi}} (7 August 2023). [https://www.youtube.com/watch?v=wrFB1ETL1Hg "{{tok|wile sona nimi}}"]. ''YouTube''.</ref>


===Usage categories===
===Usage categories===
Based on those surveys, {{tok|Linku}} has assigned words to a few broad categories since 2022. These are a more granular, more frequently updated replacement for the [[book presence]] categories.
Based on those surveys, {{tok|Linku}} has assigned words to a few broad categories since 2022. These are a more granular, more frequently updated replacement for the [[book presence]] categories. In the following tables, a bold line represents the cutoff for the categories that are selected by default.

In the following tables, a bold line represents the cutoff for the categories that are selected by default.


Numbers are rounded to the nearest percentage point (0.5% rounds to 1%).
Numbers are rounded to the nearest percentage point (0.5% rounds to 1%).


{|class="wikitable" style="float:left;margin-right:1em;text-align:center;"
{| style="vertical-align: top;"
|
{|class="wikitable" style="text-align:center;"
|+2023
|+2023
|-
|-
!style="width:0;"|Category
!style="width:0;"|Category
!Users<br /><small>{{abbr|<var>n</var>|Sample size}} = 868</small>
!Users<br/><small>{{abbr|<var>n</var>|Sample size}} = 868</small>
|-
|-
!Core
!Core
Line 42: Line 42:
|[2%, 10%)
|[2%, 10%)
|}
|}
|

{|class="wikitable" style="float:left;margin-right:1em;text-align:center;"
{|class="wikitable" style="text-align:center;"
|+2022
|+2022
|-
|-
!style="width:0;"|Category
!style="width:0;"|Category
!Users<br /><small>{{abbr|<var>n</var>|Sample size}} = 345</small>
!Users<br/><small>{{abbr|<var>n</var>|Sample size}} = 345</small>
|-
|-
!Core
!Core
Line 67: Line 67:
|[1%, 10%)
|[1%, 10%)
|}
|}
|}
<hr style="clear:both;" />
<references group="lower-alpha" />


===Survey results===
===Survey results===
*[https://github.com/lipu-linku/ijo/blob/main/survey/2023/results.md 2023 survey results] (868 responses)
* [https://github.com/lipu-linku/ijo/blob/main/survey/2023/results.md 2023 survey results] (868 responses)
*[//reddit.com/r/tokipona/comments/wqyczo/survey_results_how_many_people_use_words_in_2022 2022 survey results] (345 responses) {{Indent|The 2022 survey changed the methodology. It asks "do you use this word". Previous years asked "Do you consider this word real". Because of this, results from 2022 and after cannot be directly compared to 2021 and before.}}
* [//reddit.com/r/tokipona/comments/wqyczo/survey_results_how_many_people_use_words_in_2022 2022 survey results] (345 responses) {{Indent|The 2022 survey changed the methodology. It asks "do you use this word". Previous years asked "do you consider this word real". Because of this, results from 2022 and after cannot be directly compared to 2021 and before.}}
*[//reddit.com/r/tokipona/comments/qa3inn/survey_results_heres_how_real_these_tp_words_are 2021 survey results] (152 responses)
* [//reddit.com/r/tokipona/comments/qa3inn/survey_results_heres_how_real_these_tp_words_are 2021 survey results] (152 responses)
*[//reddit.com/r/tokipona/comments/g9ne0s/survey_results_heres_how_real_these_tp_words_are 2020 survey results] (86 responses)
* [//reddit.com/r/tokipona/comments/g9ne0s/survey_results_heres_how_real_these_tp_words_are 2020 survey results] (86 responses)

==Notes==
<references group="lower-alpha"/>

==References==
<references/>
{{General}}
{{General}}
[[Category:Resources]]
[[Category:Resources]]

Revision as of 17:09, 16 October 2023

ijo Linku are the collection of data and tools around the Linku dictionary project.[1]

Tools

Linku provides the data set as a public JSON file, called jasima Linku.

  • lipu Linku, the main dictionary page
  • nimi.li, an interactive dictionary that extends the Linku data with additional words too obscure or unused to be considered for Linku yet.

Word usage surveys

The Linku team puts out annual word usage surveys to update the database with the best information and to allow users to filter the dictionary by their preferred usage cutoff point. kala Asi discussed surveying words in a segment for suno pi toki pona 2023.[2]

Usage categories

Based on those surveys, Linku has assigned words to a few broad categories since 2022. These are a more granular, more frequently updated replacement for the book presence categories. In the following tables, a bold line represents the cutoff for the categories that are selected by default.

Numbers are rounded to the nearest percentage point (0.5% rounds to 1%).
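
The rounding rule above (halves round up) differs from the default behaviour of some languages. As a minimal sketch, assuming the percentages are non-negative, the following Python helper rounds half up; the function name `round_percent` is hypothetical and is not part of any Linku tool.

```python
from decimal import Decimal, ROUND_HALF_UP


def round_percent(value: float) -> int:
    """Round a non-negative percentage to the nearest whole point.

    Ties such as 0.5 round up, matching the rule stated above.
    Python's built-in round() uses round-half-to-even instead,
    so round(0.5) gives 0 and round(2.5) gives 2.
    """
    # Going through str() avoids carrying binary float noise into Decimal.
    exact = Decimal(str(value))
    return int(exact.quantize(Decimal("1"), rounding=ROUND_HALF_UP))


if __name__ == "__main__":
    for sample in (0.5, 2.5, 89.49):
        print(sample, "->", round_percent(sample))
```

Using `decimal` rather than `math.floor(value + 0.5)` is a design choice here: it keeps values such as 2.5 exact instead of relying on how they happen to be stored as floats.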

2023
Category       Users (n = 868)
Core           [90%, 100%]
Widespread     [70%, 90%)
Common         [50%, 70%)
Uncommon       [20%, 50%)
Rare           [10%, 20%)
Obscure[a][b]  [2%, 10%)

2022
Category       Users (n = 345)
Core           [90%, 100%]
Widespread     [70%, 90%)
Common         [50%, 70%)
Uncommon       [20%, 50%)
Rare           [10%, 20%)
Obscure        [1%, 10%)
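
The intervals in the tables above can be read as a simple lookup from a usage percentage to a category. The following Python sketch illustrates that reading; the names `THRESHOLDS` and `usage_category` are hypothetical, and it is an assumption that the input is a percentage already rounded to a whole point. It is not taken from Linku's own code.

```python
from typing import Optional

# Inclusive lower bounds in percent, highest first, copied from the tables above.
# The only difference between the two years is the lower bound of "Obscure".
THRESHOLDS = {
    2023: [(90, "Core"), (70, "Widespread"), (50, "Common"),
           (20, "Uncommon"), (10, "Rare"), (2, "Obscure")],
    2022: [(90, "Core"), (70, "Widespread"), (50, "Common"),
           (20, "Uncommon"), (10, "Rare"), (1, "Obscure")],
}


def usage_category(percent: int, year: int = 2023) -> Optional[str]:
    """Return the usage category for a rounded percentage.

    Returns None when the percentage falls below the lowest
    threshold listed for that survey year.
    """
    for lower_bound, name in THRESHOLDS[year]:
        if percent >= lower_bound:
            return name
    return None


if __name__ == "__main__":
    print(usage_category(90))        # Core
    print(usage_category(89))        # Widespread
    print(usage_category(1, 2023))   # None
    print(usage_category(1, 2022))   # Obscure
```

Because each interval is closed on the left and open on the right, checking the lower bounds from highest to lowest is enough to pick the right category.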

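Survey results

  • 2023 survey results (868 responses)
  • 2022 survey results (345 responses)
      The 2022 survey changed the methodology. It asks "do you use this word". Previous years asked "do you consider this word real". Because of this, results from 2022 and after cannot be directly compared to 2021 and before.
  • 2021 survey results (152 responses)
  • 2020 survey results (86 responses)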

Notes

  a. In the 2023 results post, the obscure category is split into a high end [5%, 10%) and low end [2%, 5%) purely for readability.
  b. New words below 2% usage are considered not notable for inclusion in the dictionary. Words below this threshold that are already included are planned to be moved into a separate sandbox resource. As of the publication of the 2023 results, this is yet to be done.

References

  1. "About Linku". lipu Linku. Retrieved 16 October 2023.
  2. kala Asi (7 August 2023). "wile sona nimi". YouTube.