Word frequency

From sona pona, the Toki Pona wiki

The frequency of words in Toki Pona is of interest to tokiponists such as learners, teachers, dictionary editors, and language change analysts. Influences on a word's frequency include grammatical constraints, breadth of semantic and phatic applicability, presentation in Official Toki Pona books, perceived usefulness, and trends in humor.

The most common words tend to be (semi)particles, pronouns, and certain prepositions and especially frequent content words. While the exact frequency list varies by corpus, the words li, mi, e, a, pona, toki, and ni (not a consistent order) comprise the top of the ilo Muni dataset from May 2018 to July 2024.

Usage categories[edit | edit source]

Multiple related categories have attempted to describe the frequency of word use across speakers, rather than a corpus. Toki Pona Dictionary clustered words into six frequency indices, selecting a threshold of reported usage above which certain nimi sin were promoted as nimi ku suli. The Linku online dictionary project uses a similar concept, setting usage categories based on yearly surveys, with a goal of showing learners how likely a given word is to be recognized or understood.

Table[edit | edit source]

Under construction This table needs work:

May not be filtered correctly (seemingly way too many cells returned invalid!); to redo with updated CSV

If you know about this topic, you can help us by editing it. (See all)

The following table is based on the ilo Muni database as of 10 September 2024, limited to the 200 all-time most frequently used tokens. Years start in August, not January (for example, the "2001" column represents August 2001 – July 2002).

The dataset includes many words that are always or sometimes proper names, parts thereof, or acronyms. Where a word has multiple possible representations (namely alternative spellings, abbreviations, sitelen Lasina vs. UCSUR codepoints) within the top 200 tokens, sums of the representations have also been added.

Word Frequency
2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018 2019 2020 2021 2022 2023 All time
li 250 600 896 4023 3633 9080 3499 2382 9700 4248 1954 1509 1249 2063 6132 17074 24672 20973 72596 150839 256799 265051 378073 1238778
mi 89 430 524 2980 2253 5074 2551 1783 5036 2771 1216 691 949 1525 4714 143085 244918 247300 343999 1153592
e 16465 22662 19628 62041 113938 185223 196401 282434 942205
a 21 31 56 242 312 704 255 146 547 291 128 114 290 112 1752 10905 15311 17412 46610 182459 166089 223551 793411
pona 294 429 1604 1870 3552 1545 1200 2123 1396 857 512 581 919 3250 11861 16992 15320 103637 161027 138550 180977 696800
toki 2279 2994 1536 822 556 898 13582 41366 75721 129634 128912 184691 629318
ni 49 233 1112 8654 12444 10646 36191 64821 127577 621972
la 60 158 234 1181 1250 2501 1343 718 2399 1242 620 361 364 730 2025 6089 8825 8143 27348 56342 102857 103009 163289 491816
lon 43 99 148 1030 830 2349 1361 531 2022 804 431 325 348 549 1682 5747 8823 7098 25943 101824 98410 151033 480668
ala 50 169 238 1105 893 2410 1000 648 2080 1050 453 352 401 617 1806 6191 9932 8012 51690 87390 92700 135907 433648
sina 45 118 168 910 895 1882 731 633 1480 762 548 303 380 616 1495 7995 26113 49917 92101 131725 424460
jan 79240 78898 101554 400879
o 15 64 70 525 519 1125 410 416 1260 534 314 359 246 1707 23633 76609 396994
tawa 48 8500 6954 22566 45194 105094 367950
pi 38 8102 6713 22177 75500 71826 363590
sona 110 808 676 1790 454 1347 708 402 224 250 351 1393 4164 6512 5753 18928 34266 61300 63019 92909 296715
ona 27 784 579 1573 679 445 212 1465 5946 4816 16981 58427 59853 292678
mute 94 3919 5259 16346 34991 52214 69730 260177
kama 37 600 636 1525 393 209 189 1044 3797 4756 4641 14658 52671 49762 70278 246672
tenpo 81 388 193 3785 4765 4319 31580 50217 68565 241390
ken 12 105 512 1082 461 297 965 527 245 186 172 269 738 2821 4165 3576 13332 28696 49975 240581
seme 32 37 58 224 340 637 211 157 436 235 121 63 104 100 679 2781 4754 3666 27361 47245 45900 227296
wile 30 526 166 889 25885 44145 43884 66023 213866
taso 3 47 360 1103 350 273 769 390 227 143 239 748 2176 3795 2890 10966 22080 36173 40310 59151 183277
nimi 8 48 62 428 851 369 167 163 2644 3356 2564 9371 19345 35995 55540 174573
tan 17 27 73 1068 299 961 220 100 2370 3556 2474 8900 18063 29955 33045 47666 152389
pilin 13 37 78 446 149 2656 8970 19218 30844 42679 150973
ma 49 163 700 1990 3326 8852 29475 31122 44526 148972
musi 21 38 167 252 504 213 115 469 398 107 43 145 111 504 1302 1880 1778 8906 18296 29392 143657
ike 58 338 342 252 864 168 99 625 18274 29126 28955 39205 140001
lili 19 43 934 333 18006 27809 26616 35071 131947
pali 27 338 847 185 120 121 176 542 1580 2684 25908 34310 130862
sitelen 5 36 48 228 557 106 574 268 129 100 1454 2256 2120 14890 27311 130759
ale 47 172 135 281 194 174 380 187 65 28 46 47 197 979 2092 1849 5957 15706 129190
kepeken 4 38 861 150 2071 13889 24284 23765 34253 115601
ilo 3 21 26 100 125 370 220 92 515 244 137 69 93 95 1253 1768 1961 6891 14779 23249 115252
tomo 32 312 681 189 13680 24061 22565 29495 109077
lukin 5 54 700 6536 12901 21885 21826 33704 107636
moku 38 20 175 169 449 193 59 768 269 94 19987 21497 29873 103424
ante 6 43 255 554 255 175 549 108 429 1087 1581 1298 5143 12043 21090 101580
ijo 8 159 186 644 94 535 205 108 78 83 126 1036 1832 1507 4803 11115 20737 21024 99701
suli 252 210 215 93 60 1268 1566 1607 5423 20255 21019 99534
mu 8 2 3 9 16 50 82 96 30 15 25 8 18 138 342 899 583 3260 17183 97362
sama 5 38 50 171 150 131 493 1245 1645 1239 10771 18899 95031
tu 11 19 163 189 97 462 17783 27676 93303
nasin 6 34 223 432 107 83 86 402 903 1269 1136 4163 92946
anu 27 24 116 157 314 113 242 113 64 38 59 63 315 900 1644 1126 4488 9843 18056 17741 86580
pana 34 472 248 90 67 316 1089 1582 1249 4381 9355 16746 28451 84435
soweli 4 47 152 527 225 67 202 68 85 1111 1231 1175 4153 17274 23900 82444
jo 86 16787 17240 20756 81432
wan 36 176 70 77 333 853 1478 1129 9683 16092 79477
wawa 8 6 11 90 72 276 66 44 128 70 52 70 299 752 1038 1026 2889 6849 13403 79084
kin 20 482 123 357 80 4136 9602 15379 23290 76763
lape 5 9 107 38 62 65 22 300 77 50 51 30 62 179 1231 19013 74006
n 7 10 8 18 24 56 35 80 24 12 13 15 252 2736 7038 14037 71093
sin 22 134 320 188 109 360 69 82 304 962 1225 1201 3833 14144 19977 70103
lipu 17 250 202 66 747 1054 1108 8224 14253 69580
telo 23 22 95 122 160 66 43 943 14400 19017 69335
nasa 109 349 154 53 261 160 38 40 74 311 870 8153 12962 68333
pini 2 22 13570 18544 67642
nanpa 7 19 41 355 134 85 51 40 53 259 647 857 909 3753 65406
lawa 6 24 157 313 45 654 978 12817 12963 18718 64092
kalama 4 13 93 57 263 290 126 49 17 54 851 1098 13339 18092 63899
kulupu 16 366 56 3657 7617 11635 13142 63273
luka 106 68 99 44 38 37 247 732 888 3403 7407 11155 62983
pakala 5 9 92 107 316 161 71 41 57 233 711 837 7295 61827
suno 6 16 49 56 11910 15183 59350
en 10927 11586 16174 59078
weka 2 10 19 71 86 268 72 42 184 546 882 883 6980 10779 57236
awen 5 99 100 261 60 327 71 41 69 194 518 2886 55615
sewi 13 623 6039 8749 10561 15751 51025
suwi 4 46 21 197 70 46 99 52 39 22 18 10 91 432 642 429 2322 5699 48614
kon 6 7 87 59 205 98 40 111 44 35 547 774 672 2484 8724 11532 41251
sike 8 7 108 60 123 45 29 454 832 642 5235 7813 8350 12080 39802
poka 2 12 42 178 399 550 534 1959 4280 7204 8109 38352
olin 17 85 51 156 63 213 111 32 24 13 117 2071 4233 7629 37917
kasi 5 35 29 31 151 521 689 2051 4716 7644 10255 36254
mama 3 7 87 21 93 620 565 2053 7055 6927 8189 33240
moli 189 36 176 4271 5461 7441 9487 33205
waso 4 7 65 66 59 22 247 98 20 110 324 496 532 1616 6964 8772 32894
kute 4 14 6 46 65 174 55 37 132 69 43 15 23 92 409 575 568 2024 6533 6426 8366 30393
insa 2 12 9 73 175 26 40 488 506 1606 3538 6376 6210 30373
utala 5 72 50 52 29 28 311 1659 3815 5615 6238 29617
kili 12 9 55 29 152 64 38 22 25 45 92 356 5905 5982 8289 29503
poki 8 3 14 59 110 10 206 26 42 28 9 36 76 263 323 348 1093 3461 5971 28083
len 2 14 47 93 40 26 173 74 25 17 105 362 338 1460 3316 27946
seli 9 3 39 22 153 26 214 59 28 32 1702 3610 5582 6077 7467 27177
kala 4 2 6 7 9 143 39 15 124 67 9 27 7 22 86 541 425 4982 5467 7060 25915
open 4 26 24 125 63 149 49 36 14 28 107 322 491 464 1545 3165 5327 5264 25915
kiwen 4 27 148 38 30 284 426 485 1592 3176 5087 5765 7538 25464
pu 124 26 33 3 8 3299 4141 25173
linja 5 10 33 54 148 14 69 27 21 39 312 418 402 1503 3215 5412 7188 25074
pimeja 2 11 31 55 36 21 5073 5043 6912 24872
alasa 72 20 15 23 14 14 49 132 196 203 920 2701 4701 24712
mani 124 22 181 3095 4711 4951 6809 24007
inli 10 129 33 15 20 20 407 1397 2492 4716 5039 23492
mun 6 46 64 107 61 10 9 15 104 199 383 301 900 2469 4259 23430
ko 2 4 11 12 91 15 150 28 18 6 22 412 1333 3111 4054 4847 23422
pipi 6 7 9 139 38 7 21 78 206 403 353 4651 6690 22911
uta 2 3 3 11 8 104 9 31 10 94 4755 6411 22667
sijelo 4 2 8 129 47 5 246 2817 4362 4856 22652
kule 5 5 15 6 110 23 107 18 9 7 211 275 256 956 6593 22171
pan 23 6 80 54 17 12 26 66 246 401 4022 4636 22013
unpa 4 20 26 69 19 9 22 17 4 41 1249 1804 2389 20955
meli 309 2242 3470 3752 6204 20829
jaki 18 15 36 13 128 39 12 8 5 52 4244 5349 20420
anpa 18 341 309 1014 2352 3641 4187 19906
kijetesantakalu 66 62 393 1562 19570
ali 3206 2652 3243 19328
esun 9 58 181 2404 5305 18967
akesi 46 16 38 21 10 5 44 186 290 5140 18862
loje 3 6 12 24 8 4 66 289 244 956 4860 17466
monsuta 38 41 7 7 25 142 182 149 907 2321 17466
palisa 24 25 12 14 199 256 2187 2807 16354
supa 10 40 19 209 207 16290
lete 2 3 24 15 41 13 104 7 12 60 185 2130 3163 4342 15875
wa 7 33 125 142 130 276 2812 3022 15638
mije 1762 2500 15214
noka 5 17 14 45 25 13 134 232 1972 2978 14363
laso 61 31 83 5 120 215 2112 14301
tonsi 475 2810 14088
walo 15 39 9 5 60 166 1651 2643 13151
nena 2 17 65 15 2582 2519 3487 12878
jelo 16 5 72 9 7 150 197 1936 2593 3405 12542
sinpin 22 9 31 170 1589 2355 2615 12523
epiku 719 1341 11640
oko 2 5 14 10 6 1319 2320 2770 11237
ku 3 11 112 61 82 177 1438 2112 2796 10892
lupa 27 1333 1857 2042 3120 10328
selo 3 12 70 5 181 537 1476 1901 3088 10231
te 15 8 10 11 8 6 21 153 104 2084 1352 2406 9746
leko 9 3 87 134 2078 1911 2649 9315
namako 60 10 3 8 23 88 142 128 423 1142 1873 9296
oke 10 55 53 65 1852 1038 1872 8572
kipisi 7 5 37 106 98 98 419 1187 1789 2525 8431
lanpan 128 299 827 8417
ki 6 7 18 6 3 117 111 119 1554 1782 1902 8360
monsi 15 117 120 1182 1530 1564 8323
soko 893 1339 2154 7099
po 7 24 14 4 3 866 1008 1839 6813
ta 6 4 5 3 839 764 1466 2185 6433
san 11 17 14 20 17 41 97 100 163 645 906 6362
pa 7 891 1181 1369 5772
󱤴 5749
sonja 35 105 1019 795 1201 5672
msa 224 904 1821 5627
manka 1140 1587 5296
󱤧 5184
epelanto 17 12 8 298 892 801 1071 5083
meso 730 1445 4789
mewika 80 565 861 1032 1343 4789
nja 396 678 936 4641
owe 4620
siko 247 517 982 4552
misikeke 514 1425 4551
󱤀 4490
nijon 8 6 10 3 9 45 363 682 4480
majuna 106 69 197 497 740 4474
or 6 10 77 976 4453
jasima 123 629 1219 4151
ke 84 215 460 774 1263 4117
powe 7 224 707 1122 4109
kanse 9 647 998 4008
su 13 65 524 3992
oki 59 183 522 763 1237 3984
misali 709 3872
󱤉 3787
nano 693 1063 3777
lu 56 53 44 150 635 641 1257 3676
x 5 8 79 528 3598
linluwi 173 497 1200 3562
󱥁 3375
nimisin 48 540 3350
epanja 7 10 6 45 403 667 644 3305
kapilu 496 3296
isipin 620 977 3261
󱥔 3229
tosi 39 66 49 506 3078
sonko 7 427 578 2987
lo 7 938 2946
nata 680 2900
kokosila 904 2882
usawi 1072 2878
kan 9 41 573 449 2841
kiki 418 670 1031 2812
lipamanka 637 2666
losi 10 586 451 2644
alu 150 2635
󱥬 2621
tuki 2597
󱤡 2514
to 925 2477
apeja 36 542 2410
linku 2392
son 339 2383
misa 2378
like 2286
puwa 2255
󱤬 2241
pingo 2239
a + 󱤀 797901
ala + x 653 1058 10011 52218 437246
ale + ali 148518
e + 󱤉 945992
la + 󱤡 944719
li + 󱤧 1243962
lon + 󱤬 482909
mi + 󱤴 1159341
ni + 󱥁 625347
pona + 󱥔 700029
toki + 󱥬 631939

See also[edit | edit source]

Notes[edit | edit source]

English Wikipedia has an article on
word frequency.