BabelNet 4.0: General statistics

Number of languages: 284
Total number of Babel synsets: 15,780,364
Total number of Babel senses: 808,974,108
Total number of concepts: 6,113,467
Total number of Named Entities: 9,666,897
Total number of lexico-semantic relations: 277,036,611
Total number of glosses (textual definitions): 91,218,220
Total number of images: 54,229,458
Total number of Babel synsets with at least one domain: 2,637,407
Total number of Babel synsets with at least one picture: 10,522,922
Total number of sources: 47

Statistics between the BabelNet versions

Version Release     
date
Languages Sources Babel   
synsets
Babel   
senses
Concepts Named     
Entities
Lexico-semantic
relations
Definitions Images RDF triples
4.0 2018/02                 284 47 15,780,364 808,974,108 6,113,467 9,666,897 277,036,611 91,218,220 54,229,458 -
3.7 2016/08                 271 14 13,801,844 745,859,932 6,066,396 7,735,448 380,239,084 40,709,194 10,767,833 -
3.6 2016/01                 271 13 13,801,844 745,856,326 6,066,396 7,735,448 380,239,084 40,705,588 10,767,833 1,958,820,772
3.5 2015/09                 272 13 13,801,844 119,036,997 6,066,396 7,735,448 380,239,084 40,634,604 10,767,833 1,951,194,299
3.0 2014/12                 271 7 13,789,332 117,204,438 6,418,418 7,370,914 354,538,633 40,328,194 10,960,634 1,932,845,662
2.5.1 2014/11 50 7 9,347,143 67,605,841 3,683,600 5,663,543 262,687,848 21,751,423 7,764,270 -
2.0.1 2014/03 50 5 9,348,287 50,282,923 3,684,512 5,663,775 262,687,848 17,961,157 7,764,270 1,138,337,378
1.1 2013/01 6 4 5,581,954 21,947,381 1,566,566 4,015,388 141,697,438 8,439,497 6,494,690 -
Version Release date Languages Resources New Features
4.0 2018/02  284
  • Update of all of the resources, including:
    • All of the Open Multilingual Wordnets at their latest release (downloaded in January 2017)
    • Wikipedia (February 2018 dump)
    • Wiktionary (February 2018 dump)
    • Wikidata (February 2018 dump)
    • OmegaWiki (January 2017 dump)
  • Integration of new open wordnets for Gaelic, Portuguese and Korean (downloaded in January 2017)
  • Improved management of Open Multilingual Wordnets that are no longer stored under a single resource name
  • Manual validation of the resource mapping, with more than 2000 wrong mappings now corrected
  • Total number of languages increased from 271 to 284. New languages include: Adyghe, Azerbaijani, Goan Konkani, Livvinkarjala, Maithili, Northern Luri, Serbo-Croatian, South Patois, Tarantino, Tulu
  • Cantonese, Min Nan and Classical Chinese are now managed and searchable separately from standard Chinese
  • All Chinese lemmas are now normalized to simplified characters, so simplified Chinese must be used when querying BabelNet
3.7 2016/08  271
  • FrameNet (version 1.6)
  • Mappings with several versions of WordNet now integrated (from 1.6 to 3.0)
  • More than 2500 Babel synsets identified as key concepts
  • More than 2.6 million Babel synsets labeled with domains (was 1,558,806 in v3.6)
3.6 2016/01  271
  • ItalWordNet (December 2015 dump)
  • Open Dutch WordNet (December 2015 dump)
  • Microsoft terminology (July 2015 dump)
  • WoNeF (high precision version, downloaded in August 2015)
  • GeoNames (April 2015 dump)
  • Wikiquote (November 2014 dump)
  • VerbNet (version 3.2)
  • English Wiktionary (August 2014 dump)
  • Wikidata (November 2014 dump)
  • OmegaWiki (July 2015 dump)
  • Open Multi WordNet (downloaded in August 2015)
  • WordNet (version 3.0)
  • Wikipedia (11/2014)
  • Semcor automatic translations
  • Wikipedia automatic translations
  • New International senses
  • Added other forms
  • Links to YAGO
3.5 2015/09  272
  • Microsoft terminology (July 2015 dump)
  • WoNeF (high precision version, downloaded in August 2015)
  • GeoNames (April 2015 dump)
  • Wikiquote (November 2014 dump)
  • VerbNet (version 3.2)
  • English Wiktionary (August 2014 dump)
  • Wikidata (November 2014 dump)
  • OmegaWiki (July 2015 dump)
  • Open Multi WordNet (downloaded in August 2015
  • WordNet (version 3.0)
  • Wikipedia (11/2014)
  • Semcor automatic translations
  • Wikipedia automatic translations
  • Added domains
  • Added dompounds
  • Casing for Wikipedia senses
  • All definitions disambiguated with Babelfy
  • Integration of the Multilingual Wikipedia taxonomy
  • Integration of Wikidata relations
  • Integration of InfoBoxes (Wikipedia English) relations
  • Integration of DEFIE relations
  • New definitions from Wiktionary translations
  • New images from ImageNet
  • New synsets from Microsoft Terminology
3.0 2014/12  271
  • English Wiktionary (August 2014 dump)
  • Wikidata (November 2014 dump)
  • OmegaWiki (July 2015 dump)
  • Open Multi WordNet (downloaded in September 2013)
  • WordNet (version 3.0)
  • Wikipedia (November 2014 dump)
  • Semcor automatic translations
  • Wikipedia automatic translations
  • Links to Freebase
  • Added pronunciations
  • Added examples
  • Integration of Wikipedia taxonomy
  • Integration of Wikidata Taxonomy
  • Integration of OmegaWiki images
  • New definitions from Wikipedia
  • New synsets from OmegaWiki
  • New synsets from Wiktionary
2.5.1 2014/11 50
  • English Wiktionary (February 2014 dump)
  • Wikidata (April 2014 dump)
  • OmegaWiki (August 2013 dump)
  • Open Multi WordNet (downloaded in September 2013)
  • WordNet (version 3.0)
  • Wikipedia (October 2012 dump)
  • Semcor automatic translations
  • Wikipedia automatic translations
  • Links to Wikipedia taxonomy
2.0.1 2014/03 50
  • OmegaWiki (August 2013 dump)
  • Open Multi WordNet (downloaded in September 2013)
  • WordNet (version 3.0)
  • Wikipedia (October 2012 dump)
  • Semcor automatic translations
  • Wikipedia automatic translations
-
1.1 2013/01 6
  • WordNet (version 3.0)
  • Wikipedia (October 2012 dump)
  • Semcor automatic translations
  • Wikipedia automatic translations
  • Links to DBpedia

Languages and Coverage

Turkmen 2406821 2416056
Interlingue 2494487 2500522
Chechen 2561479 2707965
Volapük 2519359 2775472
Venetian 2525483 2549205
Crimean Tatar 2400934 2404814
Cornish 2404181 2411123
Low Saxon 2600149 2643808
Novial 2403764 2406933
Macedonian 2480620 2653977
Indonesian 2993602 4180176
Zeelandic 2404866 2411846
Vepsian 2403508 2412579
Lower Sorbian 2430835 2436580
Estonian 2806883 3302560
Marathi 2441424 2555487
Akan 2402456 2403696
Tibetan 2411179 2428088
Kannada 2430028 2470418
Kinyarwanda 2405001 2409745
Javanese 2458265 2531515
Navajo 2405060 2414020
Danish 3372151 4001884
Rusyn 2403226 2412353
Karachay-Balkar 2401132 2406037
Samogitian 2411419 2418006
Bambara 2421147 2422882
Bishnupriya Manipuri 2413969 2452326
Silesian 2405171 2413424
Kashmiri 2399785 2400909
Korean 2782478 3783368
Upper Sorbian 2439932 2461054
Nauruan 2402174 2404881
Komi 2403037 2411818
Tetum 2402866 2405534
Karakalpak 2403481 2406760
Sardinian 2530136 2540770
Maltese 2559420 5069417
Irish 2728792 3025648
South Azerbaijani 46546 55694
Bislama 2402258 2403869
Fiji Hindi 2411705 2420275
English 8855224 22922522
Lojban 2403326 2409704
Pontic 2399901 2401120
Luganda 2402907 2405099
Malay 2881338 3494556
Russian 4180086 8907098
Komi-Permyak 2401937 2409432
Central Bicolano 2405026 2414702
Avar 2401725 2406153
Võro 2401952 2403784
Kirundi 2401799 2403210
Amharic 2413019 2440540
Swedish 5074788 11226956
Limburgish 2541945 2583260
Lombard 2441851 2497541
Cebuano 4180816 8277352
Manx 2409546 2419594
Lingala 2403665 2408332
Romani 2400367 2401836
Ripuarian 2405331 2411042
Sango 2401482 2402543
Anglo-Saxon 2404921 2414088
Kabyle 2423662 2429034
West Frisian 2449795 2507522
Bulgarian 2848350 3488526
Saterland Frisian 2405377 2412648
Somali 2405655 2414131
Palatinate German 2427546 2432036
Tumbuka 2401560 2402973
Wu 2404359 2417824
Pennsylvania German 2404657 2408473
Telugu 2470766 2577083
Romansh 2551402 2560186
Goan Konkani 5842 6255
Mongolian 2415057 2448231
North Frisian 2435463 2446293
Papiamentu 2421434 2424489
Buryat (Russia) 2400994 2405335
Tok Pisin 2402494 2405127
Chichewa 2401742 2402893
Shona 2404468 2411397
Sorani 2415515 2478417
Tagalog 2642903 3006934
Western Panjabi 2428888 2491457
Turkish 2848081 3608752
Northern Luri 7748 7980
Tongan 2402836 2407104
Inupiak 2402180 2406108
Pali 2401683 2407909
Aromanian 2398762 2400351
Gilaki 2405987 2414003
Ido 2498440 2536697
Kapampangan 2405706 2418144
Luxembourgish 2591106 2659662
Burmese 2439991 2489502
Corsican 2526953 2539010
Inuktitut 2401730 2404496
Thai 2555803 2922602
Swahili 2674081 2885994
Gothic 2400555 2402506
Hawaiian 2403059 2406718
Spanish 4462049 8574586
Erzya 2402627 2408105
Hill Mari 2403676 2424109
Norwegian (Nynorsk) 3177778 3461728
Tigrinya 2401753 2403234
Belarusian 2505212 2781644
Kabardian Circassian 2400826 2403873
Khmer 2412529 2427937
Ewe 2402336 2403770
Walloon 2532588 2567482
Tahitian 2406598 2408897
Greek 2669475 3120804
Assamese 2406307 2426444
Dutch 6109061 9272939
Xhosa 2402265 2403970
Cherokee 2401763 2404606
Albanian 2619867 2848218
Sesotho 2401899 2403172
Cheyenne 2402313 2404098
Serbo-Croatian 435468 4030142
Sinhalese 2417175 2448083
Fijian 2402036 2403338
Armenian 2589935 3167454
Waray-Waray 2524663 4654773
Nahuatl 2408222 2424320
Tajik 2466135 2552897
Slovak 2830523 3315845
Chuvash 2432400 2474382
Moksha 2400457 2403952
Twi 2401278 2402584
Dzongkha 2399876 2401172
Scots 2579946 2636958
Basque 2733188 3128966
Catalan 3137412 4471374
Tswana 2401826 2403290
Tsonga 2401742 2402905
Fula 2401906 2403129
Quechua 2418208 2470534
Tulu 3466 3764
Norwegian (Bokmål) 2847191 3318038
Adyghe 2981 3548
Hakka 2408736 2419515
Lithuanian 2755343 3242601
Hebrew 2687520 3358632
Tuvan 2401261 2404193
Extremaduran 2407412 2412945
Latgalian 2401258 2403421
Lezgian 2402161 2407044
Cantonese 47293 87174
Finnish 3326770 4464995
Kazakh 2559927 2912556
Northern Sotho 2404437 2408539
Arabic 2942886 4267782
Sanskrit 2409160 2436516
Urdu 2483352 2813564
Greenlandic 2409721 2412803
Bengali 2511607 2809032
Abkhazian 2400535 2403210
Norfolk 2401841 2403259
Aymara 2405571 2411363
Chamorro 2401950 2403338
Sakha 2409625 2427879
Friulian 2521147 2528521
Hungarian 3223444 4030352
Yoruba 2420030 2468314
Japanese 3578753 6156727
Lak 2400446 2403719
Maithili 13742 16267
Sranan 2413849 2415911
Asturian 2560160 2618920
Meadow Mari 2406522 2422527
Picard 2521642 2531144
Ukranian 3008530 4548155
Livvinkarjala 6286 7163
Mazandarani 2410643 2433138
Igbo 2405207 2409168
Sundanese 2427683 2453015
Zamboanga Chavacano 2398161 2400405
Min Nan 180158 378944
Tatar 2471851 2604224
Kalmyk 2401419 2405460
Vietnamese 2908228 4492709
Breton 2565586 2657474
Pangasinan 2404776 2411477
Neapolitan 2521225 2540941
Uyghur 2405750 2415068
Min Dong 2403638 2411868
Mirandese 2405117 2409383
Banyumasan 13335 14046
Chinese 3309470 5690940
Ilokano 2410065 2426579
Maori 2414261 2427052
Cree 2400567 2401828
Latvian 2643851 3007433
Minangkabau 2569978 2801893
Aragonese 2571513 2644905
Galician 2674205 2919344
Ossetian 2406772 2434473
Italian 4403808 7059461
Esperanto 2695221 3164890
Oromo 2400764 2402678
Egyptian Arabic 2415716 2445077
Bavarian 2571042 2609507
Tamil 2494485 2642492
Franco-Provençal/Arpitan 2494262 2500481
Banjar 2415969 2419579
Occitan 2582411 2695560
Hausa 2409015 2412137
Yiddish 2413463 2447149
Lao 2405862 2413569
Serbian 2742815 3914108
Guarani 2403959 2410152
Romanian 2980849 4209068
Swati 2401397 2402688
Latin 2558553 2779449
Classical Chinese 4778 8670
Croatian 2787246 3207664
Kashubian 2405797 2413092
French 6517922 10907139
Faroese 2433157 2454913
Emilian-Romagnol 2407865 2419173
Zhuang 2400837 2403671
West Flemish 2522974 2535812
Gagauz 2404469 2407725
Bashkir 2436112 2486083
Hindi 2651024 2996824
Alemannic 2410136 2417100
Sindhi 2407678 2420395
Afrikaans 2722582 2958793
Gujarati 2434166 2474150
Oriya 2411238 2439714
Zazaki 2410373 2421804
Samoan 2402109 2403971
Kikuyu 2402700 2405040
Malagasy 2518472 2657726
Buginese 2416130 2431818
Haitian 2436469 2498229
Northern Sami 2412827 2423793
Nepali 2431291 2474684
Persian 2935502 5326168
Acehnese 2457203 2464358
Aramaic 2400924 2405759
Wolof 2492663 2497485
Georgian 2487130 2677166
Portuguese 3399312 5640967
Mingrelian 2405045 2417916
Interlingua 2517062 2543983
Slovenian 3816966 4266718
Icelandic 2706183 5277113
Ladino 2404482 2410941
Sicilian 2532657 2586155
Zulu 2490547 2494327
Moldovan 2400788 2402080
Newar / Nepal Bhasa 2463531 2587459
Dutch Low Saxon 2401661 2406297
Malayalam 2444915 2596619
Uzbek 2461329 2996008
Tarantino 9175 10490
Divehi 2403359 2409943
Azerbaijani 2501377 2672127
Gan 2404071 2416894
Polish 3545258 5561294
Kurdish 2422967 2466363
Norman 2492195 2500844
Bihari 2401907 2411311
Welsh 2817852 3127258
Piedmontese 2536955 2609108
Udmurt 2402249 2411365
Punjabi 2425996 2464374
Scottish Gaelic 2539993 2571730
Ligurian 2520516 2527706
Old Church Slavonic 2400250 2402976
Pashto 2408192 2421050
Czech 2999270 3907009
Venda 2401572 2402615
Bosnian 2499628 2671798
Belarusian (Taraškievica) 2444028 2477762
Kongo 2490380 2493986
German 5446410 9667871
Kirghiz 2444990 2518148
Patois 35709 36867

Composition of Babel synsets (number of senses)

Turkmen 2392688 0 6291 542 0 0 195 1915
Interlingue 2362319 0 3553 172 0 0 46 257
Chechen 2397020 0 159169 3270 0 0 142 519
Volapük 2294949 0 114598 121289 0 0 1404 4175
Venetian 2351210 0 10810 6570 0 0 433 1436
Crimean Tatar 2395549 0 4906 2501 0 0 178 925
Cornish 2396131 0 3734 1050 0 0 269 692
Low Saxon 2335599 0 23567 8618 0 0 0 1063
Novial 2396078 0 1659 226 0 0 536 476
Macedonian 2383428 0 86730 38613 0 0 1671 8533
Indonesian 2199505 106688 389311 301246 104865 221995 2666 4904
Zeelandic 2393996 0 4331 1320 0 0 0 22
Vepsian 2394462 0 5028 2194 0 0 0 121
Lower Sorbian 2392078 0 3082 568 0 0 207 1093
Estonian 2293749 0 147586 113713 104029 218092 6045 6764
Marathi 2397438 0 44770 36059 0 0 799 2007
Akan 2396794 0 315 45 0 0 2 34
Tibetan 2397659 0 9925 2019 0 0 0 859
Kannada 2397469 0 20693 5957 0 0 272 939
Kinyarwanda 2396753 0 1889 1059 0 0 0 111
Javanese 2386263 0 50347 16582 0 0 224 531
Navajo 2397154 0 3637 2325 0 0 514 2557
Danish 2078098 5859 209911 133078 104680 220910 7797 12610
Rusyn 2395540 0 5981 1041 0 0 0 312
Karachay-Balkar 2396874 0 1991 1826 0 0 86 182
Samogitian 2396603 0 15941 5299 0 0 0 163
Bambara 2373785 0 483 154 0 0 16 55
Bishnupriya Manipuri 2397667 0 25232 173 0 0 86 41
Silesian 2394405 0 5083 1541 0 0 24 213
Kashmiri 2397676 0 324 48 0 0 0 35
Korean 2380120 20415 339164 314953 101409 83467 3476 12536
Upper Sorbian 2389128 0 10501 4386 0 0 252 643
Nauruan 2396522 0 1269 426 0 0 12 90
Komi 2395043 0 4768 1550 0 0 0 0
Tetum 2396810 0 1405 321 0 0 67 102
Karakalpak 2396450 0 2008 346 0 0 19 107
Sardinian 2360112 0 5577 1147 0 0 0 1560
Maltese 4749184 0 3210 1837 104893 181127 786 2139
Irish 2324931 92922 39669 7026 105676 210051 1536 12861
South Azerbaijani 0 0 29527 3469 0 0 0 38
Bislama 2396465 0 652 113 0 0 57 52
Fiji Hindi 2394457 0 9869 659 0 0 0 30
English 39255 206941 4953358 7239644 0 0 46088 285911
Lojban 2397098 0 1270 1656 0 0 228 120
Pontic 2397674 0 441 51 0 0 2 23
Luganda 2397181 0 1177 222 0 0 695 67
Malay 2268538 105028 289325 46889 105272 221641 960 6557
Russian 548289 0 1284956 1699201 101218 180868 9509 48306
Komi-Permyak 2394793 0 3477 1767 0 0 45 99
Central Bicolano 2393865 0 7017 931 0 0 5 90
Avar 2397434 0 2399 773 0 0 11 197
Võro 2397123 0 5481 988 0 0 192 0
Kirundi 2397117 0 620 122 0 0 0 6
Amharic 2397486 0 13784 5961 0 0 180 722
Swedish 716439 6904 3321034 2490645 103924 218278 11372 23944
Limburgish 2359982 0 10821 25327 0 0 356 312
Lombard 2361203 0 34252 14544 0 0 198 188
Cebuano 1181027 0 2879221 1606422 0 0 1012 452
Manx 2395241 0 4822 1886 0 0 283 2300
Lingala 2396835 0 2808 868 0 0 81 133
Romani 2397256 0 675 105 0 0 73 11
Ripuarian 2395998 0 2819 1118 0 0 0 7
Sango 2397181 0 264 58 0 0 0 23
Anglo-Saxon 2396020 0 2854 2954 0 0 343 1387
Kabyle 2373390 0 3212 919 0 0 12 65
West Frisian 2386493 0 31779 18537 0 0 755 1315
Bulgarian 601177 8936 213731 111720 103321 202674 6569 13312
Saterland Frisian 2395349 0 3736 1926 0 0 0 117
Somali 2396416 0 4686 2337 0 0 76 291
Palatinate German 2393743 0 2054 679 0 0 0 2
Tumbuka 2396895 0 636 54 0 0 0 6
Wu 2397447 0 5154 3540 0 0 0 115
Pennsylvania German 2396126 0 1926 689 0 0 103 89
Telugu 2396918 0 60215 23096 0 0 1900 3480
Romansh 2357778 0 3312 757 0 0 0 1900
Goan Konkani 0 0 3528 211 0 0 0 11
Mongolian 2395250 0 17831 5567 0 0 581 2939
North Frisian 2391868 0 4533 4070 0 0 0 305
Papiamentu 2373333 0 1619 366 0 0 0 132
Buryat (Russia) 2397613 0 1795 847 0 0 0 3
Tok Pisin 2396011 0 1307 238 0 0 104 426
Chichewa 2397145 0 355 42 0 0 20 221
Shona 2396662 0 2979 3263 0 0 0 100
Sorani 2396975 0 17981 38358 0 0 394 303
Tagalog 2348767 0 63976 97881 105723 220509 942 7001
Western Panjabi 2395650 0 42694 3027 0 0 0 65
Turkish 2249960 0 271838 237011 102454 178888 5431 13796
Northern Luri 0 0 5411 180 0 0 0 4
Tongan 2397068 0 1669 1171 0 0 34 143
Inupiak 2396778 0 634 259 0 0 0 47
Pali 2397342 0 3173 34 0 0 0 88
Aromanian 2397666 0 1235 362 0 0 0 1088
Gilaki 2397667 0 6488 1738 0 0 7 5
Ido 2354241 0 26525 3155 0 0 1240 4322
Kapampangan 2394095 0 8656 905 0 0 0 93
Luxembourgish 2331815 0 45847 11794 0 0 729 1830
Burmese 2397199 0 35914 5142 0 0 280 2426
Corsican 2359996 0 5313 2800 0 0 289 253
Inuktitut 2397521 0 477 821 0 0 0 242
Thai 2384451 95517 105127 132920 0 0 2004 6109
Swahili 2335980 0 32778 16399 105889 217359 1353 3822
Gothic 2397553 0 589 365 0 0 0 316
Hawaiian 2393977 0 2058 234 0 0 168 1005
Spanish 116779 144560 1204975 1601302 102919 210195 36266 34932
Erzya 2395400 0 3293 1004 0 0 73 351
Hill Mari 2396833 0 10491 1974 0 0 3 35
Norwegian (Nynorsk) 2115868 4768 127921 75733 0 0 1036 5674
Tigrinya 2397542 0 293 80 0 0 31 196
Belarusian 2388479 0 113661 104107 0 0 0 6146
Kabardian Circassian 2397655 0 1602 300 0 0 2 114
Khmer 2397446 0 6507 2198 0 0 1762 3692
Ewe 2396868 0 354 74 0 0 151 338
Walloon 2352473 0 13966 13472 0 0 412 1796
Tahitian 2396157 0 1263 83 0 0 22 121
Greek 2365238 24106 119211 63002 103737 196342 6592 18748
Assamese 2397327 0 4430 11486 0 0 25 882
Dutch 41285 60237 1785941 646622 103434 223016 26728 24993
Xhosa 2397171 0 744 149 0 0 226 143
Cherokee 2397481 0 825 658 0 0 161 573
Albanian 2342602 9599 61937 17474 104324 176024 547 3222
Sesotho 2397132 0 406 49 0 0 132 275
Cheyenne 2396829 0 763 94 0 0 0 66
Serbo-Croatian 0 0 411536 3511155 0 0 0 17541
Sinhalese 2396624 0 16207 7207 0 0 150 930
Fijian 2396788 0 350 67 0 0 17 143
Armenian 2377430 0 198543 304218 0 0 2213 10719
Waray-Waray 1212965 0 1257037 830337 0 0 0 230
Nahuatl 2392612 0 9546 4338 0 0 0 322
Tajik 2394650 0 48026 15750 0 0 526 2644
Slovak 2244442 44030 194040 59071 103049 178702 6193 6083
Chuvash 2393281 0 34360 2989 0 0 130 484
Moksha 2397324 0 1306 707 0 0 20 125
Twi 2396968 0 620 22 0 0 0 1
Dzongkha 2397655 0 259 118 0 0 179 71
Scots 2339742 0 39200 10852 0 0 147 930
Basque 2196712 48933 249367 73230 0 0 5626 2860
Catalan 2116108 99153 513521 352575 102636 207673 5038 15255
Tswana 2397067 0 692 52 0 0 23 71
Tsonga 2397200 0 463 77 0 0 0 17
Fula 2396872 0 254 107 0 0 0 16
Quechua 2391033 0 19438 18824 0 0 0 1263
Tulu 0 0 981 128 0 0 0 10
Norwegian (Bokmål) 2264546 5592 429746 255144 105121 222144 7000 11833
Adyghe 0 0 512 259 0 0 0 195
Hakka 2393423 0 6662 2052 0 0 0 105
Lithuanian 2306301 16032 173157 77722 103628 189236 2336 5376
Hebrew 2373318 6543 187643 166068 102179 158341 4251 8226
Tuvan 2397634 0 1699 455 0 0 33 347
Extremaduran 2394012 0 3006 1080 0 0 39 127
Latgalian 2397497 0 880 125 0 0 217 790
Lezgian 2397638 0 2760 650 0 0 40 124
Cantonese 0 0 47159 38745 0 0 629 641
Finnish 2161846 189309 386366 236279 103511 203132 10905 46084
Kazakh 2374416 0 219880 45861 0 0 524 3047
Northern Sotho 2393094 0 3708 220 0 0 0 41
Arabic 2342566 37352 448290 439743 102704 144952 4376 11532
Sanskrit 2397258 0 11108 10031 0 0 497 955
Urdu 2392939 0 106011 185642 0 0 1018 2770
Greenlandic 2377744 0 1680 205 0 0 0 629
Bengali 2389005 0 46142 173519 0 0 997 2615
Abkhazian 2397499 0 965 497 0 0 47 346
Norfolk 2397106 0 524 95 0 0 0 62
Aymara 2396223 0 3919 413 0 0 0 120
Chamorro 2396695 0 440 107 0 0 7 49
Sakha 2396072 0 12097 3411 0 0 0 392
Friulian 2361724 0 3280 666 0 0 103 1361
Hungarian 2085325 0 387777 174325 103208 210267 6915 15477
Yoruba 2379182 0 31458 9706 0 0 56 311
Japanese 2378138 169234 978100 618505 100028 95378 8333 26646
Lak 2397464 0 1244 633 0 0 0 66
Maithili 0 0 10621 936 0 0 0 17
Sranan 2394831 0 1157 108 0 0 18 60
Asturian 2338094 0 52611 8657 0 0 805 2846
Meadow Mari 2395239 0 9151 4590 0 0 19 3
Picard 2363282 0 3334 1584 0 0 27 41
Ukranian 2337575 0 635073 390761 101126 178219 2310 8389
Livvinkarjala 0 0 2023 295 0 0 0 17
Mazandarani 2397612 0 12540 5018 0 0 5 108
Igbo 2396471 0 1228 1372 0 0 23 109
Sundanese 2391541 0 19290 3237 0 0 60 403
Zamboanga Chavacano 2396960 0 3230 209 0 0 0 6
Min Nan 0 0 179680 197353 0 0 1101 810
Tatar 2392108 0 68822 54137 0 0 278 1250
Kalmyk 2395445 0 2262 360 0 0 20 443
Vietnamese 1508622 0 1142833 189517 105159 204513 2211 7992
Breton 2335190 0 56140 18910 0 0 3434 1753
Pangasinan 2395830 0 5042 569 0 0 0 19
Neapolitan 2353859 0 14421 1202 0 0 1052 338
Uyghur 2397456 0 4226 1660 0 0 90 1233
Min Dong 2396842 0 3760 1039 0 0 0 85
Mirandese 2396085 0 2847 395 0 0 45 265
Banyumasan 0 0 13333 712 0 0 0 1
Chinese 2296874 97661 880634 730982 100717 131623 6859 22233
Ilokano 2390178 0 8205 6138 0 0 53 175
Maori 2395210 0 7108 855 0 0 257 9820
Cree 2397217 0 245 117 0 0 0 70
Latvian 2333154 0 67043 93106 104012 193955 1913 6838
Minangkabau 2215230 0 221912 948 0 0 14 31
Aragonese 2335472 0 30605 25233 0 0 450 447
Galician 2303822 53118 126843 50854 0 0 2042 8379
Ossetian 2395361 0 9968 9924 0 0 183 547
Italian 132552 74197 1252572 648881 103199 214096 27642 28051
Esperanto 2260775 0 221362 155397 0 0 4900 10757
Oromo 2397512 0 946 250 0 0 0 32
Egyptian Arabic 2396866 0 15745 7275 0 0 236 1016
Bavarian 2350512 0 19356 12810 0 0 758 212
Tamil 2394892 0 89399 35833 0 0 4766 1776
Franco-Provençal/Arpitan 2363551 0 2681 424 0 0 44 118
Banjar 2395125 0 1581 329 0 0 7 36
Occitan 2280615 0 79580 11546 0 0 586 2406
Hausa 2395485 0 1839 212 0 0 0 540
Yiddish 2395070 0 13442 10604 0 0 504 3170
Lao 2397631 0 2337 727 0 0 287 2539
Serbian 2311100 0 316684 528212 103077 166290 6202 0
Guarani 2395967 0 3179 1481 0 0 0 251
Romanian 2189583 84639 360443 502649 104246 189667 3423 17098
Swati 2397275 0 437 88 0 0 7 48
Latin 1245646 0 122922 49217 0 0 1866 7631
Classical Chinese 0 0 4792 3878 0 0 0 0
Croatian 2277846 47876 150030 48731 103878 192888 2148 0
Kashubian 2395666 0 5218 556 0 0 68 338
French 41414 102659 1682778 1397323 102427 212936 29180 38650
Faroese 2369527 0 12076 4877 0 0 597 3564
Emilian-Romagnol 2392571 0 8630 2145 0 0 12 0
Zhuang 2397467 0 1305 225 0 0 0 332
West Flemish 2360056 0 5877 3430 0 0 406 64
Gagauz 2396517 0 2854 146 0 0 22 106
Bashkir 2395230 0 36134 6584 0 0 139 1896
Hindi 2381331 0 108875 42930 104525 197939 2160 5654
Alemannic 2389305 0 21335 5194 0 0 0 0
Sindhi 2397520 0 8045 1519 0 0 37 257
Afrikaans 2310836 0 42430 21546 104995 227060 2609 2688
Gujarati 2397464 0 26757 2158 0 0 654 1043
Oriya 2393721 0 11036 10925 0 0 46 400
Zazaki 2392217 0 6840 2761 0 0 0 618
Samoan 2397107 0 835 290 0 0 50 184
Kikuyu 2396436 0 1438 46 0 0 3 189
Malagasy 2324520 0 82476 41431 0 0 76 370
Buginese 2380850 0 14139 110 0 0 8 17
Haitian 2382524 0 49944 5354 0 0 249 617
Northern Sami 2393591 0 7407 1214 0 0 67 566
Nepali 2397497 0 30784 4529 0 0 164 515
Persian 2383470 30461 502456 1357906 104004 165441 3380 9162
Acehnese 2393376 0 3201 1001 0 0 0 63
Aramaic 2395714 0 1498 1552 0 0 0 424
Wolof 2364390 0 1230 438 0 0 179 296
Georgian 2382497 0 104014 36106 0 0 8238 9450
Portuguese 1988086 106665 899215 697511 103381 206218 13273 35675
Mingrelian 2397274 0 6728 2867 0 0 97 110
Interlingua 2270407 0 19635 3030 0 0 2610 1794
Slovenian 2122797 70945 145770 61346 103441 166438 5439 6116
Icelandic 4639514 16004 40168 23266 105472 211004 2652 9308
Ladino 2395670 0 3820 1481 0 0 0 248
Sicilian 2350822 0 24378 15542 0 0 845 1222
Zulu 2364451 0 937 279 0 0 308 505
Moldovan 2397414 0 427 117 0 0 0 0
Newar / Nepal Bhasa 2395762 0 72774 27800 0 0 23 66
Dutch Low Saxon 2396820 0 5679 3723 0 0 0 75
Malayalam 2388765 0 44468 69417 0 0 502 1234
Uzbek 2324253 0 127628 316906 0 0 219 2698
Tarantino 0 0 9175 1315 0 0 0 0
Divehi 2397556 0 4130 725 0 0 171 372
Azerbaijani 2366905 0 108681 33596 0 0 0 3223
Gan 2395597 0 6410 2763 0 0 0 11
Polish 2024393 52397 1133169 399139 103033 167689 9672 20053
Kurdish 2394852 0 22026 13308 0 0 283 2751
Norman 2362757 0 3560 2051 0 0 0 111
Bihari 2395850 0 7167 3991 0 0 0 9
Welsh 2293623 0 85609 42001 105431 209698 1817 4758
Piedmontese 2325717 0 62704 2947 0 0 1000 148
Udmurt 2396529 0 3878 2390 0 0 33 262
Punjabi 2396898 0 24631 10261 0 0 0 634
Scottish Gaelic 2357942 0 13959 8172 0 0 1148 6634
Ligurian 2363231 0 3277 992 0 0 103 178
Old Church Slavonic 2395767 0 570 808 0 0 1 431
Pashto 2397642 0 7927 1908 0 0 0 1201
Czech 2220656 0 352144 232046 103419 187643 7695 19008
Venda 2397195 0 303 42 0 0 2 21
Bosnian 2365799 0 66220 98236 0 0 728 0
Belarusian (Taraškievica) 2394333 0 59050 23215 0 0 1164 0
Kongo 2364220 0 1202 193 0 0 0 64
German 82197 0 1721252 1350747 102403 202866 27810 44831
Kirghiz 2395364 0 56045 3755 0 0 1028 2530
Patois 0 0 1648 767 0 0 0 28
Turkmen 11668 132 0 1 1 0 0 2623
Interlingue 134138 37 0 0 0 0 0 0
Chechen 147017 828 0 0 0 0 0 0
Volapük 238583 472 0 1 1 0 0 0
Venetian 178520 226 0 0 0 0 0 0
Crimean Tatar 655 100 0 0 0 0 0 0
Cornish 8678 567 0 1 1 0 0 0
Low Saxon 273967 993 0 1 0 0 0 0
Novial 7882 76 0 0 0 0 0 0
Macedonian 128656 2647 0 0 0 0 0 3699
Indonesian 840527 3238 0 219 32 0 0 4980
Zeelandic 12128 49 0 0 0 0 0 0
Vepsian 10563 211 0 0 0 0 0 0
Lower Sorbian 39457 95 0 0 0 0 0 0
Estonian 400729 2947 0 339 68 0 0 8499
Marathi 69838 1377 0 35 12 0 0 3152
Akan 6500 6 0 0 0 0 0 0
Tibetan 16511 1115 0 0 0 0 0 0
Kannada 41587 352 0 7 0 0 0 3142
Kinyarwanda 7108 113 0 0 0 0 0 2712
Javanese 77261 307 0 0 0 0 0 0
Navajo 7724 109 0 0 0 0 0 0
Danish 1215601 2131 0 56 11 0 0 11142
Rusyn 9383 96 0 0 0 0 0 0
Karachay-Balkar 5045 33 0 0 0 0 0 0
Samogitian 0 0 0 0 0 0 0 0
Bambara 48369 19 0 1 0 0 0 0
Bishnupriya Manipuri 25428 3699 0 0 0 0 0 0
Silesian 12012 146 0 0 0 0 0 0
Kashmiri 2766 59 0 1 0 0 0 0
Korean 505129 6811 0 372 163 0 0 15353
Upper Sorbian 55825 319 0 0 0 0 0 0
Nauruan 6387 175 0 0 0 0 0 0
Komi 10337 120 0 0 0 0 0 0
Tetum 6732 97 0 0 0 0 0 0
Karakalpak 7821 9 0 0 0 0 0 0
Sardinian 172286 88 0 0 0 0 0 0
Maltese 23039 357 0 0 0 0 0 2845
Irish 226103 1413 0 0 0 0 0 3460
South Azerbaijani 22660 0 0 0 0 0 0 0
Bislama 6508 22 0 0 0 0 0 0
Fiji Hindi 15246 14 0 0 0 0 0 0
English 9617044 473414 308 20219 16089 0 0 20645
Lojban 9263 69 0 0 0 0 0 0
Pontic 2907 22 0 0 0 0 0 0
Luganda 5753 4 0 0 0 0 0 0
Malay 444205 1461 0 0 0 0 0 4680
Russian 5002693 0 0 8150 8189 0 0 15719
Komi-Permyak 9170 81 0 0 0 0 0 0
Central Bicolano 12741 53 0 0 0 0 0 0
Avar 5309 30 0 0 0 0 0 0
Võro 0 0 0 0 0 0 0 0
Kirundi 5322 23 0 0 0 0 0 0
Amharic 18689 662 0 2 1 0 0 3053
Swedish 4316525 6005 0 0 0 0 0 11886
Limburgish 186016 437 0 9 0 0 0 0
Lombard 86885 271 0 0 0 0 0 0
Cebuano 2608713 505 0 0 0 0 0 0
Manx 14779 283 0 0 0 0 0 0
Lingala 7554 53 0 0 0 0 0 0
Romani 3676 40 0 0 0 0 0 0
Ripuarian 11038 62 0 0 0 0 0 0
Sango 5004 13 0 0 0 0 0 0
Anglo-Saxon 10268 257 0 3 2 0 0 0
Kabyle 51408 28 0 0 0 0 0 0
West Frisian 67946 697 0 0 0 0 0 0
Bulgarian 2206549 10810 0 1959 859 0 0 6909
Saterland Frisian 11462 58 0 0 0 0 0 0
Somali 9987 338 0 0 0 0 0 0
Palatinate German 35379 179 0 0 0 0 0 0
Tumbuka 5376 6 0 0 0 0 0 0
Wu 11231 337 0 0 0 0 0 0
Pennsylvania German 9450 90 0 0 0 0 0 0
Telugu 87716 306 0 238 77 0 0 3137
Romansh 196190 249 0 0 0 0 0 0
Goan Konkani 2505 0 0 0 0 0 0 0
Mongolian 24582 1481 0 0 0 0 0 0
North Frisian 45441 76 0 0 0 0 0 0
Papiamentu 48999 40 0 0 0 0 0 0
Buryat (Russia) 5035 42 0 0 0 0 0 0
Tok Pisin 6979 62 0 0 0 0 0 0
Chichewa 5107 3 0 0 0 0 0 0
Shona 8389 4 0 0 0 0 0 0
Sorani 23833 573 0 0 0 0 0 0
Tagalog 161514 621 0 0 0 0 0 0
Western Panjabi 45114 4907 0 0 0 0 0 0
Turkish 532293 2861 0 1145 189 0 0 12886
Northern Luri 2385 0 0 0 0 0 0 0
Tongan 7007 12 0 0 0 0 0 0
Inupiak 8377 13 0 0 0 0 0 0
Pali 7266 6 0 0 0 0 0 0
Aromanian 0 0 0 0 0 0 0 0
Gilaki 8093 5 0 0 0 0 0 0
Ido 146445 769 0 0 0 0 0 0
Kapampangan 14119 276 0 0 0 0 0 0
Luxembourgish 264193 577 0 2 1 0 0 2874
Burmese 48265 276 0 0 0 0 0 0
Corsican 170184 173 0 2 0 0 0 0
Inuktitut 3684 42 0 0 0 0 0 1709
Thai 183154 3480 0 0 0 0 0 9840
Swahili 171912 502 0 0 0 0 0 0
Gothic 3657 26 0 0 0 0 0 0
Hawaiian 9110 166 0 0 0 0 0 0
Spanish 5081012 13254 0 5122 7158 0 0 16112
Erzya 7952 32 0 0 0 0 0 0
Hill Mari 14329 444 0 0 0 0 0 0
Norwegian (Nynorsk) 1124995 1751 0 474 3 0 0 3505
Tigrinya 2809 43 0 0 0 0 0 2240
Belarusian 163592 3140 0 37 1 0 0 2481
Kabardian Circassian 4165 35 0 0 0 0 0 0
Khmer 13202 316 0 0 0 0 0 2814
Ewe 5979 6 0 0 0 0 0 0
Walloon 185159 204 0 0 0 0 0 0
Tahitian 11218 33 0 0 0 0 0 0
Greek 207662 5719 0 517 244 0 0 9686
Assamese 9376 52 0 0 0 0 0 2866
Dutch 6339708 7921 0 92 24 0 0 12938
Xhosa 5522 15 0 0 0 0 0 0
Cherokee 3822 24 0 0 0 0 0 1062
Albanian 128341 1008 0 61 6 0 0 3073
Sesotho 5169 9 0 0 0 0 0 0
Cheyenne 6331 15 0 0 0 0 0 0
Serbo-Croatian 89910 0 0 0 0 0 0 0
Sinhalese 26890 75 0 0 0 0 0 0
Fijian 5963 10 0 0 0 0 0 0
Armenian 264450 5363 0 555 1087 0 0 2876
Waray-Waray 1350413 3791 0 0 0 0 0 0
Nahuatl 17324 178 0 0 0 0 0 0
Tajik 88015 1039 0 0 0 0 0 2247
Slovak 470228 2001 0 648 184 0 0 7174
Chuvash 42641 497 0 0 0 0 0 0
Moksha 4427 43 0 0 0 0 0 0
Twi 4964 9 0 0 0 0 0 0
Dzongkha 2801 89 0 0 0 0 0 0
Scots 245762 325 0 0 0 0 0 0
Basque 546138 1874 0 234 105 0 0 3887
Catalan 1049898 4538 0 1385 359 0 0 3235
Tswana 5379 6 0 0 0 0 0 0
Tsonga 5141 7 0 0 0 0 0 0
Fula 5866 14 0 0 0 0 0 0
Quechua 36686 447 0 1 1 0 0 2841
Tulu 2645 0 0 0 0 0 0 0
Norwegian (Bokmål) 258 4953 0 605 59 0 0 11037
Adyghe 2582 0 0 0 0 0 0 0
Hakka 17070 203 0 0 0 0 0 0
Lithuanian 354451 4855 0 558 410 0 0 8539
Hebrew 334092 4824 0 1946 685 0 0 10516
Tuvan 4022 3 0 0 0 0 0 0
Extremaduran 14558 123 0 0 0 0 0 0
Latgalian 3895 17 0 0 0 0 0 0
Lezgian 5789 43 0 0 0 0 0 0
Cantonese 0 0 0 0 0 0 0 0
Finnish 1110494 4291 0 1133 182 0 0 11463
Kazakh 254916 8744 0 1 0 0 0 5167
Northern Sotho 11472 4 0 0 0 0 0 0
Arabic 713651 10728 0 705 125 0 0 11058
Sanskrit 16298 323 0 27 19 0 0 0
Urdu 120772 1102 0 77 9 0 0 3224
Greenlandic 32473 72 0 0 0 0 0 0
Bengali 195710 1044 0 0 0 0 0 0
Abkhazian 3810 46 0 0 0 0 0 0
Norfolk 5429 43 0 0 0 0 0 0
Aymara 10523 165 0 0 0 0 0 0
Chamorro 6018 22 0 0 0 0 0 0
Sakha 15581 326 0 0 0 0 0 0
Friulian 161292 95 0 0 0 0 0 0
Hungarian 1031721 2540 0 823 85 0 0 11889
Yoruba 44562 113 0 0 0 0 0 2926
Japanese 1742403 23101 0 68 29 0 0 16764
Lak 4276 36 0 0 0 0 0 0
Maithili 4693 0 0 0 0 0 0 0
Sranan 19731 6 0 0 0 0 0 0
Asturian 215450 439 0 17 1 0 0 0
Meadow Mari 13315 210 0 0 0 0 0 0
Picard 162697 179 0 0 0 0 0 0
Ukranian 875120 16775 0 1988 819 0 0 0
Livvinkarjala 4828 0 0 0 0 0 0 0
Mazandarani 17099 756 0 0 0 0 0 0
Igbo 7053 41 0 0 0 0 0 2871
Sundanese 38368 115 0 1 0 0 0 0
Zamboanga Chavacano 0 0 0 0 0 0 0 0
Min Nan 0 0 0 0 0 0 0 0
Tatar 84466 339 0 1 2 0 0 2821
Kalmyk 6834 96 0 0 0 0 0 0
Vietnamese 1321752 4823 0 162 24 0 0 5101
Breton 240407 1615 0 25 0 0 0 0
Pangasinan 9996 21 0 0 0 0 0 0
Neapolitan 169161 908 0 0 0 0 0 0
Uyghur 7328 864 0 1 2 0 0 2208
Min Dong 10120 22 0 0 0 0 0 0
Mirandese 9703 43 0 0 0 0 0 0
Banyumasan 0 0 0 0 0 0 0 0
Chinese 1365185 30131 0 0 0 0 0 28041
Ilokano 21614 216 0 0 0 0 0 0
Maori 10921 91 0 0 0 0 0 2790
Cree 4164 15 0 0 0 0 0 0
Latvian 196075 2639 0 0 0 0 0 8698
Minangkabau 363739 19 0 0 0 0 0 0
Aragonese 251838 860 0 0 0 0 0 0
Galician 369172 1291 0 302 47 0 0 3474
Ossetian 17460 1030 0 0 0 0 0 0
Italian 4498963 16833 0 11793 9441 0 24323 16918
Esperanto 506340 5074 0 231 54 0 0 0
Oromo 3923 15 0 0 0 0 0 0
Egyptian Arabic 23617 322 0 0 0 0 0 0
Bavarian 225595 264 0 0 0 0 0 0
Tamil 111332 1207 0 124 24 0 0 3139
Franco-Provençal/Arpitan 133220 443 0 0 0 0 0 0
Banjar 22480 21 0 0 0 0 0 0
Occitan 318152 2675 0 0 0 0 0 0
Hausa 10988 14 0 0 0 0 0 3059
Yiddish 23739 620 0 0 0 0 0 0
Lao 8617 214 0 0 0 0 0 1217
Serbian 458648 19600 0 212 49 0 0 4034
Guarani 9200 74 0 0 0 0 0 0
Romanian 745412 3542 0 273 58 0 0 8035
Swati 4799 34 0 0 0 0 0 0
Latin 1347070 5021 0 40 36 0 0 0
Classical Chinese 0 0 0 0 0 0 0 0
Croatian 375634 1456 0 506 40 0 0 6631
Kashubian 11008 238 0 0 0 0 0 0
French 7243043 15290 0 4025 834 19654 0 16926
Faroese 63932 340 0 0 0 0 0 0
Emilian-Romagnol 15815 0 0 0 0 0 0 0
Zhuang 4309 30 0 1 2 0 0 0
West Flemish 165862 117 0 0 0 0 0 0
Gagauz 8033 47 0 0 0 0 0 0
Bashkir 45574 526 0 0 0 0 0 0
Hindi 146473 1313 0 53 10 0 0 5561
Alemannic 988 277 0 1 0 0 0 0
Sindhi 10771 15 0 0 0 0 0 2231
Afrikaans 242692 741 0 21 19 0 0 3156
Gujarati 42664 266 0 5 1 0 0 3138
Oriya 23401 185 0 0 0 0 0 0
Zazaki 19211 157 0 0 0 0 0 0
Samoan 5486 19 0 0 0 0 0 0
Kikuyu 6921 7 0 0 0 0 0 0
Malagasy 208766 87 0 0 0 0 0 0
Buginese 36688 6 0 0 0 0 0 0
Haitian 59140 401 0 0 0 0 0 0
Northern Sami 20565 383 0 0 0 0 0 0
Nepali 38095 248 0 0 0 0 0 2852
Persian 703651 60351 0 2067 425 0 0 3394
Acehnese 66612 105 0 0 0 0 0 0
Aramaic 6372 199 0 0 0 0 0 0
Wolof 128162 74 0 1 2 0 0 2713
Georgian 130400 3211 0 342 67 0 0 2841
Portuguese 1564692 10364 0 5053 1855 0 0 8979
Mingrelian 10568 272 0 0 0 0 0 0
Interlingua 246308 199 0 0 0 0 0 0
Slovenian 1576206 1021 0 329 131 0 0 6739
Icelandic 224222 958 0 147 0 0 0 4398
Ladino 9560 162 0 0 0 0 0 0
Sicilian 192765 581 0 0 0 0 0 0
Zulu 127814 33 0 0 0 0 0 0
Moldovan 4122 0 0 0 0 0 0 0
Newar / Nepal Bhasa 89992 1042 0 0 0 0 0 0
Dutch Low Saxon 0 0 0 0 0 0 0 0
Malayalam 88252 606 0 122 113 0 0 3140
Uzbek 220885 357 0 15 4 0 0 3043
Tarantino 0 0 0 0 0 0 0 0
Divehi 6862 127 0 0 0 0 0 0
Azerbaijani 153271 2603 0 217 15 0 0 3616
Gan 12023 90 0 0 0 0 0 0
Polish 1607985 10589 0 17936 1277 0 0 13962
Kurdish 30273 566 0 52 38 0 0 2214
Norman 132243 122 0 0 0 0 0 0
Bihari 4248 46 0 0 0 0 0 0
Welsh 379549 788 0 205 3 0 0 3776
Piedmontese 216134 458 0 0 0 0 0 0
Udmurt 8057 216 0 0 0 0 0 0
Punjabi 28595 524 0 0 0 0 0 2831
Scottish Gaelic 181122 458 0 0 0 0 0 2295
Ligurian 159766 159 0 0 0 0 0 0
Old Church Slavonic 5283 116 0 0 0 0 0 0
Pashto 10861 292 0 0 0 0 0 1219
Czech 764761 2578 0 3298 169 0 0 13592
Venda 5048 4 0 0 0 0 0 0
Bosnian 136742 670 0 112 21 0 0 3270
Belarusian (Taraškievica) 0 0 0 0 0 0 0 0
Kongo 128279 28 0 0 0 0 0 0
German 6105754 11062 0 1403 262 0 0 17284
Kirghiz 59099 324 0 3 0 0 0 0
Patois 34424 0 0 0 0 0 0 0

Number of senses by part of speech

Turkmen 2415660 218 30 148
Interlingue 2500481 21 4 16
Chechen 2707930 1 6 28
Volapük 2774689 185 98 500
Venetian 2548726 294 44 141
Crimean Tatar 2404650 68 34 62
Cornish 2411013 50 10 50
Low Saxon 2643622 114 16 56
Novial 2406636 135 47 115
Macedonian 2652180 827 161 809
Indonesian 4121221 44602 2041 12312
Zeelandic 2411844 0 0 2
Vepsian 2412573 2 0 4
Lower Sorbian 2436325 157 35 63
Estonian 3299368 1393 620 1179
Marathi 2555121 180 18 168
Akan 2403684 1 0 11
Tibetan 2427982 48 18 40
Kannada 2470199 120 13 86
Kinyarwanda 2409589 99 1 56
Javanese 2531428 34 5 48
Navajo 2413743 88 77 112
Danish 3994740 3376 836 2932
Rusyn 2412320 8 12 13
Karachay-Balkar 2406027 2 1 7
Samogitian 2417994 2 0 10
Bambara 2422875 2 0 5
Bishnupriya Manipuri 2452324 0 0 2
Silesian 2413413 3 0 8
Kashmiri 2400906 1 0 2
Korean 3777991 3352 570 1455
Upper Sorbian 2460926 80 5 43
Nauruan 2404871 1 0 9
Komi 2411816 0 0 2
Tetum 2405531 0 0 3
Karakalpak 2406756 0 0 4
Sardinian 2540298 357 13 102
Maltese 5066927 1049 566 875
Irish 2983608 16536 1822 23682
South Azerbaijani 55687 4 0 3
Bislama 2403856 5 3 5
Fiji Hindi 2420273 0 0 2
English 22728996 61155 19653 112718
Lojban 2409648 30 8 18
Pontic 2401118 0 0 2
Luganda 2404954 79 11 55
Malay 3435963 43450 2078 13065
Russian 8890918 7805 1990 6385
Komi-Permyak 2409418 8 0 6
Central Bicolano 2414696 2 2 2
Avar 2406136 5 2 10
Võro 2403773 4 1 6
Kirundi 2403208 0 0 2
Amharic 2440309 136 8 87
Swedish 11214908 5655 1185 5208
Limburgish 2583219 18 9 14
Lombard 2497472 34 20 15
Cebuano 8277169 70 29 84
Manx 2418927 103 11 553
Lingala 2408312 9 2 9
Romani 2401826 3 0 7
Ripuarian 2411040 0 0 2
Sango 2402541 0 0 2
Anglo-Saxon 2413574 278 38 198
Kabyle 2429028 2 0 4
West Frisian 2507239 119 40 124
Bulgarian 3478406 5695 799 3626
Saterland Frisian 2412641 4 0 3
Somali 2414105 5 9 12
Palatinate German 2432034 0 0 2
Tumbuka 2402971 0 0 2
Wu 2417814 5 2 3
Pennsylvania German 2408431 20 9 13
Telugu 2576025 530 95 433
Romansh 2559630 382 24 150
Goan Konkani 6254 0 1 0
Mongolian 2447716 299 51 165
North Frisian 2446191 85 2 15
Papiamentu 2424470 4 2 13
Buryat (Russia) 2405333 0 0 2
Tok Pisin 2405005 73 18 31
Chichewa 2402873 6 5 9
Shona 2411388 0 0 9
Sorani 2478296 69 12 40
Tagalog 3003684 1236 582 1432
Western Panjabi 2491455 0 0 2
Turkish 3602768 2388 881 2715
Northern Luri 7980 0 0 0
Tongan 2407083 4 2 15
Inupiak 2406106 0 0 2
Pali 2407905 2 0 2
Aromanian 2399957 258 36 100
Gilaki 2414001 0 0 2
Ido 2535452 593 109 543
Kapampangan 2418123 0 5 16
Luxembourgish 2659084 336 30 212
Burmese 2489056 246 64 136
Corsican 2538978 20 1 11
Inuktitut 2404366 82 2 46
Thai 2900240 13428 2609 6325
Swahili 2883023 1469 603 899
Gothic 2402412 47 15 32
Hawaiian 2406514 70 20 114
Spanish 8509828 28955 5667 30136
Erzya 2408087 5 2 11
Hill Mari 2424107 0 0 2
Norwegian (Nynorsk) 3459743 1115 89 781
Tigrinya 2403093 92 4 45
Belarusian 2780433 730 102 379
Kabardian Circassian 2403866 1 0 6
Khmer 2427005 553 92 287
Ewe 2403714 34 7 15
Walloon 2566871 361 91 159
Tahitian 2408873 9 1 14
Greek 3108533 7526 888 3857
Assamese 2426057 248 18 121
Dutch 9247320 15721 1774 8124
Xhosa 2403942 6 4 18
Cherokee 2404436 97 9 64
Albanian 2841963 4681 586 988
Sesotho 2403116 48 2 6
Cheyenne 2404096 0 0 2
Serbo-Croatian 4028053 887 215 987
Sinhalese 2447982 58 9 34
Fijian 2403307 13 4 14
Armenian 3164842 1085 228 1299
Waray-Waray 4654744 19 3 7
Nahuatl 2424268 27 8 17
Tajik 2552443 251 40 163
Slovak 3294152 12871 2113 6709
Chuvash 2474347 11 3 21
Moksha 2403945 1 2 4
Twi 2402582 0 0 2
Dzongkha 2401168 1 0 3
Scots 2636698 86 59 115
Basque 3118855 9584 50 477
Catalan 4437688 16938 2330 14418
Tswana 2403286 0 0 4
Tsonga 2402902 0 0 3
Fula 2403127 0 0 2
Quechua 2469867 516 18 133
Tulu 3764 0 0 0
Norwegian (Bokmål) 3311378 3323 806 2531
Adyghe 3541 0 2 5
Hakka 2419506 2 4 3
Lithuanian 3233655 2466 1553 4927
Hebrew 3353196 2468 832 2136
Tuvan 2404113 48 2 30
Extremaduran 2412926 7 2 10
Latgalian 2403162 143 16 100
Lezgian 2407035 1 0 8
Cantonese 87033 69 16 56
Finnish 4389848 30557 7738 36852
Kazakh 2911975 301 43 237
Northern Sotho 2408537 0 0 2
Arabic 4240563 23925 910 2384
Sanskrit 2436351 44 25 96
Urdu 2812898 318 85 263
Greenlandic 2412743 31 6 23
Bengali 2808658 151 43 180
Abkhazian 2403190 3 5 12
Norfolk 2403252 2 0 5
Aymara 2411355 0 0 8
Chamorro 2403333 1 0 4
Sakha 2427861 5 2 11
Friulian 2528100 282 13 126
Hungarian 4024181 2565 883 2723
Yoruba 2468125 107 5 77
Japanese 6079747 48326 8033 20621
Lak 2403717 0 0 2
Maithili 16267 0 0 0
Sranan 2415909 0 0 2
Asturian 2618211 257 36 416
Meadow Mari 2422523 0 0 4
Picard 2531138 4 0 2
Ukranian 4545260 1518 631 746
Livvinkarjala 7162 1 0 0
Mazandarani 2433125 8 0 5
Igbo 2408983 105 5 75
Sundanese 2452890 54 8 63
Zamboanga Chavacano 2400402 0 1 2
Min Nan 378614 183 30 117
Tatar 2603919 171 20 114
Kalmyk 2405428 15 2 15
Vietnamese 4488855 1912 660 1282
Breton 2656870 283 76 245
Pangasinan 2411470 0 2 5
Neapolitan 2540664 173 22 82
Uyghur 2414841 116 10 101
Min Dong 2411836 16 0 16
Mirandese 2409325 25 10 23
Banyumasan 14046 0 0 0
Chinese 5631695 17371 4445 37429
Ilokano 2426562 9 2 6
Maori 2423289 2154 122 1487
Cree 2401825 0 0 3
Latvian 3003818 1500 694 1421
Minangkabau 2801884 0 1 8
Aragonese 2644852 19 9 25
Galician 2899623 7359 1212 11150
Ossetian 2434425 14 5 29
Italian 7012558 21682 5394 19827
Esperanto 3161938 1159 394 1399
Oromo 2402675 0 0 3
Egyptian Arabic 2444920 56 29 72
Bavarian 2609298 112 29 68
Tamil 2641139 779 110 464
Franco-Provençal/Arpitan 2500462 13 1 5
Banjar 2419575 0 0 4
Occitan 2694829 387 55 289
Hausa 2411946 115 3 73
Yiddish 2446461 294 68 326
Lao 2413044 300 38 187
Serbian 3912145 1013 502 448
Guarani 2410125 11 3 13
Romanian 4173374 19253 4969 11472
Swati 2402681 0 0 7
Latin 2776844 1361 317 927
Classical Chinese 8670 0 0 0
Croatian 3184693 18929 950 3092
Kashubian 2413058 11 6 17
French 10851748 25838 6098 23455
Faroese 2454325 232 74 282
Emilian-Romagnol 2419171 0 0 2
Zhuang 2403618 32 6 15
West Flemish 2535769 17 5 21
Gagauz 2407718 0 0 7
Bashkir 2485799 44 43 197
Hindi 2993837 1212 617 1158
Alemannic 2417098 0 0 2
Sindhi 2420232 101 9 53
Afrikaans 2955859 1355 614 965
Gujarati 2473879 131 8 132
Oriya 2439690 8 5 11
Zazaki 2421641 88 25 50
Samoan 2403949 2 2 18
Kikuyu 2405037 0 1 2
Malagasy 2657666 17 15 28
Buginese 2431809 0 0 9
Haitian 2498079 83 16 51
Northern Sami 2423737 32 4 20
Nepali 2474486 116 5 77
Persian 5313588 5999 832 5749
Acehnese 2464353 0 0 5
Aramaic 2405734 5 5 15
Wolof 2497313 101 10 61
Georgian 2673552 1461 367 1786
Portuguese 5592243 21859 4357 22508
Mingrelian 2417912 0 2 2
Interlingua 2543249 338 75 321
Slovenian 4236934 15006 1447 13331
Icelandic 5266610 5473 741 4289
Ladino 2410910 10 2 19
Sicilian 2585808 219 17 111
Zulu 2494242 45 6 34
Moldovan 2402078 0 0 2
Newar / Nepal Bhasa 2587457 0 0 2
Dutch Low Saxon 2406293 0 1 3
Malayalam 2596329 174 14 102
Uzbek 2995551 243 30 184
Tarantino 10490 0 0 0
Divehi 2409933 1 1 8
Azerbaijani 2671586 270 39 232
Gan 2416892 0 0 2
Polish 5547278 3432 1058 9526
Kurdish 2465503 411 124 325
Norman 2500793 34 12 5
Bihari 2411309 0 0 2
Welsh 3124052 1440 558 1208
Piedmontese 2608998 39 10 61
Udmurt 2411337 7 2 19
Punjabi 2464175 109 6 84
Scottish Gaelic 2569818 749 198 965
Ligurian 2527672 17 3 14
Old Church Slavonic 2402871 56 3 46
Pashto 2420752 109 32 157
Czech 3899812 2905 1007 3285
Venda 2402613 0 0 2
Bosnian 2671599 126 5 68
Belarusian (Taraškievica) 2477673 59 6 24
Kongo 2493976 1 0 9
German 9649761 7234 1999 8877
Kirghiz 2517625 229 43 251
Patois 36859 5 3 0
284 810208738 632196 113255 534805

Number of concepts, Named Entities and definitions

Turkmen 1827708 579113 6090
Interlingue 1846306 648181 3565
Chechen 1918581 642898 170752
Volapük 1849461 669898 115047
Venetian 1849474 676009 12701
Crimean Tatar 1823263 577671 5071
Cornish 1825029 579152 4671
Low Saxon 1854215 745934 103669
Novial 1825045 578719 1750
Macedonian 1864578 616042 115666
Indonesian 2049991 943611 1505707
Zeelandic 1824638 580228 12965
Vepsian 1824530 578978 5168
Lower Sorbian 1825813 605022 3010
Estonian 2033259 773624 708378
Marathi 1834954 606470 45153
Akan 1824458 577998 340
Tibetan 1832943 578236 9917
Kannada 1835077 594951 30018
Kinyarwanda 1826581 578420 1637
Javanese 1833808 624457 146013
Navajo 1826681 578379 3577
Danish 2040605 1331546 595904
Rusyn 1824792 578434 8027
Karachay-Balkar 1824082 577050 1404
Samogitian 1833292 578127 15939
Bambara 1826324 594823 444
Bishnupriya Manipuri 1824238 589731 25261
Silesian 1824999 580172 5318
Kashmiri 1823248 576537 314
Korean 2037406 745072 367609
Upper Sorbian 1828714 611218 21874
Nauruan 1824145 578029 1120
Komi 1824949 578088 4887
Tetum 1824528 578338 1175
Karakalpak 1824744 578737 1961
Sardinian 1848950 681186 7074
Maltese 1952307 607113 4636
Irish 2007026 721766 56442
South Azerbaijani 11090 35456 29569
Bislama 1824191 578067 645
Fiji Hindi 1828349 583356 9886
English 3229307 5625917 11235344
Lojban 1825777 577549 1387
Pontic 1823232 576669 437
Luganda 1824930 577977 876
Malay 2023899 857439 305531
Russian 2594393 1585693 2508276
Komi-Permyak 1824570 577367 3618
Central Bicolano 1824900 580126 8388
Avar 1824052 577673 2141
Võro 1824935 577017 5491
Kirundi 1824169 577630 611
Amharic 1828201 584818 20618
Swedish 2339760 2735028 3848586
Limburgish 1850704 691241 19842
Lombard 1829211 612640 42074
Cebuano 2049584 2131232 2933384
Manx 1827085 582461 74216
Lingala 1825316 578349 2805
Romani 1823420 576947 761
Ripuarian 1824967 580364 4694
Sango 1823978 577504 261
Anglo-Saxon 1825105 579816 3115
Kabyle 1827011 596651 3001
West Frisian 1839660 610135 173948
Bulgarian 2130804 717546 2125681
Saterland Frisian 1825196 580181 4092
Somali 1826324 579331 4644
Palatinate German 1824398 603148 71508
Tumbuka 1824011 577549 631
Wu 1825015 579344 5272
Pennsylvania German 1824601 580056 1810
Telugu 1867150 603616 67069
Romansh 1856268 695134 3368
Goan Konkani 2315 3527 3531
Mongolian 1830930 584127 19244
North Frisian 1826309 609154 74924
Papiamentu 1826643 594791 1541
Buryat (Russia) 1823766 577228 1789
Tok Pisin 1824488 578006 1265
Chichewa 1824185 577557 350
Shona 1826191 578277 6200
Sorani 1828532 586983 19941
Tagalog 1968206 674697 66959
Western Panjabi 1829482 599406 38835
Turkish 1997878 850203 315698
Northern Luri 2669 5079 4839
Tongan 1824919 577917 1444
Inupiak 1824322 577858 627
Pali 1823531 578152 3035
Aromanian 1823044 575718 1156
Gilaki 1825725 580262 1484
Ido 1849926 648514 929685
Kapampangan 1825260 580446 9055
Luxembourgish 1869325 721781 110822
Burmese 1836702 603289 99075
Corsican 1849058 677895 5708
Inuktitut 1824768 576962 454
Thai 1922688 633115 117018
Swahili 1986315 687766 41401
Gothic 1823610 576945 492
Hawaiian 1824877 578182 2045
Spanish 2352214 2109835 3481454
Erzya 1824730 577897 3662
Hill Mari 1825385 578291 10490
Norwegian (Nynorsk) 1899364 1278414 443555
Tigrinya 1825146 576607 281
Belarusian 1862787 642425 139310
Kabardian Circassian 1823840 576986 1516
Khmer 1833645 578884 6952
Ewe 1824410 577926 355
Walloon 1856098 676490 18276
Tahitian 1824608 581990 1056
Greek 1985173 684302 371703
Assamese 1826988 579319 9190
Dutch 2469868 3639193 7011398
Xhosa 1824624 577641 1663
Cherokee 1824740 577023 816
Albanian 1963230 656637 1552991
Sesotho 1824309 577590 67884
Cheyenne 1824417 577896 761
Serbo-Croatian 79612 355856 476602
Sinhalese 1828392 588783 19923
Fijian 1824168 577868 348
Armenian 1896958 692977 1174531
Waray-Waray 1874333 650330 1266698
Nahuatl 1826269 581953 9483
Tajik 1837759 628376 59321
Slovak 2017580 812943 242619
Chuvash 1837542 594858 36313
Moksha 1823738 576719 1336
Twi 1823974 577304 608
Dzongkha 1823353 576523 221
Scots 1864544 715402 68459
Basque 1925639 807549 365320
Catalan 2101647 1035765 2270216
Tswana 1824200 577626 637
Tsonga 1824099 577643 383
Fula 1824037 577869 254
Quechua 1832473 585735 20329
Tulu 1758 1708 1038
Norwegian (Bokmål) 2043941 803250 433060
Adyghe 1472 1509 510
Hakka 1825152 583584 7064
Lithuanian 2017910 737433 223268
Hebrew 1969232 718288 1952058
Tuvan 1824327 576934 1457
Extremaduran 1824881 582531 107874
Latgalian 1824121 577137 791
Lezgian 1824462 577699 2485
Cantonese 25484 21809 46773
Finnish 2148219 1178551 733440
Kazakh 1918329 641598 267073
Northern Sotho 1824954 579483 3397
Arabic 2018295 924591 2831010
Sanskrit 1827221 581939 11871
Urdu 1843797 639555 109152
Greenlandic 1825986 583735 1691
Bengali 1868539 643068 1599057
Abkhazian 1823626 576909 931
Norfolk 1824126 577715 447
Aymara 1824722 580849 4112
Chamorro 1824087 577863 444
Sakha 1828295 581330 12814
Friulian 1848632 672515 3656
Hungarian 2054656 1168788 676121
Yoruba 1828442 591588 103346
Japanese 2509973 1068780 1170979
Lak 1823734 576712 1227
Maithili 3531 10211 19578
Sranan 1826537 587312 12915
Asturian 1862793 697367 191512
Meadow Mari 1826581 579941 12056
Picard 1847584 674058 73160
Ukranian 2080040 928490 733181
Livvinkarjala 2867 3419 2015
Mazandarani 1826348 584295 13225
Igbo 1826457 578750 2367
Sundanese 1828951 598732 48379
Zamboanga Chavacano 1822102 576059 3225
Min Nan 17130 163028 179692
Tatar 1862223 609628 69756
Kalmyk 1824028 577391 1802
Vietnamese 2049763 858465 1167379
Breton 1861709 703877 82719
Pangasinan 1826626 578150 4997
Neapolitan 1848523 672702 21761
Uyghur 1826746 579004 2464
Min Dong 1824833 578805 3854
Mirandese 1825150 579967 3024
Banyumasan 725 12610 13267
Chinese 2355184 954286 939966
Ilokano 1825720 584345 15635
Maori 1834142 580119 10174
Cree 1823434 577133 219
Latvian 1979042 664809 183457
Minangkabau 1862964 707014 233331
Aragonese 1862006 709507 937167
Galician 1902317 771888 1781173
Ossetian 1827034 579738 11978
Italian 2337907 2065901 2963855
Esperanto 1903714 791507 293937
Oromo 1823495 577269 882
Egyptian Arabic 1826304 589412 16115
Bavarian 1853551 717491 35442
Tamil 1850559 643926 924671
Franco-Provençal/Arpitan 1846269 647993 3450
Banjar 1823971 591998 2145
Occitan 1867189 715222 235986
Hausa 1826936 582079 1795
Yiddish 1829333 584130 25530
Lao 1828460 577402 2356
Serbian 1994378 748437 418067
Guarani 1825231 578728 3215
Romanian 2046784 934065 2110376
Swati 1824066 577331 415
Latin 1934165 624388 150914
Classical Chinese 1948 2830 4774
Croatian 2013445 773801 193230
Kashubian 1825685 580112 6226
French 2636931 3880991 4615106
Faroese 1833440 599717 14481
Emilian-Romagnol 1826668 581197 4527
Zhuang 1823787 577050 1311
West Flemish 1848801 674173 6072
Gagauz 1825503 578966 2862
Bashkir 1843507 592605 79254
Hindi 1971146 679878 156620
Alemannic 1826683 583453 21361
Sindhi 1827327 580351 6983
Afrikaans 2001092 721490 88236
Gujarati 1846679 587487 48872
Oriya 1828409 582829 21598
Zazaki 1825791 584582 7204
Samoan 1824322 577787 809
Kikuyu 1824346 578354 1417
Malagasy 1852148 666324 82922
Buginese 1823475 592655 14152
Haitian 1832928 603541 51230
Northern Sami 1826062 586765 17721
Nepali 1833799 597492 45321
Persian 2003411 932091 704527
Acehnese 1826422 630781 76887
Aramaic 1824019 576905 1422
Wolof 1848475 644188 1033
Georgian 1855410 631720 184781
Portuguese 2132769 1266543 2095733
Mingrelian 1824864 580181 76548
Interlingua 1866430 650632 29659
Slovenian 2031374 1785592 212681
Icelandic 1999188 706995 59286
Ladino 1824850 579632 4928
Sicilian 1854973 677684 47978
Zulu 1846547 644000 3006
Moldovan 1823526 577262 382
Newar / Nepal Bhasa 1841307 622224 73647
Dutch Low Saxon 1823943 577718 5647
Malayalam 1841909 603006 52855
Uzbek 1856211 605118 137611
Tarantino 416 8759 9167
Divehi 1824577 578782 4080
Azerbaijani 1867917 633460 172543
Gan 1825821 578250 8234
Polish 2186653 1358605 1362728
Kurdish 1835425 587542 22168
Norman 1846708 645487 3586
Bihari 1823338 578569 7179
Welsh 2004157 813695 117458
Piedmontese 1853736 683219 2759
Udmurt 1824446 577803 3887
Punjabi 1835134 590862 27799
Scottish Gaelic 1855456 684537 17311
Ligurian 1847958 672558 3078
Old Church Slavonic 1823575 576675 563
Pashto 1826726 581466 7720
Czech 2061649 937621 1482523
Venda 1824077 577495 295
Bosnian 1854230 645398 91916
Belarusian (Taraškievica) 1840406 603622 59040
Kongo 1846067 644313 1185
German 2470049 2976361 5554443
Kirghiz 1854099 590891 54922
Patois 4781 30928 1646