en.wikipedia.org

List of dictionaries by number of words - Wikipedia

This is a list of dictionaries considered authoritative or complete by approximate number of total words, or headwords, included number of words in a language.[1][2] In compiling a dictionary, a lexicographer decides whether the evidence of use is sufficient to justify an entry in the dictionary. This decision is not the same as determining whether the word exists.[citation needed]

The green background means a given dictionary is the largest in a given language.

Language Approx. no. of headwords Approx. no. of definitions Dictionary Notes Tamil 1,533,669 Sorkuvai Archived October 28, 2021, at the Wayback Machine An online open dictionary run by the Tamil Nadu government.[3] The dictionary contains 1,533,669 words.[4] Korean 1,149,538 Urimalsaem Online open dictionary run by South Korean government, including North Korean dialects(66,172 words).[5] Portuguese 818,000 Aulete Digital Online dictionary including expressions.[6] Finnish 800,000 RedFox Pro Online dictionary. The free version has over 300,000 Finnish words and the Pro version has over 800,000 Finnish words. The dictionary has agglomerated other dictionaries, such as technical ones,[7] and the largest set comes from Wordnet.[8] This dictionary essentially does not include inflections. English 795,606 1,578,939 English Wiktionary Contains 853,456 gloss entries and 1,578,939 total definitions.[9][10] Swedish 600,000 Svenska Akademiens ordbok, Swedish Academy After having completed letters A through T SAOB included 470,000 words, but 600,000 words when the alphabet was completed in 2017. Korean 511,282 Standard Korean Language Dictionary[11] Contains 511,282 entries. Italian 500,000 Grande Dizionario Hoepli Italiano[12][13] The number of "sayable and writable" word-forms is estimated at over 2 million.[14] Japanese 500,000 Nihon Kokugo Daijiten[15] Lithuanian 500,000 Lietuvių kalbos žodynas (Academic Dictionary of Lithuanian) 22,000 pages in 20 volumes with quotations from all kinds of writing and dialect records between 1547 and 2001. Accessible online at www.lkz.lt.[16] English 470,000 Webster's Third New International Dictionary and Addenda Section Contains 470,000 entries.[17] Northern Kurdish 411,864 Kurdish Wiktionary Contains 411,864 entries with all conjugations of words in Northern Kurdish and other Kurdish varieties, among a total of 1,000,051 entries, including loanwords from non-Kurdish languages.[18][19] French 408,078 French Wiktionary Contains more than 408,000 lemma, associated to more than 636,500 definitions and 1,880,500 inflections, distributed other 1,924,200 entries.[20] Serbo-Croatian 400,000 Rječnik hrvatskoga ili srpskoga jezika Published from 1880 to 1976 in 97 fascicles collected into 23 volumes under the auspices of the Yugoslav Academy of Sciences and Arts, estimated at a minimum of 400,000 words by Dragica Malić.[21][22] Includes only words found in the Shtokavian dialect; words from Chakavian and Kajkavian dialects are excluded. Dutch 400,000 Woordenboek der Nederlandsche Taal The 43 volumes of the WNT (including three supplements) consist of 49,255 pages, describing Dutch words from 1500 to 1976.[23] Chinese 378,103 Hanyu Da Cidian The 3rd edition of the digital version contains 18,014 single-character words, 336,706 compound words, 23,383 idioms (chengyu), 504,040 definitions, and 861,956 examples.[24] Finnish 350,000 Suomen murteiden sanakirja (in progress) Suomen murteiden sanakirja (SMS) will include 350,000 words from different dialects, with well-documented definitions, based on the archives (Suomen murteiden sana-arkisto) of 400,000 words, with over 8 million definitions.[25][26] Persian 343,466 Dehkhoda Dictionary, 1998, ISBN 9789640396025 The original series initially consisted of 3 million records (Persian: فیش (French: fiche) or برگه "barge") (up to 100 meanings/records for each word or proper noun) until Dehkhoda's death in March 1956, and currently contains 343,466 entries that, according to the latest digital release of the dictionary by Tehran University Press (version 3.0) are based on an ever-growing library of over 2,300 volumes in lexicology and various other scientific fields.[27][28][circular reference] Norwegian 330,000 Norsk Ordbok The finished dictionary has about 330,000 headwords, whereas the corpus it's built upon contains about 500,000 words in total.[29] German 330,000 Deutsches Wörterbuch 330,000 words in use since the mid-fifteenth century.[30] Arabic[notes 1] 120,000 Taj al-ʿArus Min Jawahir al-Qamus The dictionary includes 120,000 entries filling 40 volumes, whereby one entry comprises dozens of words.[33] Turkish 316,000 Ötüken Türkçe Sözlük[34] Turkish dictionary (Old and Middle Turkic, Ottoman and Modern Turkish), includes 316,000 entries.[35] Dutch 300,000 Etymologisch woordenboek van het Nederlands[36] Norwegian 300,000 Tanums store rettskrivningsordbok (10. utgave) A dictionary of orthography.[29] Ukrainian 300,000 Український лексикон кінця XVIII — початку XXI ст.: словник-індекс: у 3-х томах Лексичний словник.[37] Urdu 286,563 823,743 Rekhta Dictionary[38] Rekhta Dictionary, an initiative of Rekhta Foundation is a multi-lingual dictionary available in three scripts, i.e. Urdu, Devanagari and Roman. Gujarati 281,377 Bhagavadgomandal 2.81 lakh words and their meanings in 9 volumes. Also serves as an encyclopedia with almost 8.22 lakh words.[39] Romanian 280,000 dexonline Online dictionary. Project of digitisation of 67 general, specialty and archaic dictionaries. Launched in 2001. As of 2023, it contained over 280,000 unique words and 1,065,000 definitions. English 273,000 600,000 Oxford English Dictionary, Second Edition Oxford Dictionary has 273,000 headwords; 171,476 of them being in current use, 47,156 being obsolete words and around 9,500 derivative words included as subentries. The dictionary contains 157,000 combinations and derivatives, and 169,000 phrases and combinations, making a total of over 600,000 word-forms.[40][41]
There is one count that puts the English vocabulary at about 1 million words—but that count presumably includes words such as Latin species names, prefixed and suffixed words, scientific terminology, jargon, foreign words of extremely limited English use and technical acronyms.[42][43][44] Urdu 264,000 Urdu Lughat[45][46] Ukrainian 253,000 Великий орфографічний словник сучасної української лексики A dictionary of orthography. Contains 253,000 entries (253,000 words).[47][48] Czech 250,000 Příruční slovník jazyka českého [cs] Nine volumes of this dictionary were printed in years 1935–1957. They contain about 250,000 words, their meanings and example usage from literature. The dictionary is available online.[49][50] Serbo-Croatian 241,000 Dictionary of Serbo-Croatian Literary and Vernacular Language This dictionary is incomplete. So far, 20 volumes of the planned 40 have been published. These 20 volumes contain 241.000 headwords. When complete, this Dictionary will have around 500.000 headwords.[51] Portuguese 228,000 382,000 Houaiss Dictionary of the Portuguese Language 228,000 entries and 382,000 meanings.[52] Belarusian 223,000 Вялікі слоўнік беларускай мовы: арфаграфія, акцэнтуацыя, парадыгматыка[53] Russian 220,000 250,000 Толковый словарь живого великорусского языка The 3rd edition by Baudouin de Courtenay contains about 250,000 entries (220,000 words and 30,000 proverbs).[54][55] Indonesian 208,283 229,443 Kamus Besar Bahasa Indonesia, 6th edition, 2024 Finnish 201,000 Nykysuomen sanakirja, 1961 Nykysuomen sanakirja can be translated to The Dictionary of Modern Finnish or The Dictionary of Contemporary Finnish, but the language can be quite dated; the dictionary only reflects the language as it was no later than 1961. Even though it has been published again, it has not been updated. The dictionary contains over 201,000 headwords in six volumes.[56] For modern language, The New Dictionary of Modern Finnish is more relevant. English 200,000 The American Heritage Dictionary of the English Language, Fifth Edition The third edition contained "more than 200,000 boldface forms" (entries).[57] The fourth and fifth editions each added roughly 10,000 additional "new words and senses".[58] It is not clear how many of these were new words. German 200,000 Großes Wörterbuch der deutschen Sprache Dictionary by the Berlin-Brandenburg Academy of Sciences and Humanities of over 200,000 contemporary words.[59] Norwegian (bokmål) 200,000 Det Norske Akademis ordbok The Norwegian Academy Dictionary contains more than 200,000 entries and more than 300,000 literary quotes. Furthermore, it contains fixed expressions and pronunciation.The dictionary is free and edited daily. Danish 200,000 Ordbog over det danske Sprog Dictionary maintained by the Society for Danish Language and Literature [da]. Covers Danish language use 1700–1950.[60] The society also maintains a sister dictionary, Den Danske Ordbog [da] covering language use since 1950. Slovak 200,000 Slovník slovenského jazyka z r. 1959 – 1968, Slovník súčasného slovenského jazyka A – G, H – L, M – N z r. 2006, 2011, 2015 Here is the information about the number of words in Slovak written by Jazykovedný ústav Ľ. Štúra SAV. Tibetan 195,919 Rangjung Yeshe Dharma Dictionary Considering the large number of Buddhist terminology, colloquial expressions and modern literary Tibetan neologisms not included in this dictionary, the actual total number is probably about twice the number of terms included on this website (195,919), perhaps 375–400,000 Tibetan words in total.

[61]

Sanskrit 186,000 Monier-Williams Sanskrit-English Dictionary, 1899 (IITS Koelin) The work of Monier Monier-Williams, who as a Boden Professor of Sanskrit at the University of Oxford.[62][63] Hindi 183,175 Hindi Wiktionary A free dictionary that gives everyone the right to edit.[64] Kazakh 166,000 15 томдық "Қазақ тілінің түсіндірме сөздігі" Explanatory dictionary of the Kazakh language.[65] English 155,327 207,016 WordNet, 3.1 As of November 2012 WordNet's latest Online-version is 3.1. The database contains 155,327 words organized in 175,979 synsets for a total of 207,016 word-sense pairs.[66] Romanian 155,000 DLR Dicționarul Limbii Române. Belarusian 150,000 Слоўнік беларускай мовы[67] Icelandic 150,000 Orðabók Blöndals[68] The dictionary contains 150,000 headwords in 17 volumes.[69] Russian 150,000 Большой академический словарь русского языка Great Academy Dictionary of Russian language.[70] Swiss German 150,000 Schweizerisches Idiotikon[71] The dictionary contains 150,000 words from the late Middle Ages to today.[72] German 148,000 Duden – Die deutsche Rechtschreibung The most influential dictionary in Germany, a dictionary of orthography.[73] Polish 140,000 Wielki słownik ortograficzny PWN Big orthography dictionary PWN contains new words, proper nouns and latest spelling changes. German 141,154 German Wiktionary Contains 141,154 german gloss entries.[74][75] French 135,000 Trésor de la Langue Française informatisé ATILF[76] (Analyse et Traitement Informatique de la Langue Française – Computer Processing and Analysis of the French Language)
135,000 (Larousse Dictionnaire de français, published by Editions Larousse).[77][78] Ukrainian 134,058 Словник української мови (The Dictionary of the Ukrainian language) The dictionary was finished in late 1970s–early 1980s.[79][80] Dutch 134,000 Woordenlijst Nederlandse Taal (Het Groene Boekje)[81] Russian 130,000 Большой толковый словарь русского языка Great Dictionary of Russian language.[82] Swedish 130,000 Rikstermbanken[83] Sweden's national term bank.[84] Swedish 126,000 Svenska Akademiens ordlista (SAOL)[85] Normative Swedish language spelling dictionary, which includes only commonly used words, currently includes ~126,000 words,[86] after having added 13,500 and removed 9,000 in its latest edition, SAOL 14, plus an additional 200,000 still encountered words in earlier editions.[87][88] Eastern Armenian 125,000 Ժամանակակից հայոց լեզվի բացատրական բառարան Žamanakakic' hayoc' lezvi bac'atrakan baṙaran.[89] Tamil 124,405 University of Madras Tamil Lexicon The dictionary includes 124,405 separate entries.[90] Malaysian 120,000 Kamus Dewan Perdana, 1st Edition, 2020 Arabic[notes 2] 120,000 Taj al-ʿArus Min Jawahir al-Qamus The dictionary includes 120,000 entries filling 40 volumes, whereby one entry comprises dozens of words.[33] Albanian 120,000 Fjalor i gjuhes shqipe, 2024 70,000 words added from the previous edition of 1980[93] Frisian 120,000 Het Wurdboek fan de Fryske taal[94] Dictionary of New Frisian (Nieuwfries) from 1800 to 1975.[95] Finland Swedish 120,000 Ordbok över Finlands svenska folkmål[96] (in progress) The dictionary includes around 120,000 headwords.[97] Bulgarian 119,200 Dictionary of the Bulgarian Language (monolingual academic explanatory dictionary), (Многотомен) Речник на българския език in Bulgarian, in 15+ volumes This dictionary covers vocabulary from the last 150 years of Bulgarian and is compiled and edited by linguistics (primarily native lexicographers and lexicologists) from The Institute for the Bulgarian Language (part of the Bulgarian Academy of Sciences). It includes basic, commonly used, literary, colloquial, dialectical, archaic and obsolete Bulgarian words, as well as some specialized terminology. The latest volume (15th) published in 2015 ends with headwords beginning with the (Bulgarian Cyrillic) letter Р.[98] French 116,000 Le Dictionnaire universel francophone (DUF)[99] Dictionary published by Hachette.[100] Georgian 112,949 ქართული ენის განმარტებითი ლექსიკონი (Explanatory dictionary of the Georgian language) The electronic version of this Dictionary, consisting of 129755 records, is the joint project of Arnold Chikobava Institute of Linguistics and Language Modeling Association. The project was financed by Rustaveli Foundation.[101] Belarusian 112,462 Skarnik As of August 2019. Belarusian-Russian online dictionary contains 112,462 words.[102] Slovene 110,180 Slovar slovenskega knjižnega jezika, Second edition, 2014 The official dictionary of modern Slovene is Slovar slovenskega knjižnega jezika (SSKJ; Standard Slovene Dictionary). It was published in five volumes by Državna Založba Slovenije between 1970 and 1991 and contains more than 100,000 entries and subentries with accentuation, part-of-speech labels, common collocations, and various qualifiers. In the 1990s, an electronic version of the dictionary was published and it is available online. Finnish 102,174 Kielitoimiston sanakirja, 2018 Online dictionary. Institute for the Languages of Finland (governmental institute) has selected the core vocabulary, and many headwords are not included.[103] Afrikaans 100,000 Handwoordeboek van die Afrikaanse Taal (HAT), 2015 New 6th edition contains 3228 new keywords and 5365 meaning.[104] French 100,000 Le Grand Robert, 2019 Contains 100,000 words and 350,000 definitions.[105] German 100,000 Österreichisches Wörterbuch, 2018 Official dictionary of the German language in the Republic of Austria.[106] Polish 100,000 Słownik języka polskiego PWN Polish dictionary of PWN contains about 100,000 articles and 145,000 definitions.[107] Russian 100,000 Орфографический словарь русского языка[108] Normative Russian dictionary,[109] the dictionary includes around 100,000 words.[110] Turkish 98,738 Güncel Türkçe Sözlük[111] Online dictionary of the Turkish Language Association[112] Contains 64,994 words, 33,744 idioms and proverbs and 132,548 meanings. Turkish 93,000 Misalli Büyük Türkçe Sözlük (Kubbealtı Lugatı)[113] Turkish dictionary that is composed of modern and Ottoman Turkish, includes 60,000 entries and 33,000 idioms with around 100,000 examples based on literary works of 840 writers. It's online version is named "Kubbealtı Lugatı".[114] Spanish 93,000 Diccionario de la lengua española de la Real Academia Española, 23rd edition, 2014[115] Central Kurdish 93,000 فەرهەنگی زانستگای کوردستان University of Kurdistan Dictionary: Kurdish-Persian, Rohani, M. (2018)[116] The four volumes of the Kurdish-to-Persian and Persian-to-Kurdish dictionary have 93,000 vocabulary terms. Director Majid Rouhani worked for ten years to complete it.[117] Spanish 90,000 Diccionario de uso del español [es], 2007 Contains 90,000 keywords and 190,000 meaning. Dutch 90,000 Van Dale, 14th edition, 2005 [118] Catalan 88,500 172,000 Gran Diccionari de la llengua catalana (Great Dictionary of the Catalan language, includes the definitions in the Diccionari de la llengua catalana) Contains 88,500 headwords and 172,000 definitions.[119] Chinese 85,568 Zhonghua Zihai The largest character dictionary covering all varieties of Chinese, a compilation of Chinese characters in use over three millennia of written history.[120][121][122] Arabic 83,015 المعجم المعاصر2019 The first version (2019) contains approximately 83,015 entries.[123][124] Scots 80,259 Dictionaries of the Scots Language An online dictionary comprising A Dictionary of the Older Scottish Tongue (covering the period up to 1700) and The Scottish National Dictionary (from 1700). Hebrew 80,000 The Academy of the Hebrew Language Notes for word count can be found on The Academy of the Hebrew Language website [125] Arabic 80,000 Lisan Al-Arab The dictionary includes around 80,000 entries.[33] French 80,000 Dictionnaire de la langue française[126] Four-volume dictionary[127] of the French language by Émile Littré.[128] Uzbek 80,000 Oʻzbek tilining izohli lugʻati (Annotated Dictionary of the Uzbek Language) The largest Uzbek language dictionary, made of five volumes and including around 80,000 entries.[129] Middle Dutch 75,000 Het Middelnederlandsch Woordenboek [nl][130] Historical dictionary of Middle Dutch: 1250–1550.[131] Romanian 67,000 Dicționarul explicativ al limbii române (Published by the Romanian Academy) Tamazight 65,716 Amawal Ameqran, Abdelhafed Idres. 2017 Swedish 65,000 Svensk ordbok utgiven av Svenska Akademien (Svensk ordbok, SO)[132] The dictionary includes around 65,000 headwords.[132] Punjabi 64,263 Mahan Kosh First published in 1930, the dictionary includes 64,263 entries in Gurmukhi script. It has been republished many times.[133][134][135][136] Swedish 64,000 Ordbok öfver svenska språket (Dalin Ordbok)[137] The dictionary includes around 64,000 headwords.[138] Arabic 60,000 Al-Qamus al-Muhit wa al-Qabus al-Wasit[notes 3] The dictionary includes around 60,000 entries.[139] Dutch 60,000 Groot Woordenboek Afrikaans en Nederlands[140] Dutch-Afrikaans dictionary.[141] Turkish 60,000 Osmanlıca-Türkçe Ansiklopedik Lûgat[142] Ottoman Turkish dictionary, includes 60,000 entries.[143] Galician 59,999 Dicionario da Real Academia Galega (Dictionary of the Royal Galician Academy)[144] Western Armenian 56,000 Հայոց լեզուի նոր բառարան Hayoc' lezowi nor baṙaran.[145][clarification needed] Tatar 56,000 Татарско-русский словарь Ш.Н. Асылгараева, Ф.А. Ганиева, М.З. Закиева, К.М. Миннуллина, Д.Б. Рамазанова Tatar-Russian dictionary of Sh.N. Asylgaraev, F.A. Ganiev, M.Z. Zakiyev, K.M. Minnullin, D.B. Ramazanova.[146] French 55,000 Dictionnaire de l'Académie française (DAF) Normative French dictionary,[147] once complete, it will contain 55,000 words.[148] Dutch 52,000 Woordenlijst Nederlandse Taal[149] Normative Dutch dictionary,[150] the dictionary includes around 52,000 entries and around 134.000 derivative words.[151] Dutch 50,000 Het Groene woordenboek - Handwoordenboek Nederlands[152] Turkmen 50,000 Türkmen diliniň düşündirişli sözlügi Turkmen Explanatory Dictionary.[153] Lakota (Lakȟóta) 44,000 New Lakota Dictionary A bilingual Lakota-English and English-Lakota dictionary, book format (editions: 2008, 2011, 2022), mobile app version (2014), desktop version (2015). Azerbaijani 44,750 Azərbaycan dilinin izahlı lüğəti Azerbaijani Explanatory Dictionary.[154] Syriac 43,030 Sureth dictionary Published by the Association Assyrophile de France, it features Assyrian Neo-Aramaic, Turoyo and Chaldean Neo-Aramaic words of all dialects.[155] Icelandic 43,000 560,000 Orðabók Háskólans 43,000 basic words and 519,000 compound words of which more than half are attested only once or don't get into print ("instant combinations").[156] Thai 40,840 พจนานุกรม ฉบับราชบัณฑิตยสถาน พ.ศ. ๒๕๕๔ Bashkir 40,000 Башкирско-русский словарь Ураксин З.Г. Bashkir-Russian dictionary Uraksin Z. G.[157] Chuvash 40,000 Чувашско-русский словарь Скворцова М. И. Chuvash-Russian dictionary Skvortsova M. I.[158] Dargwa 40,000 Даргинско-русский словарь Юсупова Х. А. Dargwa-Russian dictionary of Yusupov H. A.[159] Riksmål 40,000 Riksmålsordlisten[160] Normative dictionary of the non-official Norwegian written language called Riksmål.[161] Arabic 40,000 Taj al-Lugha wa Sihah al-Arabiyya[notes 4] The dictionary includes around 40,000 entries.[139] Classical Latin 39,589 Oxford Latin Dictionary Includes 39,589 Classical Latin entries, including borrowings from Greek, Gaulish, other Italic dialects, Sanskrit, and others. There are about: 10,000 adjectives, 2,123 adverbs, 46 conjunctions, 77 interjections, 17,450 nouns, 26 particles, 39 prepositions, 17 pronouns, and 5,986 verbs. The remaining entries are references to other entries (such as alternate spellings or archaic versions), prefixes, suffixes, and terms left untranslated by the original editors.[162] Avar 36,000 Аварско-русский словарь Гимбатова. М. М.[163] Avar-Russian dictionary of M. M. Gimbatov.[163] Venetian 36,000 Dizionario della lingua veneta[164] Dictionary of Venetian Language[165] of Gianfranco Cavallin.[166] Komi 32,659 Большой коми-русский словарь[167] A reverse Russian-Komi dictionary contains 50722 words.[168] Lezgi 28,000 Лезгинско-русский словарь: Б. Б Талибов, М. М. Гаджиев[169] Lezgi-Russian dictionary: B. B Talibov, M. M. Gadzhiev.[169] Middle Dutch 25,000 Het Vroegmiddelnederlands Woordenboek[170] Historical dictionary of Middle Dutch: 1200–1300.[171] Old Swedish 22,894 Ordbok öfver svenska medeltidsspråket[172] The dictionary includes around 22,894 headwords in 3 volumes,[173] and with the 2 volume supplement (21,495 headwords) the dictionary includes 44,389 headwords.[174] Chechen 20,000 Чеченско-русский словарь. Алироев, И.А.; Хамидова, З.Х.; Алексеев, М.Е.[175] Chechen-Russian dictionary. I.A Aliroev., Z.Kh. Khamidova., M.E. Alekseev.[175] Kabardian 20,000 Кабардинско-русский словарь. М. Л. Апажев, Н. А. Багов[176] Kabardian-Russian dictionary. M. L. Apazhev, N. A. Bagov.[176] Quechua 20,000 Diccionario Quechua-Español Lira Jorge Quechua-Spanish dictionary Lira Jorge.[177] Swedish 20,000 Svensk etymologisk ordbok[178] The dictionary includes around 20,000 headwords.[179] Esperanto 16,780 Plena Ilustrita Vortaro de Esperanto (Complete Illustrated Dictionary of Esperanto) 46,890 lexical units.[180] Malayalam 16,000 Chathur Dravida Bhasha Nighandu (Dictionary of Four Dravidian Languages)[181] Malayalam, Kannada , Tamil and Telugu multilingual dictionary published by Kerala Bhasha Institute and authored by Njattyela Sreedharan.[182] The dictionary includes around 16,000 headwords.[183] Ingush 11,142 Ингушско-русский словарь. М. С. Мургустов.[184] Ingush-Russian dictionary by M.S. Murgustov.[184] Nahuatl 10,500 Tlahtolxitlauhcayotl: Chicontepec, Veracruz Huasteca Nahuatl monolingual dictionary with 10,500 entries of which 360 are loanwords, co-authored by John Joseph Sullivan.[185] Old Dutch 9,000 Het Oudnederlands Woordenboek[186][171] Flemish-only words 1,000 Het Gele Boekje[187][188] Toki Pona 137 Toki Pona: The Language of Good[189] A later dictionary added 17 words, bringing the total number of official words to 137. Subsequently, the Lipu Linku dictionary has used the Toki Pona language survey to identify 260 words in use in Toki Pona, though not all are frequently used.[190][191][192]