Data | Among languages mostly confined to a State, Tamil leads with 1.5 lakh Wikipedia articles Premium
The Hindu
Among languages mostly confined to a State, Tamil dominates with over 1.5 lakh articles
In India, if we consider non-English language Wikipedia, the highest number of articles are available in Urdu, Hindi and Tamil. A non-English language Wikipedia is not a translation of English articles. It is self-sustaining: active users and moderators create and moderate content in their languages. Among languages which are mostly confined to a State, Tamil leads by a wide margin, with 1.6 times more articles than the second-best, Marathi, followed by Malayalam and Telugu.
Understandably, when all the global languages are considered, English leads the list with 66,71,236 articles (Chart 1).
Chart 1 | The chart lists the 320 languages in which Wikipedia articles are available. The bigger the size of the bubble, the more the number of articles.
Charts appear incomplete? Click to remove AMP mode
Interestingly, Cebuano, a regional language spoken widely in the Philippines, has the second-highest number of articles in Wikipedia (61,23,197). The Cebuano entries are written in Latin alphabets. However, news reports show that many entries were made in Cebuano by a bot.
German (around 28.1 lakh), Swedish (25.6 lakh), French (25.3 lakh) and Dutch (21.2 lakh) are the other prominent languages in which a considerable number of Wikipedia articles are maintained. There are relatively few articles in Chinese and Cantonese (13.6 lakh articles and 1.3 lakh, respectively) despite the fact that many more people speak these languages.
Chart 2 | The chart lists the 23 languages spoken in India in which Wikipedia articles are available.