Skip to content
Home » Most Spoken Languages » 🇮🇳 Santali #77 Most Spoken Language (23M speakers)

🇮🇳 Santali #77 Most Spoken Language (23M speakers)

Santali — Munda language, Ol Chiki script, and strongly head-final grammar

Austroasiatic • Munda • Ol Chiki • SOV • Strongly Suffixing

Santali is the largest language in the Munda branch of the Austroasiatic family. It is spoken mainly in eastern India, with speaker communities in Bangladesh and Nepal as well. In English-language scholarship, the form Santali is common, while many Indian government publications also use Santhali. Both spellings refer to the same language.

Number of Speakers
7,368,192 speakers in India in the 2011 Census. In everyday summaries, Santali is usually described as a language with more than seven million speakers overall.
India Census 2011Largest Munda LanguageSouth Asia
Family / Branch
Austroasiatic → Munda → North Munda → Kherwarian → Santali. It is the best-known and most widely used member of this branch.
MundaKherwarian
Writing System
Ol Chiki is the script most closely tied to Santali. It was created in 1925 by Pandit Raghunath Murmu. Santali has also been written in Bengali, Odia, Devanagari, Roman, and other regional scripts.
Dedicated Script1925Multiscript History
Official Standing
Santali entered India’s Eighth Schedule through the 92nd Constitutional Amendment Act, 2003. It also has state-level official use in places such as Jharkhand and West Bengal.
Scheduled LanguageIndiaPublic Use
Word Order
SOV is the default order. Santali also patterns as Adjective–Noun, Genitive–Noun, and Postpositions, which gives it a strongly head-final profile.
SOVPostpositionsHead-Final
Language Codes
ISO 639-3: sat • Glottocode: sant1410 • Script code: Olck for Ol Chiki.
ISOGlottologScript Code
Where Santali Is Spoken

The main Santali belt runs across Jharkhand, West Bengal, Odisha, and Bihar, with smaller communities in Assam and beyond. Census and language-atlas material place the strongest concentrations in the eastern and north-eastern part of the Chota Nagpur region and nearby districts.

Why Ol Chiki Matters

Older borrowed scripts could write Santali, but they were not built for its own sound system. Ol Chiki changed that. It gave Santali a script designed for the language itself, which helped printed books, school materials, spelling practice, and later native-script digital publishing.

Sound System
  • Ol Chiki uses 30 letters: 6 vowels and 24 consonants in its core letter inventory.
  • Santali dialects do not all sound the same. Southern varieties are often described with 6 vowel phonemes, while northern varieties may show 8 or 9 vowel contrasts.
  • Descriptions of Santali also note sound patterns such as nasalization, checked final stops, and vowel behavior that is more detailed than many short encyclopedia entries show.
Grammar Snapshot
  • Strongly suffixing: grammatical marking is mostly added after the stem.
  • Person and number marking: verbs can show rich agreement behavior.
  • Inclusive vs Exclusive “We”: Santali separates “we including you” from “we excluding you”.
  • Three-number logic in many descriptions: singular, dual, and plural play a visible role.
Dialect Pattern

A common broad division is Northern Santali versus Southern Santali. The split shows up in pronunciation, some vocabulary choices, and parts of morphology. This matters for teaching, spelling, and language technology because one script has to serve more than one spoken pattern.

Language Profile in Plain Terms

Santali is not just “a tribal language from India,” which is where many short pages stop. It is a major literary and public language with its own script, a large speaker base, a clear place in the Austroasiatic family, and a grammar that looks very different from English.

The language is usually described as head-final. That means the verb tends to come after the object, and relational words behave like postpositions rather than prepositions. Possessors and adjectives typically come before the noun. For readers used to English, that alone already marks a different grammatical rhythm.

Santali also stands out because it kept features that are very useful for linguistic study. Short reference pages often mention the script and speaker count, yet skip the richer structure: number distinctions, inclusive and exclusive first person, suffix-heavy morphology, and dialect-level sound differences. Those details matter because they shape how the language is taught, typed, translated, and standardized.

Writing, Literacy, and Digital Use

Santali has a layered writing history. For a long time, writers used scripts borrowed from nearby major languages, especially Bengali, Odia, and Devanagari, while Roman spelling also appeared in some older texts and missionary work. That mixed script history still matters today because readers may encounter Santali in more than one writing tradition.

Ol Chiki gave the language a stable written center. In print and education, that helped spelling become more consistent. In digital spaces, Unicode support made native-script text easier to store, search, publish, and share. The Ol Chiki block in Unicode is U+1C50–U+1C7F, which is one of the reasons Santali is now easier to handle in fonts, keyboards, websites, and software than it was in the early web era.

Language Milestones
YearWhat HappenedWhy It Matters
1925Pandit Raghunath Murmu developed Ol Chiki.Santali gained a script built for its own sound structure.
2003 / 2004Santali entered the Eighth Schedule through the 92nd Amendment Act.This raised its standing in education, public communication, and formal language policy.
25 Dec 2025The Constitution of India was released in Santhali for the first time, written in Ol Chiki.This was a major step in native-language public access.
16 Feb 2026India formally marked 100 years of Ol Chiki with centenary events, a commemorative coin, and a postage stamp.The script moved further into national visibility and cultural institutions.

What Many Short Articles Miss
  • Santali and Santhali are the same language. The spelling changes by source, not by language identity.
  • Ol Chiki did not erase older scripts overnight. Santali still has a multiscript history, and readers may meet more than one writing tradition.
  • The language is not only culturally important. It is also technically interesting for linguistics because of its head-final syntax, suffix-heavy structure, number distinctions, and dialect-based vowel variation.

Where Is Santali Spoken?

Santali is spoken mainly in eastern India, especially in Jharkhand, West Bengal, Odisha, and Bihar. Smaller speaker groups also live in Assam, Bangladesh, and Nepal. The language is strongly linked with the Santal people, though language use can extend beyond a single census label in mixed-language districts.

Is Santali an Official Language?

Yes. Santali is one of the 22 scheduled languages of India. It entered the Eighth Schedule through the 92nd Amendment Act. That change mattered because it opened more room for public use, school material, literary growth, examinations, and language-related institutional support. At the state level, Santali also has official use in places such as Jharkhand and West Bengal.

What Script Does Santali Use?

The script most closely tied to Santali today is Ol Chiki. It was created for Santali rather than borrowed from another language. That is why it holds a special place in literature, schooling, and identity. Even so, Santali has also been written in Bengali, Odia, Devanagari, and Roman script, so older books and regional materials may look very different from modern Ol Chiki texts.

Is Santali the Same as Santhali?

Yes. They are two spellings of the same language. Santali is common in linguistic databases and international language coding. Santhali appears often in Indian public documents and press material. A reader should treat them as equivalent unless a source is discussing spelling practice itself.

How Many People Speak Santali?

The most cited hard figure is the 2011 Census of India, which recorded 7,368,192 Santali speakers in India. Since India holds by far the largest speaker base, that figure is central in current language profiles. When Bangladesh and Nepal are included, writers often describe Santali as a language spoken by well over seven million people.

What Makes Santali Important for Linguistics?

Santali is useful for more than language census lists. It gives linguists a strong example of a Munda language with a large living speaker base, a dedicated script, and a grammar that differs clearly from Indo-Aryan and English patterns. Its SOV syntax, postpositions, suffixing morphology, inclusive/exclusive distinction, and dialect-level sound variation make it one of the most closely watched indigenous languages of South Asia.

Recent Public Developments

Santali has moved further into visible public life in the last few years. Two developments stand out.

  • In December 2025, the Constitution of India was published in Santhali for the first time, using Ol Chiki.
  • In February 2026, India marked the centenary of the script with official celebrations, a commemorative coin, and a postage stamp.

These are not small symbolic moments. They show that Santali is active in both cultural memory and present-day public institutions. For readers, teachers, and students, that means the language is easier to find in formal print, easier to discuss in policy spaces, and more visible in the digital era than many older profiles suggest.

santali