16 Surprising Language Facts That Sound Fake (But Are Real)
TABLE OF CONTENTS
Some of these are so strange you’ll think we made them up. We didn’t. Every single one is backed by linguistic research.
Vocabulary
1. “Run” has 645 meanings — and it took one man 9 months to write them all.
The Oxford English Dictionary’s longest entry isn’t for “set” anymore. Lexicographer Peter Gilliver spent nine months compiling the verb “run” — 645 distinct senses driven by machines (programs run), liquids (rivers run), fabrics (colors run), and dozens more domains. The entry for “set” has around 430 meanings and now sits in third place, behind both “run” and “put.”
2. The world’s longest word takes over 3 hours to say.
The answer depends on what you count as a “word.” If you allow chemical nomenclature, the protein titin runs 189,819 letters. Guinness World Records gives the crown to a 195-character Sanskrit compound from a 16th-century poem — 428 letters when transliterated. And German, as an agglutinative language, can stack nouns together without limit, meaning there is no longest German word at all.
3. Arabic does NOT have 400 words for “camel.” Neither do the Inuit have 100 words for “snow.”
Both claims are the same linguistic myth dressed in different clothes. The “400 camel words” figure — repeated by encyclopedias and textbooks for decades — counts every descriptive phrase, dialect variant, and derived form as a separate “word.” By that logic, English has hundreds of “words” for water (liquid, moisture, dew, condensation, precipitation, rainwater, seawater…). Both myths were debunked by linguists as “motivated more by ethnocentric ignorance than by scholarship.”
4. More than 20 English words are their own opposites.
They’re called contronyms (or Janus words, after the two-faced Roman god). Sanction can mean both “approve” and “penalize.” Cleave can mean “split apart” and “cling together.” Dust can mean “remove dust from” and “sprinkle dust onto.” Oversight can mean “careful supervision” and “a careless mistake.” Off can mean “activated” (the alarm went off) and “deactivated” (turn it off). Context does all the work.
Words are strange enough. But grammar — the invisible rules we follow without thinking — gets even stranger.
Grammar That Breaks Your Brain
5. There is a language with no numbers — not even “one” and “two.”
Pirahã, spoken by a few hundred people in the Brazilian Amazon, has no words for exact quantities. Linguist Daniel Everett documented only three relative terms: hói (a small amount), hoí (a larger amount), and baágiso (many). When MIT researchers tested this in 2008 by showing objects in descending order (10 → 1), speakers used hói — supposedly “one” — for quantities as high as six. They aren’t counting. They’re estimating.
6. In some languages, you can’t say a single sentence without revealing how you know it.
About a quarter of the world’s languages have obligatory evidentiality — a grammatical system that forces speakers to mark the source of their information. In Tariana, spoken in the Amazon, “José played football” requires one of five verb suffixes: -ka (I saw it), -mahka (I heard it), -nihka (I infer it from evidence), -sika (I assume from general knowledge), or -pidaka (someone told me). Without one, the sentence is grammatically incomplete — and using the wrong one is considered dishonest.
7. Mandarin Chinese has no past tense. Or future tense. Or any tense at all.
Chinese verbs never conjugate for time. Instead, time is expressed through context words (昨天 “yesterday,” 明天 “tomorrow”), aspect markers (了 le for completed actions), and common sense. The sentence “我去” (wǒ qù) can mean “I go,” “I went,” or “I will go” depending entirely on when you say it and what surrounds it. Several Mayan languages and West Greenlandic work the same way — proving tense is optional, not universal.
8. Japanese has almost no swear words — and that makes it more brutal, not less.
Japanese lacks the kind of “forbidden words” that shock in English. Kuso (“damn/crap”) and baka (“idiot”) are so mild they appear uncensored in children’s anime. But Japanese offends differently — through pronoun choice. There are over a dozen ways to say “you,” and picking the wrong one is devastating. Temē doesn’t mean anything literally obscene, but using it is the social equivalent of calling someone a piece of garbage. As one linguist put it: some languages invented the sledgehammer for cursing; Japanese invented the surgical scalpel.
Grammar tells you what to say. But writing and sound determine how it reaches the world — and here too, languages make wildly different choices.
Sound and Symbol
9. The world’s smallest alphabet has only 12 letters.
Rotokas, spoken by about 4,300 people on Bougainville Island in Papua New Guinea, gets by with just A, E, G, I, K, O, P, R, S, T, U, V. That’s 5 vowels and effectively only 6 consonant sounds. Compare that to English (26 letters) or Khmer (74 letters) — the gap is staggering.
10. Click consonants exist naturally in only one place on Earth — Africa.
That “tsk-tsk” sound you make to show disapproval? It’s a legitimate speech sound in the Khoisan languages of southern Africa, where up to 70% of words begin with a click. Some languages in this family have over 100 consonants thanks to combinations of five basic click types (dental, lateral, alveolar, palatal, bilabial) with different voice and airflow patterns. Xhosa and Zulu borrowed clicks through contact — but outside Africa, not a single natural language uses them.
11. Tonal languages are not the majority — they’re about 42% of the world’s languages.
The often-quoted claim that “60–70% of languages are tonal” is wrong. The ThoT Database (Maslinsky & Vydrin, 2025) analyzed 7,674 languages and found ~42.8% are tonal. The old figure came from biased sampling. Still, that’s over 3,000 languages where pitch changes the meaning of a word — in Mandarin, mā (妈, mother) and mǎ (马, horse) differ only in tone.
12. Chinese is the oldest writing system still in use — by a lot.
Chinese characters trace back to oracle bone inscriptions from around 1200 BCE, making the writing system over 3,200 years old. Egyptian hieroglyphs are older but extinct. Sumerian cuneiform is older but extinct. Chinese characters evolved, simplified, and spread — but a modern reader can still recognize forms their ancestors carved into turtle shells three millennia ago. Greek has the longest continuous written-and-spoken tradition (Linear B from ~1700 BCE), but its current alphabet is “only” from the 8th century BCE.
Some languages have written records stretching back millennia. Others are vanishing before they’re ever written down.
Lost and Found
13. Basque has no known relatives. None. It’s the last of its kind.
Basque (Euskara) is the sole surviving pre-Indo-European language in Western Europe. It was spoken in the Pyrenees before the ancestors of English, Spanish, French, and Hindi arrived. Every attempt to link it to another language family — Iberian, Berber, Caucasian — has failed. The oldest Basque inscription, the Hand of Irulegi (1st century BCE), reads sorioneku — modern Basque zorioneko, meaning “fortunate.” The language survived Roman conquest, centuries of pressure from Spanish and French, and Franco’s explicit ban. About 700,000 people speak it today. Like Welsh, another ancient European language that survived against the odds, Basque is now experiencing a cultural revival.
14. Esperanto has native speakers — about 1,000 of them.
People born into Esperanto-speaking families are called denaskuloj. The first was Emilia Burillo, born in Spain in 1904. Today, roughly 1,000 to 2,000 people worldwide learned Esperanto from birth — usually in international families where parents met through the Esperanto movement. They’re always at least bilingual (no country has Esperanto as an official language), and some families have carried the tradition for four generations. It’s the only constructed language to develop a native speaker community.
15. A language dies every two weeks.
UNESCO estimates there are about 8,300 languages in the world. Around 40% — over 3,000 — are endangered. Since 1950, at least 230 languages have gone extinct, and the current rate is roughly one every 14 days. By 2100, linguists project that 50% to 90% of all languages could vanish, each one taking with it an irreplaceable way of seeing the world.
16. The most translated document in history isn’t the Bible — it’s only 1,800 words long.
The Universal Declaration of Human Rights holds the Guinness World Record for the most translated document, available in over 525 languages. Adopted by the UN in 1948, it contains 30 articles in about 1,800 words. The Bible has been translated in full into more languages (700+), but the UDHR wins as a single, complete document available in the widest range of languages — from Abkhaz to Zulu.
The UDHR now exists in over 525 languages thanks to translators working across language barriers. OpenL supports 100+ of them today.
More language quirks: 12 untranslatable words that will change how you see the world
Sources
- OED — “Run” as most complex word — New York Times interview with OED editor
- Longest words — Wikipedia — Cross-language comparison of longest words
- Pirahã number experiment — MIT-led team finds language without numbers (2008)
- Evidentiality — The Grammar of Knowledge — Aikhenvald & Dixon, Oxford University Press
- Japanese swearing — Language Log — Discussion of Japanese profanity mechanisms
- Rotokas language — Wikipedia — Smallest modern alphabet
- Click languages — Britannica — Overview of click consonants
- ThoT Database — Tonal status of world languages (Maslinsky & Vydrin, 2025)
- Basque language — BBC Travel — The mysterious origins of Europe’s oldest language
- Native Esperanto speakers — Ethnologue — Denaskuloj demographics
- Endangered languages — UNESCO — UNESCO Bangkok 2025 report
- Most translated document — Guinness World Records — UDHR record
- Contronyms — MIT — Seth Teller’s auto-antonym collection
- Color terms — The Conversation — Why languages differ in color vocabulary


