An empirical study of meaning-composition in Arabic. Built on six decades of letter-semantics scholarship, extended with new structural findings: a harmonized 28-letter table with an empirically-confirmed universal dual-face structure; a full catalog of 453 binary nuclei; and a 12-mode operative grammar that grades 2,285 trilateral roots at 100% native fit.
دراسةٌ تَجريبيّةٌ في تركيب المعنى داخل العربيّة. تنطلق من ستّة عقودٍ من العمل في دلالات الحروف، وتُضيف إليها نتائج بنيويّة جديدة: **جدولٌ مُوَحَّدٌ لِـ ٢٨ حرفًا** ببنيةٍ ازدواجيّةٍ كونيّةٍ مُؤكَّدةٍ إمبيريًّا، و**كتالوجٌ كاملٌ لِـ ٤٥٣ نواةً ثنائيّة**، و**نحوٌ تشغيليٌّ من ١٢ نمطًا** يُصنِّف **٢,٢٨٥ جذرًا ثلاثيًّا بمطابقةٍ أصليّةٍ كاملة**.
Conducted under تَمَسُّك للبحوث والنشر والتدريب · Temessek for Research, Publishing & Training تُجرى تحت تَمَسُّك للبحوث والنشر والتدريب
Arabic letter semantics, the claim that each of the 28 Arabic letters carries a physical-articulatory charge and that meaning composes upward from letter to nucleus to root to word, was advanced independently by seven Arabic-speaking scholars over six decades. This study builds on that scholarly foundation (Jabal's lexicon is the empirical anchor) and adds new structural findings of its own: the universal dual-face structure of letter charges, the 12-mode operative grammar of trilateral composition, the 4-bar verification rubric for cross-linguistic cognates, and a principled scope statement that names what the framework does and does not yet do.
تَنبَني دلالاتُ الحروف العربيّة على فرضيّةٍ مُتقادِمَة: لِكلّ حرفٍ من حروف العربيّة الـ ٢٨ شحنةٌ نطقيّة-فيزيائيّة، والمعنى يَتركّبُ صُعودًا من الحرف إلى النواة إلى الجذر إلى الكلمة. عَمِل على هذه الفرضيّة سبعةُ علماءٍ مستقلّين عربيّ-اللسان على مدى ستّة عقود. تَنطلِق هذه الدراسة من هذا الإرث (ومعجم جبل هو أرضيّتها الإمبيريّة) وتُضيف إليه نتائج بنيويّة أصليّة: بنية الازدواجيّة الكونيّة لشحنات الحروف، النحو التشغيليّ من ١٢ نمطًا لتركيب الجذور الثلاثيّة، المعيار الرباعيّ للتحقّق من المقاربات بين اللغات، وبَيانٌ صادقٌ للنطاق يُحدِّد ما يَفعَلُه الإطار وما لا يَفعَلُه بعد.
One consensus charge per letter, picked from seven competing traditions, with bilingual phonetic notes and English glosses for every entry.
شحنةٌ إجماعيّةٌ واحدة لِكلّ حرف، مُختارةٌ بترجيحٍ بين سبع نظريّات، مع ملاحظةٍ نطقيّةٍ وترجمةٍ إنجليزيّةٍ لِكلّ صفّ.
Empirically confirmed across all 453 binary nuclei: every letter carries two semantic faces of the same physical articulation, and the trilateral family selects which face is active.
مُؤكَّدةٌ إمبيريًّا على ٤٥٣ نواةً ثنائيّة: كلّ حرفٍ يَحمِل وجهَين دلاليَّين للنطق الفيزيائيّ نفسه، والحرف الثالث في الجذر يَختار الوجهَ الفعّال.
Every Arabic trilateral root is readable as (binary nucleus) does X to (third letter as material). 2,285 roots graded at 100% native fit across 12 modes.
كلّ جذرٍ ثلاثيّ عربيّ مقروءٌ بِصياغةٍ تشغيليّةٍ واحدة: (النواة الثنائيّة) تَفعَل X في (الحرف الثالث كمادّة). ٢,٢٨٥ جذرًا بمطابقةٍ أصليّةٍ كاملةٍ عبر ١٢ نمطًا.
A 4-bar verification rubric for Quranic-anchored cognates across language families, with 8 Tier-A pairs confirmed.
معيارٌ رباعيّ للتحقّق من المقاربات قرآنيّ المركز بين العائلات اللغويّة، مع ٨ أزواج مُؤكَّدة في المستوى A.
Every nucleus, every cognate, every claim is line-cited and tested. The study maintains a public empirical record.
كلّ نواةٍ وكلّ مقاربةٍ وكلّ ادّعاءٍ مَشفوعٌ بِسطرٍ مَرجِعيّ ومُختبَر. الدراسة تَحتفظ بسجلٍّ تجريبيٍّ عَلَنيّ.
The seven prior scholars built the foundation. Over a year of sustained testing, this study put their combined record through a uniform empirical examination, running each claim across Jabal's lexicon together with the corroborating readings of the other six. The five results that follow are new to the field and emerged from that combination, not from any of the source frameworks taken alone:
العلماء السبعة قبلنا بَنَوا الأرضيّة. لم تَأخذها هذه الدراسة تَسليمًا، بل أَجرَت عليها فحصًا إمبيريًّا موحَّدًا امتدّ سنةً كاملة، اختَبَرَت فيه كلّ ادّعاءٍ على معجم جبل مَع القراءات المُعَضِّدة عند الستّة الآخَرين معًا. النتائج الخمس التالية جديدةٌ على الحقل، وتَولَّدَت من هذا التَّركيب لا من أيّ إطارٍ من الأُطُر السبعة منفردًا:
Every Arabic word is built by composing letter charges from the bottom up. The first two consonants form a binary nucleus that carries the seed-meaning; the third letter modifies that seed; the resulting trilateral root takes one of many derived forms that inherit the root's conceptual centre. The framework's job is to make this compositional structure explicit, testable, and consistent.
كلّ كلمةٍ عربيّةٍ مَبنيّةٌ بتركيب شحنات الحروف من الأسفل إلى الأعلى. الحرفان الأوّلان يُكوّنان نواةً ثنائيّة تَحمِلُ بَذرةَ المعنى؛ الحرف الثالث يُعدِّل تلك البَذرة؛ والجذر الثلاثيّ الناتج يَتّخذ صيغًا مُتعدّدة تَرِث مركزَهُ المفهوميّ. مهمّة الإطار: جعلُ هذه البنية التركيبيّة صريحةً، قابلةً للاختبار، ومتّسقةً.
Empirical corpus underneath: Jabal's 2,300 unique roots · 1,666 Quranic verses · 7 scholarly sources frozen in Data raw/.
Empirical testing across all 453 binary nuclei in Jabal's lexicon revealed a structural property that earlier scholarship had only suspected for a handful of letters: every Arabic letter carries two semantic faces of the same physical articulation. The trilateral family selects which face is active. This is one of the study's principal new findings.
كَشَف الاختبارُ الإمبيريّ على جميع الـ ٤٥٣ نواة ثنائيّة في معجم جبل خاصّيّةً بنيويّةً لم يَستَكشِفها مَن سَبَق إلا في حروفٍ معدودة: كلّ حرفٍ عربيّ يَحمِل وجهَين دلاليَّين لِنطقٍ فيزيائيّ واحد. والحرف الثالث في الجذر يَختار الوجهَ الفعّال. هذه إحدى النتائج الأصليّة الرئيسة للدراسة.
Primary «انبِعاث رَنينيّ», outward resonant emission (نج, نف, نو).
Variant «رَنين مَحبوس», held cavity resonance (ثن, تن, طن). Same nasal articulation, two phases.
Primary «تَجَمُّع في حَيِّز», gathering in a space.
Variant «بُروز من حَيِّز», surfacing out of a space (جَهْرًا, الْجِنّ). Same mid-palate burst, two semantic directions.
Primary «همس + تَنَفُّس», soft passing presence.
Variant «نَفَسٌ يَحمِل ويَتَلاشى», breath that carries off, dissipating. Same pure exhale, two roles.
Primary «اتّصال + تَمَسُّك», attachment, holding.
Variants «بَوّابة تُظهر» (gateway showing) · «انطِباق + إغلاق» (sealed enclosure). Same lip closure, three faces.
The bottom-up path from letter to binary to trilateral is grounded by the study's second principal finding. Every Arabic trilateral root in Jabal's lexicon is interpretable as a single operative relation:
المسار الصاعد من الحرف إلى النواة إلى الجذر يَستقرّ بِفَضل الإسهام الأصليّ الثاني لِلدراسة. كلّ جذرٍ عربيّ ثلاثيّ في معجم جبل قابلٌ لِلقراءة كَعلاقةٍ تشغيليّةٍ واحدة:
…where X is one of 12 relationship modes in a finite operative grammar. 2,285 of 2,288 trilaterals graded (99.87%), 100% find a coherent mode without forcing. See lv2-operative-grammar.md for the full methodology and the interactive dashboard in the computational sister project.
The binary nucleus is the agent; the third letter is the material the nucleus acts upon. Under this reading, قصر reads cleanly as "قص-cutting-precision BLOCKS the flow (ر)" — a single coherent operation, like every other native trilateral in the corpus.
| Stance | Mode | Meaning | Example | % of corpus |
|---|---|---|---|---|
| POSITIVE 55.2% | CARRY | binary carries L3-material forward | ورد, و-bind carries د-fixity → arriving | 6.4% |
| HOLD | binary contains/encloses L3 | بلد, بل-wet holds د-fixity → ground | 16.3% | |
| RELEASE | binary lets L3-material burst out | فتح, فت-loose releases ح-warmth → opening | 9.1% | |
| PROJECT | binary extends L3 outward as feature | حدد, ح-edge projects د-fixity → boundary | 11.7% | |
| INTENSIFY | doubled or amplified L3 | جمد, ج-gather amplifies د-fixity → freezing | 11.7% | |
| NEGATIVE 11.5% | BLOCK | binary stops/seals/walls off L3 | سدد, س-flow blocked → sealing, stopping | 5.8% |
| DRAIN | binary gradually depletes L3 | فقد, فق-void drains د-fixity → losing | 5.7% | |
| TRANSFORM 33.3% | CHANNEL | binary directs/shapes L3's path | جرى, جر-pull channels ى-flow → running | 5.3% |
| OPERATE | binary modifies L3 via squeeze/fold/cut | حسم, حس-feel operates on م-mass → cutting | 24.4% | |
| MIX | blends the binary with L3 | روب, رو-going mixes ب-attach → curdled milk | 0.6% | |
| REVERT | circles back / oscillates L3 | ردد, ر-flow reverts د-fixity → returning | 2.9% | |
| EXCEPTION | LOANWORD | non-native composition | 1 / 2,285 in current dataset | 0.0% |
| Test | Predicted pattern | Observed |
|---|---|---|
| Doubled-final roots (XYY: بدد · ردد · جدد) | INTENSIFY + OPERATE dominate | 66.9% of 405 roots ✓ |
| Redoubled XY-XY roots (دردر · جمجم) | OPERATE + INTENSIFY dominate | 66.9% of 251 roots ✓ |
| Letters with flow charge (ر · ل · ن · ع) | Skew positive (CARRY · CHANNEL · PROJECT · RELEASE) | Confirmed ✓ |
| Letters with mass charge (د · م · ق · ك · ص) | OPERATE + HOLD dominate | Confirmed ✓ |
A reader seeing "100% native fit" and "453 binary nuclei" will reasonably ask: what about the other pairs? With 28 letters, there are 28² = 784 mathematical possibilities for an L1·L2 binary. Our catalog has 453. Where did the other 331 go? They are filtered, not missing:
| Filter | Pairs removed | Linguistic reason |
|---|---|---|
| alif-initial (ا as L1) | 28 | ا is a vowel marker, it cannot begin a root in Arabic |
| identical XX (e.g. بب, دد, …) | 25 | OCP (Obligatory Contour Principle), repeated consonants blocked at the root start |
| same-articulator class | 107 | Soft OCP, tongue-tip-twice (e.g. تد), lips-twice (e.g. بم) and similar adjacencies dispreferred |
| Genuine lexical gaps | 155 | Phonologically allowed by Arabic but unused, the language could form them, but doesn't |
160 pairs (58%) of the unattested set are blocked by known Arabic phonological constraints, alif as a vowel, OCP, articulator-class. These are not gaps; Arabic structurally cannot have them.
Only 155 pairs (20% of all 784) are pairs the language could have used but doesn't, the natural held-out test set for whether the operative model can become generative.
Every trilateral that exists in Arabic gets a coherent operative reading. We did not claim every theoretical XYZ would, and we name where the model's predictive reach is currently untested.
The reference tables below are the study's canonical outputs. Each is line-cited against primary sources, derived from the empirical evidence in Jabal's lexicon and the published Tafsir Coran corpus, and open to revision under the study's stability discipline.
الجداول أدناه هي المُخرَجات الحاكمة لِلدراسة. كلّ صفٍّ مَشفوعٌ بإحالةٍ سطريّةٍ إلى المصادر الأوّليّة، مُستخرَجٌ من الشواهد الإمبيريّة في معجم جبل وفي متن تفسير قرآن المنشور، ومَفتوحٌ لِلمراجعة تحت انضباط الاستقرار في الدراسة.
One harmonized charge per letter, with phonetic note and English gloss. See consensus-letter-charges.md for the full file with variants section.
| حرف | Consensus charge | Phonetic note | English gloss |
|---|---|---|---|
| ء | تَأكيد + قَطع | sharp glottal onset, breath stopped at the throat | sharp affirmation, beginning |
| ا | امتداد صائت | breath standing open, prolonged | extension, sustained presence |
| ب | اتّصال + تَمَسُّك | lips press and release, closing-then-opening seal | attachment, holding, gateway |
| ت | امتداد لطيف + تَمام | tongue tip touches upper teeth, light percussion | gentle extension to completion |
| ث | تَناثُر + تَفَتُّت | tongue between teeth, fine breathy spread | scattering, small-grained spread |
| ج | تَجَمُّع في حَيِّز | mid-palate stop with an emerging burst | gathering in a space, surfacing |
| ح | حياةٌ حارّة + احتواء | deep throat friction, warm breath enclosed | living warmth that contains |
| خ | تَخَلخُل + اختِراق | scratchy back-of-throat friction | rarefying, piercing through |
| د | ضَبط + ثَبات | tongue behind teeth, firm closed nudge | fixity, grounding, settled push |
| ذ | نَفاذ + حِدَّة | tongue between teeth with voice | piercing from inside, fine bringing-out |
| ر | تَكرار + جَريان | tongue tip rolls against alveolar ridge | repetition, running, flow |
| ز | اندفاع + اهتزاز | sharp voiced sibilant from the front | thrust with vibration, sharp penetration |
| س | امتداد + سَيَلان | breathy sibilant gliding forward | extending flow, fine streaming |
| ش | انتشار + تَفَرُّق | wide breathy spread from the front of the mouth | branching, scattering outward |
| ص | صَلابة + إطباق | emphatic sibilant from a tight palate | hardness, sealed strength |
| ض | ضَمّ + ثِقَل | emphatic voiced lateral, tongue presses heavy | drawing-together with weight |
| ط | انبساط + انتشار ثَقيل | emphatic dental, heavy spreading touch | broad heavy spreading |
| ظ | بُروز + حِدّة | emphatic interdental | pronounced surfacing, edged prominence |
| ع | قَبضٌ حَلقيٌّ + شِدّة | deep guttural pharyngeal grip | depth-grip, intense from the throat |
| غ | غَور يَستر | back-of-throat fricative | concealing depth, covering |
| ف | تَفريق + نَفاذ | breath escapes between lip and upper teeth | parting through, splitting open |
| ق | قَطع + إحكام | back-of-palate stop, firm release | cutting precision, firm settling |
| ك | كَتم + قَطع | unvoiced back-of-palate stop | pressed closure, sealed cut |
| ل | تَعَلُّق + امتِداد | tongue against alveolar ridge, voiced glide | attachment that extends, a bridge |
| م | تَجمُّع + تَلاصُق | lips closing, breath gathered | gathering, sealing into a mass |
| ن | رَنين + انبِعاث | nasal resonance from inside | inner resonance emitted outward |
| ه | همس + تَنَفُّس | soft exhale from the throat | breath, soft passing |
| و | وَصل + ربط | rounded lips, sustained connector | binding, joining, sustained link |
| ي | امتداد + لِين + سَريان | front of palate, voiced glide | gentle directed extension |
A curated subset of Jabal's 453 binary nuclei, the highest-frequency, Quran-anchored seeds. The full 453 catalog with charge-only and TC-anchored dossiers is in 03-scholar-extracts/jabal-nuclei-*.md.
| Nucleus | Meaning | Examples |
|---|---|---|
| ال | extending affirmation, a bridge | الم، الله، الرحمن |
| بر | manifest extension, surfacing-and-running | بَرَّ، بَرَزَ، بَرَأَ |
| بد | manifest fixing, surface-stays | بَدَأ، بَدَا، بَدَّلَ |
| حم | living containment, warm enclosure | حَمَى، حَمَلَ، حَمِدَ، الرَّحم |
| حق | living precision, contained truth | حَقّ، حَقَّقَ، الحَقّ |
| رب | repeating attachment, holding-and-multiplying | رَبّ، رَبَّى، الرَّبّ |
| رح | repeating warmth, recurring containment | رَحِم، رَحْمَة، الرَّحمن |
| سل | flowing attachment, fine sliding | سَلَّمَ، سَلَكَ، سَلِمَ |
| سم | extending-toward-completion, lifting | سَمَا، اسم، سَموّ |
| عل | depth-grip extending | عَلِمَ، عَلَا، عِلم، عالَم |
| فر | parting-then-running | فَرَّ، فَرَقَ، فَرَدَ |
| فط | parting-then-spreading | فَطَرَ، فِطرة |
| قر | precision repeating, firm settling | قَرَأ، قَرَّ، قَرارَ |
| كل | sealed extension, pressed-attachment | كَلَّ، كُلّ، كَلِمة |
| ملك | gathering-and-sealing-at-a-cut | المَلك |
| نج | inner resonance gathering | نَجا، نَجم |
| هد | breath-fixing | هَدى، هَدَأ |
| وص | binding-with-emphatic-strength | وَصَلَ، وَصَى، وَصْف |
Quranic-anchored pairs that clear the 4-bar rubric: skeleton match · nucleus identity · modern reader-recognition · dual-family confirmation (Semitic + Indo-European). See tafsir-coran-tier-a-cognates.md for the full Tier-B ledger and anti-patterns.
| Arabic root | Cognate | Skeleton | Shared nucleus | Home verse |
|---|---|---|---|---|
| ك-ف-ء | copy / copie / כפל (Heb) | K-F | the second-instance that lines up against the first | 112:4 |
| س-ل-م | shalom (Heb) / shalmā (Aram) | S-L-M | bound completeness, peace-as-wholeness | 97:5 |
| ر-ب-ب | rabbi / rav (Heb) | R-B | the expansive holder, master-by-nurture | 1:2 |
| ج-ن-ن | genie / genus | G-N | born-from-a-hidden-source, kin-bearing concealment | 114:6 |
| ك-ل-م | claim / clamor (Lat. clamare) | K-L-M | utter-with-intent, the spoken claim that lands | 2:75 |
| ق-و-ل | call (PIE *kel-) | K/Q-L | the summon-uttered, the audible-reach | 112:1 |
| ش-ر-ب | sorb / absorb / syrup | S-R-B | the take-in of fluid, absorbing-flow | 2:60 |
| ج-م-ع | Gemini / gemellus | G-M / J-M | gathered-into-pair, together-as-one | 2:148 |
Seven Arabic letter-semantics scholars whose work forms the empirical and theoretical foundation of this study. Jabal's lexicon supplies the canonical dataset (2,300 unique roots, 453 binary nuclei, 1,666 Quranic applications); the other six contribute complementary frameworks. The study takes this foundation as given and extends it with new structural findings (LV0 dual-face, LV1 nucleus catalog, LV2 operative grammar) tested against the same empirical record.
سبعةُ علماءَ في دلالات الحروف العربيّة، يَتشكّل من أعمالهم الأرضيّةُ الإمبيريّة والنظريّةُ لِهذه الدراسة. معجم جبل يُوفِّر القاعدة المُعتمَدة (٢,٣٠٠ جذرٍ فريد، ٤٥٣ نواةً ثنائيّة، ١,٦٦٦ تطبيقًا قرآنيًّا)؛ والستّة الآخرون يُكمِّلون بأُطرٍ نَظَريّة. تَأخذ الدراسة هذه الأرضيّة مُسَلَّمَةً وتُضيف إليها نتائجَ بنيويّةً جديدة (الازدواجيّة الكونيّة لِطبقة LV0، كتالوج النوى لِطبقة LV1، النحو التشغيليّ لِطبقة LV2) مَفحوصةً على السجلّ الإمبيريّ نفسه.
المعجم الاشتقاقي المؤصَّل, the canonical dataset every layer of this study tests against: 2,300 unique roots, 453 binary nuclei, 1,666 Quranic applications. Jabal's compositional model (letter → binary → trilateral) is the structural skeleton; the study's new findings (universal dual-face, 12-mode operative grammar) refine it against his own data.
Explicit continuator of Neili's project. Complete 28-letter table under Qasdiyya. The framework adopts his methodological frame; differs on 8+ letter charges.
Founder of Qasdiyya, the conceptual/referential distinction, the 6 no-metaphor rules, the rejection of Saussurean arbitrariness. The framework's methodological core.
Six sensory categories × three evolutionary mechanisms (هيجانية / إيمائية / إيحائية). The framework absorbs his sensory axis but reads ا/و/ي as active letters, not jawfī blanks.
The reversal/metathesis hypothesis: reversing letter order reverses meaning. Tested empirically across 155 strict reverse pairs in Jabal's corpus, with a 22% inverse hit-rate confirming Anbar's canonical examples while bounding the rule's wider reach.
Not letter semantics, phonetic shift laws for intra-Semitic comparison. Architecturally separated into the cross-linguistic layer to avoid contamination of letter charges.
Pictographic origin theory (e.g., تاء as a cross-shape for تابوت). Explicitly excluded from the consensus table, incompatible with the physical-articulation framework.
Every artifact of the study in one place. The canonical reference tables are in 03-scholar-extracts/; the architecture and position paper are in 02-architecture/; the empirical audits are in 05-audits/; raw scholar sources are frozen in Data raw/.
كلّ ما أنتجته الدراسةُ في موضعٍ واحد. الجداول المرجعيّة في 03-scholar-extracts/؛ المعمار والورقة الموقفيّة في 02-architecture/؛ السجلّ الإمبيريّ في 05-audits/؛ المصادر الأوّليّة لِلعلماء مُجَمَّدةٌ في Data raw/.
The study has closed the bottom-up genealogy from letter to trilateral root. The agenda that follows extends the framework toward statistical validation, additional scholar integration, and Quranic application. Every line below links to the relevant artifact for detail.
أَغلَقَت الدراسةُ المسار الصاعد من الحرف إلى الجذر الثلاثيّ. الخارطة الأماميّة تَدفع الإطارَ نحو التحقّق الإحصائيّ، ودمجٍ إضافيٍّ لِلعلماء، وتطبيقٍ قرآنيّ. كلّ سطرٍ أدناه يُحيل إلى المُلَفّ التفصيليّ.
our-contributions-and-roadmap.md contains the full methodological detail for each line.
Temessek is the institutional home of this study and its sister works in Quranic linguistics. The framework presented on this page (the 28-letter consensus, the 453-nucleus catalog, the 12-mode operative grammar, the cross-linguistic verification rubric) is published, taught, and applied under the Temessek imprint.
تَمَسُّك هي البيتُ المؤسّسيّ لِهذه الدراسة وأعمالها الشقيقة في اللسانيّات القرآنيّة. الإطار المَعروض على هذه الصفحة (جدول الإجماع لِـ ٢٨ حرفًا، كتالوج الـ ٤٥٣ نواة، النحو التشغيليّ من ١٢ نمطًا، معيار التحقّق بين اللغات) يُنشَر ويُدرَّس ويُطَبَّق تحت اسم تَمَسُّك.
Juthoor-Linguistic-GenealogyREADME.md for the layout overview.our-contributions-and-roadmap.md, the shareable position paper.consensus-letter-charges.md for the canonical 28-letter table.jabal-nuclei-extended.md for the full nucleus dossier.README.md لِفهم البنية العامّة.our-contributions-and-roadmap.md، الورقة الموقفيّة القابلة لِلمشاركة.consensus-letter-charges.md لِلجدول الحاكم للحروف الـ ٢٨.jabal-nuclei-extended.md لِلكتالوج الكامل لِلنوى.