Skip to content

Latest commit

 

History

History
751 lines (695 loc) · 84.6 KB

character-tables-arabic.md

File metadata and controls

751 lines (695 loc) · 84.6 KB

Arabic character tables

This document lists the per-character shaping information needed to shape Arabic text.

Table of Contents

Arabic character table

Arabic glyphs should be classified as in the following table. Codepoints in the Arabic block with no assigned meaning are designated as unassigned in the Unicode category column.

The Joining type column indicates whether each codepoint is defined as joining with adjacent characters on the left side, right side, left and right sides ("DUAL"), or neither side ("NON_JOINING"). Codepoints designated TRANSPARENT in the Joining type column do not join with adjacent characters and, in addition, do not affect the joining behavior of surrounding characters. Non-spacing marks are of type TRANSPARENT. Codepoints designated JOIN_CAUSING force adjacent characters to join.

The Joining group column lists the fundamental letter that the listed codepoint behaves like for joining purposes.

Assigned codepoints with a null in the Joining group column evoke no special behavior from the shaping engine during the join-computation stage.

The Mark class column indicates the Canonical Combining Class for the codepoint. Marks are assigned non-zero combining classes so that sequences of adjacent marks can be reordered as required by the orthography.

For Arabic, a subset of marks in the 220 and 230 classes are also designated Modifier Combining Marks (MCM). These are denoted with 220_MCM and 230_MCM in the Mark class column. The MCM marks are treated differently during the mark-reordering stage.

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+0600 Other NON_JOINING null 0 ؀ Number Sign
U+0601 Other NON_JOINING null 0 ؁ Sign Sanah
U+0602 Other NON_JOINING null 0 ؂ Footnote Marker
U+0603 Other NON_JOINING null 0 ؃ Sign Safha
U+0604 Other NON_JOINING null 0 ؄ Sign Samvat
U+0605 Other NON_JOINING null 0 ؅ Number Mark Above
U+0606 Symbol NON_JOINING null 0 ؆ Cube Root
U+0607 Symbol NON_JOINING null 0 ؇ Fourth Root
U+0608 Symbol NON_JOINING null 0 ؈ Ray
U+0609 Punctuation NON_JOINING null 0 ؉ Per Mille
U+060A Punctuation NON_JOINING null 0 ؊ Per Ten Thousand
U+060B Symbol NON_JOINING null 0 ؋ Afghani Sign
U+060C Punctuation NON_JOINING null 0 ، Comma
U+060D Punctuation NON_JOINING null 0 ؍ Date Separator
U+060E Symbol NON_JOINING null 0 ؎ Poetic Verse Sign
U+060F Symbol NON_JOINING null 0 ؏ Sign Misra
U+0610 Mark [Mn] TRANSPARENT null 230 ؐ Sign Sallallahou Alayhe Wassallam
U+0611 Mark [Mn] TRANSPARENT null 230 ؑ Sign Alayhe Assallam
U+0612 Mark [Mn] TRANSPARENT null 230 ؒ Sign Rahmatullah Alayhe
U+0613 Mark [Mn] TRANSPARENT null 230 ؓ Sign Radi Allahou Anhu
U+0614 Mark [Mn] TRANSPARENT null 230 ؔ Sign Takhallus
U+0615 Mark [Mn] TRANSPARENT null 230 ؕ Small High Tah
U+0616 Mark [Mn] TRANSPARENT null 230 ؖ Small High Alef Lam Yeh
U+0617 Mark [Mn] TRANSPARENT null 230 ؗ Small High Zain
U+0618 Mark [Mn] TRANSPARENT null 30 ؘ Small Fatha
U+0619 Mark [Mn] TRANSPARENT null 31 ؙ Small Damma
U+061A Mark [Mn] TRANSPARENT null 32 ؚ Small Kasra
U+061B Punctuation NON_JOINING null 0 ؛ Semicolon
U+061C Other TRANSPARENT null 0 ؜ Arabic Letter Mark
U+061D Punctuation NON_JOINING null 0 ؝ End Of Text Mark
U+061E Punctuation NON_JOINING null 0 ؞ Triple Dot Punctuation Mark
U+061F Punctuation NON_JOINING null 0 ؟ Question Mark
U+0620 Letter DUAL YEH 0 ؠ Kashmiri Yeh
U+0621 Letter NON_JOINING null 0 ء Hamza
U+0622 Letter RIGHT ALEF 0 آ Alef With Madda Above
U+0623 Letter RIGHT ALEF 0 أ Alef With Hamza Above
U+0624 Letter RIGHT WAW 0 ؤ Waw With Hamza Above
U+0625 Letter RIGHT ALEF 0 إ Alef With Hamza Below
U+0626 Letter DUAL YEH 0 ئ Dotless Yeh With Hamza Above
U+0627 Letter RIGHT ALEF 0 ا Alef
U+0628 Letter DUAL BEH 0 ب Beh
U+0629 Letter RIGHT TEH_MARBUTA 0 ة Teh Marbuta
U+062A Letter DUAL BEH 0 ت Dotless Beh With 2 Dots Above
U+062B Letter DUAL BEH 0 ث Dotless Beh With 3 Dots Above
U+062C Letter DUAL HAH 0 ج Hah With Dot Below
U+062D Letter DUAL HAH 0 ح Hah
U+062E Letter DUAL HAH 0 خ Hah With Dot Above
U+062F Letter RIGHT DAL 0 د Dal
U+0630 Letter RIGHT DAL 0 ذ Dal With Dot Above
U+0631 Letter RIGHT REH 0 ر Reh
U+0632 Letter RIGHT REH 0 ز Reh With Dot Above
U+0633 Letter DUAL SEEN 0 س Seen
U+0634 Letter DUAL SEEN 0 ش Seen With 3 Dots Above
U+0635 Letter DUAL SAD 0 ص Sad
U+0636 Letter DUAL SAD 0 ض Sad With Dot Above
U+0637 Letter DUAL TAH 0 ط Tah
U+0638 Letter DUAL TAH 0 ظ Tah With Dot Above
U+0639 Letter DUAL AIN 0 ع Ain
U+063A Letter DUAL AIN 0 غ Ain With Dot Above
U+063B Letter DUAL GAF 0 ػ Keheh With 2 Dots Above
U+063C Letter DUAL GAF 0 ؼ Keheh With 3 Dots Below
U+063D Letter DUAL FARSI_YEH 0 ؽ Farsi Yeh With Inverted V Above
U+063E Letter DUAL FARSI_YEH 0 ؾ Farsi Yeh With 2 Dots Above
U+063F Letter DUAL FARSI_YEH 0 ؿ Farsi Yeh With 3 Dots Above
U+0640 Letter modifier JOIN_CAUSING null 0 ـ Tatweel
U+0641 Letter DUAL FEH 0 ف Feh
U+0642 Letter DUAL QAF 0 ق Qaf
U+0643 Letter DUAL KAF 0 ك Kaf
U+0644 Letter DUAL LAM 0 ل Lam
U+0645 Letter DUAL MEEM 0 م Meem
U+0646 Letter DUAL NOON 0 ن Noon
U+0647 Letter DUAL HEH 0 ه Heh
U+0648 Letter RIGHT WAW 0 و Waw
U+0649 Letter DUAL YEH 0 ى Dotless Yeh
U+064A Letter DUAL YEH 0 ي Yeh
U+064B Mark [Mn] TRANSPARENT null 27 ً Fathatan
U+064C Mark [Mn] TRANSPARENT null 28 ٌ Dammatan
U+064D Mark [Mn] TRANSPARENT null 29 ٍ Kasratan
U+064E Mark [Mn] TRANSPARENT null 30 َ Fatha
U+064F Mark [Mn] TRANSPARENT null 31 ُ Damma
U+0650 Mark [Mn] TRANSPARENT null 32 ِ Kasra
U+0651 Mark [Mn] TRANSPARENT null 33 ّ Shadda
U+0652 Mark [Mn] TRANSPARENT null 34 ْ Sukun
U+0653 Mark [Mn] TRANSPARENT null 230 ٓ Maddah Above
U+0654 Mark [Mn] TRANSPARENT null 230_MCM ٔ Hamza Above
U+0655 Mark [Mn] TRANSPARENT null 220_MCM ٕ Hamza Below
U+0656 Mark [Mn] TRANSPARENT null 220 ٖ Subscript Alef
U+0657 Mark [Mn] TRANSPARENT null 230 ٗ Inverted Damma
U+0658 Mark [Mn] TRANSPARENT null 230_MCM ٘ Noon Ghunna
U+0659 Mark [Mn] TRANSPARENT null 230 ٙ Zwarakay
U+065A Mark [Mn] TRANSPARENT null 230 ٚ Vowel Sign Small V Above
U+065B Mark [Mn] TRANSPARENT null 230 ٛ Vowel Sign Inverted Small V Above
U+065C Mark [Mn] TRANSPARENT null 220 ٜ Vowel Sign Dot Below
U+065D Mark [Mn] TRANSPARENT null 230 ٝ Reversed Damma
U+065E Mark [Mn] TRANSPARENT null 230 ٞ Fatha with Two Dots
U+065F Mark [Mn] TRANSPARENT null 220 ٟ Wavy Hamza Below
U+0660 Number NON_JOINING null 0 ٠ Digit Zero
U+0661 Number NON_JOINING null 0 ١ Digit One
U+0662 Number NON_JOINING null 0 ٢ Digit Two
U+0663 Number NON_JOINING null 0 ٣ Digit Three
U+0664 Number NON_JOINING null 0 ٤ Digit Four
U+0665 Number NON_JOINING null 0 ٥ Digit Five
U+0666 Number NON_JOINING null 0 ٦ Digit Six
U+0667 Number NON_JOINING null 0 ٧ Digit Seven
U+0668 Number NON_JOINING null 0 ٨ Digit Eight
U+0669 Number NON_JOINING null 0 ٩ Digit Nine
U+066A Punctuation NON_JOINING null 0 ٪ Percent Sign
U+066B Punctuation NON_JOINING null 0 ٫ Decimal Separator
U+066C Punctuation NON_JOINING null 0 ٬ Thousands Separator
U+066D Punctuation NON_JOINING null 0 ٭ Five Pointed Star
U+066E Letter DUAL BEH 0 ٮ Dotless Beh
U+066F Letter DUAL QAF 0 ٯ Dotless Qaf
U+0670 Mark [Mn] TRANSPARENT null 35 ٰ Superscript Alef
U+0671 Letter RIGHT ALEF 0 ٱ Alef With Wasla Above
U+0672 Letter RIGHT ALEF 0 ٲ Alef With Wavy Hamza Above
U+0673 Letter RIGHT ALEF 0 ٳ Alef With Wavy Hamza Below
U+0674 Letter NON_JOINING null 0 ٴ High Hamza
U+0675 Letter RIGHT ALEF 0 ٵ High Hamza Alef
U+0676 Letter RIGHT WAW 0 ٶ High Hamza Waw
U+0677 Letter RIGHT WAW 0 ٷ High Hamza Waw With Damma Above
U+0678 Letter DUAL YEH 0 ٸ High Hamza Dotless Yeh
U+0679 Letter DUAL BEH 0 ٹ Dotless Beh With Tah Above
U+067A Letter DUAL BEH 0 ٺ Dotless Beh With Vertical 2 Dots Above
U+067B Letter DUAL BEH 0 ٻ Dotless Beh With Vertical 2 Dots Below
U+067C Letter DUAL BEH 0 ټ Dotless Beh With Attached Ring Below And 2 Dots Above
U+067D Letter DUAL BEH 0 ٽ Dotless Beh With Inverted 3 Dots Above
U+067E Letter DUAL BEH 0 پ Dotless Beh With 3 Dots Below
U+067F Letter DUAL BEH 0 ٿ Dotless Beh With 4 Dots Above
U+0680 Letter DUAL BEH 0 ڀ Dotless Beh With 4 Dots Below
U+0681 Letter DUAL HAH 0 ځ Hah With Hamza Above
U+0682 Letter DUAL HAH 0 ڂ Hah With Vertical 2 Dots Above
U+0683 Letter DUAL HAH 0 ڃ Hah With 2 Dots Below
U+0684 Letter DUAL HAH 0 ڄ Hah With Vertical 2 Dots Below
U+0685 Letter DUAL HAH 0 څ Hah With 3 Dots Above
U+0686 Letter DUAL HAH 0 چ Hah With 3 Dots Below
U+0687 Letter DUAL HAH 0 ڇ Hah With 4 Dots Below
U+0688 Letter RIGHT DAL 0 ڈ Dal With Tah Above
U+0689 Letter RIGHT DAL 0 ډ Dal With Attached Ring Below
U+068A Letter RIGHT DAL 0 ڊ Dal With Dot Below
U+068B Letter RIGHT DAL 0 ڋ Dal With Dot Below And Tah Above
U+068C Letter RIGHT DAL 0 ڌ Dal With 2 Dots Above
U+068D Letter RIGHT DAL 0 ڍ Dal With 2 Dots Below
U+068E Letter RIGHT DAL 0 ڎ Dal With 3 Dots Above
U+068F Letter RIGHT DAL 0 ڏ Dal With Inverted 3 Dots Above
U+0690 Letter RIGHT DAL 0 ڐ Dal With 4 Dots Above
U+0691 Letter RIGHT REH 0 ڑ Reh With Tah Above
U+0692 Letter RIGHT REH 0 ڒ Reh With V Above
U+0693 Letter RIGHT REH 0 ړ Reh With Attached Ring Below
U+0694 Letter RIGHT REH 0 ڔ Reh With Dot Below
U+0695 Letter RIGHT REH 0 ڕ Reh With V Below
U+0696 Letter RIGHT REH 0 ږ Reh With Dot Below And Dot Within
U+0697 Letter RIGHT REH 0 ڗ Reh With 2 Dots Above
U+0698 Letter RIGHT REH 0 ژ Reh With 3 Dots Above
U+0699 Letter RIGHT REH 0 ڙ Reh With 4 Dots Above
U+069A Letter DUAL SEEN 0 ښ Seen With Dot Below And Dot Above
U+069B Letter DUAL SEEN 0 ڛ Seen With 3 Dots Below
U+069C Letter DUAL SEEN 0 ڜ Seen With 3 Dots Below And 3 Dots Above
U+069D Letter DUAL SAD 0 ڝ Sad With 2 Dots Below
U+069E Letter DUAL SAD 0 ڞ Sad With 3 Dots Above
U+069F Letter DUAL TAH 0 ڟ Tah With 3 Dots Above
U+06A0 Letter DUAL AIN 0 ڠ Ain With 3 Dots Above
U+06A1 Letter DUAL FEH 0 ڡ Dotless Feh
U+06A2 Letter DUAL FEH 0 ڢ Dotless Feh With Dot Below
U+06A3 Letter DUAL FEH 0 ڣ Feh With Dot Below
U+06A4 Letter DUAL FEH 0 ڤ Dotless Feh With 3 Dots Above
U+06A5 Letter DUAL FEH 0 ڥ Dotless Feh With 3 Dots Below
U+06A6 Letter DUAL FEH 0 ڦ Dotless Feh With 4 Dots Above
U+06A7 Letter DUAL QAF 0 ڧ Dotless Qaf With Dot Above
U+06A8 Letter DUAL QAF 0 ڨ Dotless Qaf With 3 Dots Above
U+06A9 Letter DUAL GAF 0 ک Keheh
U+06AA Letter DUAL SWASH_KAF 0 ڪ Swash Kaf
U+06AB Letter DUAL GAF 0 ګ Keheh With Attached Ring Below
U+06AC Letter DUAL KAF 0 ڬ Kaf With Dot Above
U+06AD Letter DUAL KAF 0 ڭ Kaf With 3 Dots Above
U+06AE Letter DUAL KAF 0 ڮ Kaf With 3 Dots Below
U+06AF Letter DUAL GAF 0 گ Gaf
U+06B0 Letter DUAL GAF 0 ڰ Gaf With Attached Ring Below
U+06B1 Letter DUAL GAF 0 ڱ Gaf With 2 Dots Above
U+06B2 Letter DUAL GAF 0 ڲ Gaf With 2 Dots Below
U+06B3 Letter DUAL GAF 0 ڳ Gaf With Vertical 2 Dots Below
U+06B4 Letter DUAL GAF 0 ڴ Gaf With 3 Dots Above
U+06B5 Letter DUAL LAM 0 ڵ Lam With V Above
U+06B6 Letter DUAL LAM 0 ڶ Lam With Dot Above
U+06B7 Letter DUAL LAM 0 ڷ Lam With 3 Dots Above
U+06B8 Letter DUAL LAM 0 ڸ Lam With 3 Dots Below
U+06B9 Letter DUAL NOON 0 ڹ Noon With Dot Below
U+06BA Letter DUAL NOON 0 ں Dotless Noon
U+06BB Letter DUAL NOON 0 ڻ Dotless Noon With Tah Above
U+06BC Letter DUAL NOON 0 ڼ Noon With Attached Ring Below
U+06BD Letter DUAL NYA 0 ڽ Nya
U+06BE Letter DUAL KNOTTED_HEH 0 ھ Knotted Heh
U+06BF Letter DUAL HAH 0 ڿ Hah With 3 Dots Below And Dot Above
U+06C0 Letter RIGHT TEH_MARBUTA 0 ۀ Dotless Teh Marbuta With Hamza Above
U+06C1 Letter DUAL HEH_GOAL 0 ہ Heh Goal
U+06C2 Letter DUAL HEH_GOAL 0 ۂ Heh Goal With Hamza Above
U+06C3 Letter RIGHT TEH_MARBUTA_GOAL 0 ۃ Teh Marbuta Goal
U+06C4 Letter RIGHT WAW 0 ۄ Waw With Attached Ring Within
U+06C5 Letter RIGHT WAW 0 ۅ Waw With Bar
U+06C6 Letter RIGHT WAW 0 ۆ Waw With V Above
U+06C7 Letter RIGHT WAW 0 ۇ Waw With Damma Above
U+06C8 Letter RIGHT WAW 0 ۈ Waw With Alef Above
U+06C9 Letter RIGHT WAW 0 ۉ Waw With Inverted V Above
U+06CA Letter RIGHT WAW 0 ۊ Waw With 2 Dots Above
U+06CB Letter RIGHT WAW 0 ۋ Waw With 3 Dots Above
U+06CC Letter DUAL FARSI_YEH 0 ی Farsi Yeh
U+06CD Letter RIGHT YEH_WITH_TAIL 0 ۍ Yeh With Tail
U+06CE Letter DUAL FARSI_YEH 0 ێ Farsi Yeh With V Above
U+06CF Letter RIGHT WAW 0 ۏ Waw With Dot Above
U+06D0 Letter DUAL YEH 0 ې Dotless Yeh With Vertical 2 Dots Below
U+06D1 Letter DUAL YEH 0 ۑ Dotless Yeh With 3 Dots Below
U+06D2 Letter RIGHT YEH_BARREE 0 ے Yeh Barree
U+06D3 Letter RIGHT YEH_BARREE 0 ۓ Yeh Barree With Hamza Above
U+06D4 Punctuation NON_JOINING null 0 ۔ Full Stop
U+06D5 Letter NON_JOINING TEH_MARBUTA 0 ە Dotless Teh Marbuta
U+06D6 Mark [Mn] TRANSPARENT null 230 ۖ Small High Sad Lam Alef Maksura
U+06D7 Mark [Mn] TRANSPARENT null 230 ۗ Small High Qaf Lam Alef Maksura
U+06D8 Mark [Mn] TRANSPARENT null 230 ۘ Small High Meem Initial Form
U+06D9 Mark [Mn] TRANSPARENT null 230 ۙ Small High Lam Alef
U+06DA Mark [Mn] TRANSPARENT null 230 ۚ Small High Jeem
U+06DB Mark [Mn] TRANSPARENT null 230 ۛ Small High Three Dots
U+06DC Mark [Mn] TRANSPARENT null 230_MCM ۜ Small High Seen
U+06DD Other NON_JOINING null 0 ۝ End Of Ayah
U+06DE Other NON_JOINING null 0 ۞ Start Of Rub El Hizb
U+06DF Mark [Mn] TRANSPARENT null 230 ۟ Small High Rounded Zero
U+06E0 Mark [Mn] TRANSPARENT null 230 ۠ Small High Upright Rectangular Zero
U+06E1 Mark [Mn] TRANSPARENT null 230 ۡ Small High Dotless Head Of Khah
U+06E2 Mark [Mn] TRANSPARENT null 230 ۢ Small High Meem Isolated Form
U+06E3 Mark [Mn] TRANSPARENT null 220_MCM ۣ Small Low Seen
U+06E4 Mark [Mn] TRANSPARENT null 230 ۤ Small High Madda
U+06E5 Letter modifier NON_JOINING null 0 ۥ Small Waw
U+06E6 Letter modifier NON_JOINING null 0 ۦ Small Yeh
U+06E7 Mark [Mn] TRANSPARENT null 230_MCM ۧ Small High Yeh
U+06E8 Mark [Mn] TRANSPARENT null 230_MCM ۨ Small High Noon
U+06E9 Symbol NON_JOINING null 0 ۩ Place Of Sajdah
U+06EA Mark [Mn] TRANSPARENT null 220 ۪ Empty Centre Low Stop
U+06EB Mark [Mn] TRANSPARENT null 230 ۫ Empty Centre High Stop
U+06EC Mark [Mn] TRANSPARENT null 230 ۬ Rounded High Stop With Filled Centre
U+06ED Mark [Mn] TRANSPARENT null 220 ۭ Small Low Meem
U+06EE Letter RIGHT DAL 0 ۮ Dal With Inverted V Above
U+06EF Letter RIGHT REH 0 ۯ Reh With Inverted V Above
U+06F0 Number NON_JOINING null 0 ۰ Extended Digit Zero
U+06F1 Number NON_JOINING null 0 ۱ Extended Digit One
U+06F2 Number NON_JOINING null 0 ۲ Extended Digit Two
U+06F3 Number NON_JOINING null 0 ۳ Extended Digit Three
U+06F4 Number NON_JOINING null 0 ۴ Extended Digit Four
U+06F5 Number NON_JOINING null 0 ۵ Extended Digit Five
U+06F6 Number NON_JOINING null 0 ۶ Extended Digit Six
U+06F7 Number NON_JOINING null 0 ۷ Extended Digit Seven
U+06F8 Number NON_JOINING null 0 ۸ Extended Digit Eight
U+06F9 Number NON_JOINING null 0 ۹ Extended Digit Nine
U+06FA Letter DUAL SEEN 0 ۺ Sheen With Dot Below
U+06FB Letter DUAL SAD 0 ۻ Dad With Dot Below
U+06FC Letter DUAL AIN 0 ۼ Ghain With Dot Below
U+06FD Symbol NON_JOINING null 0 ۽ Sign Sindhi Ampersand
U+06FE Symbol NON_JOINING null 0 ۾ Sign Sindhi Postposition Men
U+06FF Letter DUAL KNOTTED_HEH 0 ۿ Knotted Heh With Inverted V Above

Arabic Supplement character table

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+0750 Letter DUAL BEH 0 ݐ Dotless Beh With Horizontal 3 Dots Below
U+0751 Letter DUAL BEH 0 ݑ Beh With 3 Dots Above
U+0752 Letter DUAL BEH 0 ݒ Dotless Beh With Inverted 3 Dots Below
U+0753 Letter DUAL BEH 0 ݓ Dotless Beh With Inverted 3 Dots Below And 2 Dots Above
U+0754 Letter DUAL BEH 0 ݔ Dotless Beh With 2 Dots Below And Dot Above
U+0755 Letter DUAL BEH 0 ݕ Dotless Beh With Inverted V Below
U+0756 Letter DUAL BEH 0 ݖ Dotless Beh With V Above
U+0757 Letter DUAL HAH 0 ݗ Hah With 2 Dots Above
U+0758 Letter DUAL HAH 0 ݘ Hah With Inverted 3 Dots Below
U+0759 Letter RIGHT DAL 0 ݙ Dal With Vertical 2 Dots Below And Tah Above
U+075A Letter RIGHT DAL 0 ݚ Dal With Inverted V Below
U+075B Letter RIGHT REH 0 ݛ Reh With Bar
U+075C Letter DUAL SEEN 0 ݜ Seen With 4 Dots Above
U+075D Letter DUAL AIN 0 ݝ Ain With 2 Dots Above
U+075E Letter DUAL AIN 0 ݞ Ain With Inverted 3 Dots Above
U+075F Letter DUAL AIN 0 ݟ Ain With Vertical 2 Dots Above
U+0760 Letter DUAL FEH 0 ݠ Dotless Feh With 2 Dots Below
U+0761 Letter DUAL FEH 0 ݡ Dotless Feh With Inverted 3 Dots Below
U+0762 Letter DUAL GAF 0 ݢ Keheh With Dot Above
U+0763 Letter DUAL GAF 0 ݣ Keheh With 3 Dots Above
U+0764 Letter DUAL GAF 0 ݤ Keheh With Inverted 3 Dots Below
U+0765 Letter DUAL MEEM 0 ݥ Meem With Dot Above
U+0766 Letter DUAL MEEM 0 ݦ Meem With Dot Below
U+0767 Letter DUAL NOON 0 ݧ Noon With 2 Dots Below
U+0768 Letter DUAL NOON 0 ݨ Noon With Tah Above
U+0769 Letter DUAL NOON 0 ݩ Noon With V Above
U+076A Letter DUAL LAM 0 ݪ Lam With Bar
U+076B Letter RIGHT REH 0 ݫ Reh With Vertical 2 Dots Above
U+076C Letter RIGHT REH 0 ݬ Reh With Hamza Above
U+076D Letter DUAL SEEN 0 ݭ Seen With Vertical 2 Dots Above
U+076E Letter DUAL HAH 0 ݮ Hah With Tah Below
U+076F Letter DUAL HAH 0 ݯ Hah With Tah And 2 Dots Below
U+0770 Letter DUAL SEEN 0 ݰ Seen With 2 Dots And Tah Above
U+0771 Letter RIGHT REH 0 ݱ Reh With 2 Dots And Tah Above
U+0772 Letter DUAL HAH 0 ݲ Hah With Tah Above
U+0773 Letter RIGHT ALEF 0 ݳ Alef With Digit Two Above
U+0774 Letter RIGHT ALEF 0 ݴ Alef With Digit Three Above
U+0775 Letter DUAL FARSI_YEH 0 ݵ Farsi Yeh With Digit Two Above
U+0776 Letter DUAL FARSI_YEH 0 ݶ Farsi Yeh With Digit Three Above
U+0777 Letter DUAL YEH 0 ݷ Dotless Yeh With Digit Four Below
U+0778 Letter RIGHT WAW 0 ݸ Waw With Digit Two Above
U+0779 Letter RIGHT WAW 0 ݹ Waw With Digit Three Above
U+077A Letter DUAL BURUSHASKI_YEH_BARREE 0 ݺ Burushaski Yeh Barree With Digit Two Above
U+077B Letter DUAL BURUSHASKI_YEH_BARREE 0 ݻ Burushaski Yeh Barree With Digit Three Above
U+077C Letter DUAL HAH 0 ݼ Hah With Digit Four Below
U+077D Letter DUAL SEEN 0 ݽ Seen With Digit Four Above
U+077E Letter DUAL SEEN 0 ݾ Seen With Inverted V Above
U+077F Letter DUAL KAF 0 ݿ Kaf With 2 Dots Above

Arabic Extended-A character table

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+08A0 Letter DUAL BEH 0 ࢠ Dotless Beh With V Below
U+08A1 Letter DUAL BEH 0 ࢡ Beh With Hamza Above
U+08A2 Letter DUAL HAH 0 ࢢ Hah With Dot Below And 2 Dots Above
U+08A3 Letter DUAL TAH 0 ࢣ Tah With 2 Dots Above
U+08A4 Letter DUAL FEH 0 ࢤ Dotless Feh With Dot Below And 3 Dots Above
U+08A5 Letter DUAL QAF 0 ࢥ Qaf With Dot Below
U+08A6 Letter DUAL LAM 0 ࢦ Lam With Double Bar
U+08A7 Letter DUAL MEEM 0 ࢧ Meem With 3 Dots Above
U+08A8 Letter DUAL YEH 0 ࢨ Yeh With Hamza Above
U+08A9 Letter DUAL YEH 0 ࢩ Yeh With Dot Above
U+08AA Letter RIGHT REH 0 ࢪ Reh With Loop
U+08AB Letter RIGHT WAW 0 ࢫ Waw With Dot Within
U+08AC Letter RIGHT ROHINGYA_YEH 0 ࢬ Rohingya Yeh
U+08AD Letter NON_JOINING null 0 ࢭ Low Alef
U+08AE Letter RIGHT DAL 0 ࢮ Dal With 3 Dots Below
U+08AF Letter DUAL SAD 0 ࢯ Sad With 3 Dots Below
U+08B0 Letter DUAL GAF 0 ࢰ Keheh With Stroke Below
U+08B1 Letter RIGHT STRAIGHT_WAW 0 ࢱ Straight Waw
U+08B2 Letter RIGHT REH 0 ࢲ Reh With Dot And Inverted V Above
U+08B3 Letter DUAL AIN 0 ࢳ Ain With 3 Dots Below
U+08B4 Letter DUAL KAF 0 ࢴ Kaf With Dot Below
U+08B5 Letter DUAL QAF 0 ࢵ Qaf With Dot Below
U+08B6 Letter DUAL BEH 0 ࢶ Beh With Meem Above
U+08B7 Letter DUAL BEH 0 ࢷ Dotless Beh With 3 Dots Below And Meem Above
U+08B8 Letter DUAL BEH 0 ࢸ Dotless Beh With Teh Above
U+08B9 Letter RIGHT REH 0 ࢹ Reh With Noon Above
U+08BA Letter DUAL YEH 0 ࢺ Yeh With Noon Above
U+08BB Letter DUAL AFRICAN_FEH 0 ࢻ African Feh
U+08BC Letter DUAL AFRICAN_QAF 0 ࢼ African Qaf
U+08BD Letter DUAL AFRICAN_NOON 0 ࢽ African Noon
U+08BE Letter DUAL BEH 0 ࢾ Peh With Small V
U+08BF Letter DUAL BEH 0 ࢿ Teh With Small V
U+08C0 Letter DUAL BEH 0 ࣀ Tteh With Small V
U+08C1 Letter DUAL HAH 0 ࣁ Tcheh With Small V
U+08C2 Letter DUAL GAF 0 ࣂ Keheh With Small V
U+08C3 Letter DUAL AIN 0 ࣃ Ghain With 3 Dots Above
U+08C4 Letter DUAL AFRICAN_QAF 0 ࣄ African Qaf With 3 Dots Above
U+08C5 Letter DUAL HAH 0 ࣅ Jeem With 3 Dots Above
U+08C6 Letter DUAL HAH 0 ࣆ Jeem With 3 Dots Below
U+08C7 Letter DUAL LAM 0 ࣇ Lam With Small Arabic Tah Above
U+08C8 Letter DUAL GAF 0 ࣈ Graf
U+08C9 Letter modifier TRANSPARENT null 0 ࣉ Small Farsi Yeh
U+08CA Mark [Mn] TRANSPARENT null 230_MCM ࣊ Small High Farsi Yeh
U+08CB Mark [Mn] TRANSPARENT null 230_MCM ࣋ Small High Yeh Barree With Two Dots Below
U+08CC Mark [Mn] TRANSPARENT null 230 ࣌ Small High Word Sah
U+08CD Mark [Mn] TRANSPARENT null 230_MCM ࣍ Small High Zah
U+08CE Mark [Mn] TRANSPARENT null 230_MCM ࣎ Large Round Dot Above
U+08CF Mark [Mn] TRANSPARENT null 220_MCM ࣏ Large Round Dot Below
U+08D0 Mark [Mn] TRANSPARENT null 220 ࣐ Sukun Below
U+08D1 Mark [Mn] TRANSPARENT null 220 ࣑ Large Circle Below
U+08D2 Mark [Mn] TRANSPARENT null 220 ࣒ Large Round Dot Inside Circle Below
U+08D3 Mark [Mn] TRANSPARENT null 220_MCM ࣓ Small Low Waw
U+08D4 Mark [Mn] TRANSPARENT null 230 ࣔ Small High Word Ar-Rub
U+08D5 Mark [Mn] TRANSPARENT null 230 ࣕ Small High Sad
U+08D6 Mark [Mn] TRANSPARENT null 230 ࣖ Small High Ain
U+08D7 Mark [Mn] TRANSPARENT null 230 ࣗ Small High Qaf
U+08D8 Mark [Mn] TRANSPARENT null 230 ࣘ Small High Noon With Kasra
U+08D9 Mark [Mn] TRANSPARENT null 230 ࣙ Small Low Noon With Kasra
U+08DA Mark [Mn] TRANSPARENT null 230 ࣚ Small High Word Ath-Thalatha
U+08DB Mark [Mn] TRANSPARENT null 230 ࣛ Small High Word As-Sajda
U+08DC Mark [Mn] TRANSPARENT null 230 ࣜ Small High Word An-Nisf
U+08DD Mark [Mn] TRANSPARENT null 230 ࣝ Small High Word Sakta
U+08DE Mark [Mn] TRANSPARENT null 230 ࣞ Small High Word Qif
U+08DF Mark [Mn] TRANSPARENT null 230 ࣟ Small High Word Waqfa
U+08E0 Mark [Mn] TRANSPARENT null 230 ࣠ Small High Footnote Marker
U+08E1 Mark [Mn] TRANSPARENT null 230 ࣡ Small High Sign Safha
U+08E2 Other NON_JOINING null 0 ࣢ Disputed End Of Ayah
U+08E3 Mark [Mn] TRANSPARENT null 220 ࣣ Turned Damma Below
U+08E4 Mark [Mn] TRANSPARENT null 230 ࣤ Curly Fatha
U+08E5 Mark [Mn] TRANSPARENT null 230 ࣥ Curly Damma
U+08E6 Mark [Mn] TRANSPARENT null 220 ࣦ Curly Kasra
U+08E7 Mark [Mn] TRANSPARENT null 230 ࣧ Curly Fathatan
U+08E8 Mark [Mn] TRANSPARENT null 230 ࣨ Curly Dammatan
U+08E9 Mark [Mn] TRANSPARENT null 220 ࣩ Curly Kasratan
U+08EA Mark [Mn] TRANSPARENT null 230 ࣪ Tone One Dot Above
U+08EB Mark [Mn] TRANSPARENT null 230 ࣫ Tone Two Dots aAove
U+08EC Mark [Mn] TRANSPARENT null 230 ࣬ Tone Loop Above
U+08ED Mark [Mn] TRANSPARENT null 220 ࣭ Tone One Dot Below
U+08EE Mark [Mn] TRANSPARENT null 220 ࣮ Tone Two Dots Below
U+08EF Mark [Mn] TRANSPARENT null 220 ࣯ Tone Loop Below
U+08F0 Mark [Mn] TRANSPARENT null 27 ࣰ Open Fathatan
U+08F1 Mark [Mn] TRANSPARENT null 28 ࣱ Open Dammatan
U+08F2 Mark [Mn] TRANSPARENT null 29 ࣲ Open Kasratan
U+08F3 Mark [Mn] TRANSPARENT null 230_MCM ࣳ Small High Waw
U+08F4 Mark [Mn] TRANSPARENT null 230 ࣴ Fatha With Ring
U+08F5 Mark [Mn] TRANSPARENT null 230 ࣵ Fatha With Dot Above
U+08F6 Mark [Mn] TRANSPARENT null 220 ࣶ Kasra With Dot Below
U+08F7 Mark [Mn] TRANSPARENT null 230 ࣷ Left Arrowhead Above
U+08F8 Mark [Mn] TRANSPARENT null 230 ࣸ Right Arrowhead Above
U+08F9 Mark [Mn] TRANSPARENT null 220 ࣹ Left Arrowhead Below
U+08FA Mark [Mn] TRANSPARENT null 220 ࣺ Right Arrowhead Below
U+08FB Mark [Mn] TRANSPARENT null 230 ࣻ Double Right Arrowhead Above
U+08FC Mark [Mn] TRANSPARENT null 230 ࣼ Double Right Arrowhead Above With Dot
U+08FD Mark [Mn] TRANSPARENT null 230 ࣽ Right Arrowhead Above With Dot
U+08FE Mark [Mn] TRANSPARENT null 230 ࣾ Damma With Dot
U+08FF Mark [Mn] TRANSPARENT null 230 ࣿ Mark Sideways Noon Ghunna

Arabic Extended-B character table

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+0870 Letter RIGHT ALEF 0 ࡰ Alef With Attached Fatha
U+0871 Letter RIGHT ALEF 0 ࡱ Alef With Attached Top Right Fatha
U+0872 Letter RIGHT ALEF 0 ࡲ Alef With Right Middle Stroke
U+0873 Letter RIGHT ALEF 0 ࡳ Alef With Left Middle Stroke
U+0874 Letter RIGHT ALEF 0 ࡴ Alef With Attached Kasra
U+0875 Letter RIGHT ALEF 0 ࡵ Alef With Attached Bottom Right Kasra
U+0876 Letter RIGHT ALEF 0 ࡶ Alef With Attached Round Dot Above
U+0877 Letter RIGHT ALEF 0 ࡷ Alef With Attached Right Round Dot
U+0878 Letter RIGHT ALEF 0 ࡸ Alef With Attached Left Round Dot
U+0879 Letter RIGHT ALEF 0 ࡹ Alef With Attached Round Dot Below
U+087A Letter RIGHT ALEF 0 ࡺ Alef With Dot Above
U+087B Letter RIGHT ALEF 0 ࡻ Alef With Attached Top Right Fatha And Dot Above
U+087C Letter RIGHT ALEF 0 ࡼ Alef With Right Middle Stroke And Dot Above
U+087D Letter RIGHT ALEF 0 ࡽ Alef With Attached Bottom Right Kasra And Dot Above
U+087E Letter RIGHT ALEF 0 ࡾ Alef With Attached Top Right Fatha And Left Ring
U+087F Letter RIGHT ALEF 0 ࡿ Alef With Right Middle Stroke And Left Ring
U+0880 Letter RIGHT ALEF 0 ࢀ Alef With Attached Bottom Right Kasra And Left Ring
U+0881 Letter RIGHT ALEF 0 ࢁ Alef With Attached Right Hamza
U+0882 Letter RIGHT ALEF 0 ࢂ Alef With Attached Left Hamza
U+0883 Letter modifier JOIN_CAUSING null 0 ࢃ Tatweel With Overstruck Hamza
U+0884 Letter modifier JOIN_CAUSING null 0 ࢄ Tatweel With Overstruck Waw
U+0885 Letter modifier JOIN_CAUSING null 0 ࢅ Tatweel With Two Dots Below
U+0886 Letter DUAL THIN_YEH 0 ࢆ Thin Yeh
U+0887 Letter NON_JOINING null 0 ࢇ Baseline Round Dot
U+0888 Symbol NON_JOINING null 0 ࢈ Raised Round Dot
U+0889 Letter DUAL NOON 0 ࢉ Noon With Inverted Small V
U+088A Letter DUAL HAH 0 ࢊ Hah With Inverted Small V Below
U+088B Letter DUAL TAH 0 ࢋ Tah With Dot Below
U+088C Letter DUAL TAH 0 ࢌ Tah With Three Dots Below
U+088D Letter DUAL GAF 0 ࢍ Keheh With Two Dots Vertically Below
U+088E Letter RIGHT VERTICAL_TAIL 0 ࢎ Vertical Tail
U+088F unassigned
U+0890 Symbol NON_JOINING null 0 ࢐ Pound Mark Above
U+0891 Symbol NON_JOINING null 0 ࢑ Piastre Mark Above
U+0892 unassigned
U+0893 unassigned
U+0894 unassigned
U+0895 unassigned
U+0896 unassigned
U+0897 Mark [Mn] TRANSPARENT null 230 ࢗ Pepet
U+0898 Mark [Mn] TRANSPARENT null 230 ࢘ Small High Word Al-Juz
U+0899 Mark [Mn] TRANSPARENT null 220 ࢙ Small Low Word Ishmaam
U+089A Mark [Mn] TRANSPARENT null 220 ࢚ Small Low Word Imaala
U+089B Mark [Mn] TRANSPARENT null 220 ࢛ Small Low Word Tasheel
U+089C Mark [Mn] TRANSPARENT null 230 ࢜ Madda Waajib
U+089D Mark [Mn] TRANSPARENT null 230 ࢝ Superscript Alef Mokhassas
U+089E Mark [Mn] TRANSPARENT null 230 ࢞ Doubled Madda
U+089F Mark [Mn] TRANSPARENT null 230 ࢟ Half Madda Over Madda

Arabic Extended-C character table

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+10EC0 unassigned
U+10EC1 unassigned
U+10EC2 Letter RIGHT DAL 0 𐻂 Dal With Two Dots Vertically Below
U+10EC3 Letter DUAL TAH 0 𐻂 Tah With Two Dots Vertically Below
U+10EC4 Letter DUAL KAF 0 𐻂 Kaf With Two Dots Vertically Below
U+10EC5 unassigned
U+10EC6 unassigned
U+10EC7 unassigned
U+10EC8 unassigned
U+10EC9 unassigned
U+10ECA unassigned
U+10ECB unassigned
U+10ECC unassigned
U+10ECD unassigned
U+10ECE unassigned
U+10ECF unassigned
U+10ED0 unassigned
U+10ED1 unassigned
U+10ED2 unassigned
U+10ED3 unassigned
U+10ED4 unassigned
U+10ED5 unassigned
U+10ED6 unassigned
U+10ED7 unassigned
U+10ED8 unassigned
U+10ED9 unassigned
U+10EDA unassigned
U+10EDB unassigned
U+10EDC unassigned
U+10EDD unassigned
U+10EDE unassigned
U+10EDF unassigned
U+10EE0 unassigned
U+10EE1 unassigned
U+10EE2 unassigned
U+10EE3 unassigned
U+10EE4 unassigned
U+10EE5 unassigned
U+10EE6 unassigned
U+10EE7 unassigned
U+10EE8 unassigned
U+10EE9 unassigned
U+10EEA unassigned
U+10EEB unassigned
U+10EEC unassigned
U+10EED unassigned
U+10EEE unassigned
U+10EEF unassigned
U+10EF0 unassigned
U+10EF1 unassigned
U+10EF2 unassigned
U+10EF3 unassigned
U+10EF4 unassigned
U+10EF5 unassigned
U+10EF6 unassigned
U+10EF7 unassigned
U+10EF8 unassigned
U+10EF9 unassigned
U+10EFA unassigned
U+10EFB unassigned
U+10EFC Mark [Mn] TRANSPARENT null 0 𐻼 Combining Alef Overlay
U+10EFD Mark [Mn] TRANSPARENT null 220 𐻽 Small Low Word Sakta
U+10EFE Mark [Mn] TRANSPARENT null 220 𐻾 Small Low Word Qasr
U+10EFF Mark [Mn] TRANSPARENT null 220 𐻿 Small Low Word Madda

Rumi Numeral Symbols character table

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+10E60 Number NON_JOINING null 0 𐹠 Digit One
U+10E61 Number NON_JOINING null 0 𐹡 Digit Two
U+10E62 Number NON_JOINING null 0 𐹢 Digit Three
U+10E63 Number NON_JOINING null 0 𐹣 Digit Four
U+10E64 Number NON_JOINING null 0 𐹤 Digit Five
U+10E65 Number NON_JOINING null 0 𐹥 Digit Six
U+10E66 Number NON_JOINING null 0 𐹦 Digit Seven
U+10E67 Number NON_JOINING null 0 𐹧 Digit Eight
U+10E68 Number NON_JOINING null 0 𐹨 Digit Nine
U+10E69 Number NON_JOINING null 0 𐹩 Number Ten
U+10E6A Number NON_JOINING null 0 𐹪 Number Twenty
U+10E6B Number NON_JOINING null 0 𐹫 Number Thirty
U+10E6C Number NON_JOINING null 0 𐹬 Number Forty
U+10E6D Number NON_JOINING null 0 𐹭 Number Fifty
U+10E6E Number NON_JOINING null 0 𐹮 Number Sixty
U+10E6F Number NON_JOINING null 0 𐹯 Number Seventy
U+10E70 Number NON_JOINING null 0 𐹰 Number Eighty
U+10E71 Number NON_JOINING null 0 𐹱 Number Ninety
U+10E72 Number NON_JOINING null 0 𐹲 Number One Hundred
U+10E73 Number NON_JOINING null 0 𐹳 Number Two Hundred
U+10E74 Number NON_JOINING null 0 𐹴 Number Three Hundred
U+10E75 Number NON_JOINING null 0 𐹵 Number Four Hundred
U+10E76 Number NON_JOINING null 0 𐹶 Number Five Hundred
U+10E77 Number NON_JOINING null 0 𐹷 Number Six Hundred
U+10E78 Number NON_JOINING null 0 𐹸 Number Seven Hundred
U+10E79 Number NON_JOINING null 0 𐹹 Number Eight Hundred
U+10E7A Number NON_JOINING null 0 𐹺 Number Nine Hundred
U+10E7B Number NON_JOINING null 0 𐹻 Fraction One Half
U+10E7C Number NON_JOINING null 0 𐹼 Fraction One Quarter
U+10E7D Number NON_JOINING null 0 𐹽 Fraction One Third
U+10E7E Number NON_JOINING null 0 𐹾 Fraction Two Thirds
U+10E7F unassigned

Miscellaneous character table

Other important characters that may be encountered when shaping runs of Arabic text include the dotted-circle placeholder (U+25CC), the combining grapheme joiner (U+034F), the zero-width joiner (U+200D) and zero-width non-joiner (U+200C), the left-to-right text marker (U+200E) and right-to-left text marker (U+200F), and the no-break space (U+00A0).

The dotted-circle placeholder is frequently used when displaying a combining mark in isolation. Real-world text syllables may also use other characters, such as hyphens or dashes, in a similar placeholder fashion; shaping engines should cope with this situation gracefully.

Codepoint Unicode category Joining type Joining group Mark class Glyph
U+00A0 Separator NON_JOINING null 0   No-break space
U+034F Other NON_JOINING null 0 ͏ Combining grapheme joiner
U+200C Other NON_JOINING null 0 ‌ Zero-width non-joiner
U+200D Other JOIN_CAUSING null 0 ‍ Zero-width joiner
U+200E Other NON_JOINING null 0 ‎ Left-to-Right marker
U+200F Other NON_JOINING null 0 ‏ Right-to-Left marker
U+2010 Punctuation NON_JOINING null 0 ‐ Hyphen
U+2011 Punctuation NON_JOINING null 0 ‑ No-break hyphen
U+2012 Punctuation NON_JOINING null 0 ‒ Figure dash
U+2013 Punctuation NON_JOINING null 0 – En dash
U+2014 Punctuation NON_JOINING null 0 — Em dash
U+25CC Symbol NON_JOINING null 0 ◌ Dotted circle

The combining grapheme joiner (CGJ) is primarily used to alter the order in which adjacent marks are positioned during the mark-reordering stage, in order to adhere to the needs of a non-default language orthography.

The zero-width joiner (ZWJ) is primarily used to force the usage of the cursive connecting form of a letter even when the context of the adjoining letters would not trigger the connecting form.

For example, to show the initial form of a letter in isolation (such as for dislaying it in a table of forms), the sequence "Letter,ZWJ" would be used. To show the medial form of a letter in isolation, the sequence "ZWJ,Letter,ZWJ" would be used.

The right-to-left mark (RLM) and left-to-right mark (LRM) are used by the Unicode bidirectionality algorithm (BiDi) to indicate the points in a text run at which the writing direction changes.

The no-break space is primarily used to display those codepoints that are defined as non-spacing (such as vowel or diacritical marks and "Hamza") in an isolated context, as an alternative to displaying them superimposed on the dotted-circle placeholder.