
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic: POS Tags: PART

There are 23 PART lemmas (0%), 29 PART types (0%) and 3797 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 10 in number of lemmas, 13 in number of types and 11 in number of tokens.
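The three counts above can be reproduced from a CoNLL-U file by collecting the distinct lemmas, the distinct word forms (types) and the total occurrences (tokens) of one UPOS tag. The following is a minimal sketch, not the script that generated this page. The function name `pos_counts` and the toy sample are hypothetical, and the code assumes the standard ten-column, tab-separated CoNLL-U layout.

```python
def pos_counts(conllu_text, tag):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        if upos == tag:
            types.add(form)
            lemmas.add(lemma)
            tokens += 1
    return len(lemmas), len(types), tokens


# Hypothetical toy sample (not taken from UD_Arabic): ID, FORM, LEMMA, UPOS,
# followed by six unused columns filled with "_".
rows = [
    ("1", "لا", "لَا", "PART"),
    ("2", "إلا", "إِلَّا", "PART"),
    ("3", "الا", "إِلَّا", "PART"),
    ("4", "لا", "لَا", "PART"),
    ("5", "كتب", "كَتَب", "VERB"),
]
sample = "\n".join("\t".join(r + ("_",) * 6) for r in rows)

print(pos_counts(sample, "PART"))
```

In the toy sample the two spellings إلا and الا share one lemma, which is why the type count exceeds the lemma count.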

The 10 most frequent PART lemmas: سَ، لَا، قَد، لَم، إِنَّ، إِلَّا، لَن، سَوفَ، أَمَّا، مَا

The 10 most frequent PART types: س، لا، قد، لم، إن، لن، إلا، سوف، الا، ما

The most frequent ambiguous lemmas (only 4 observed): إِنَّ (CCONJ 934, PART 200), مَا (DET 1007, PART 67, INTJ 1), أَ (X 48, PART 10), لِ (ADP 6661, CCONJ 202, PART 1)

The 10 most frequent ambiguous types: س (PART 1039, X 2), لم (PART 483, X 3), إن (CCONJ 571, PART 170, X 3), إلا (PART 87, X 17), الا (PART 78, X 12, CCONJ 1), ما (DET 1007, PART 67, X 4, INTJ 1), ان (CCONJ 1956, PART 30, X 11, VERB 1), اما (PART 26, CCONJ 5), ال (PART 24, X 9), ل (ADP 6520, CCONJ 202, PART 23, X 2)
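A type counts as ambiguous when the same word form appears with more than one POS tag. As a rough sketch of how such a list could be built, the snippet below groups tag counts by form and keeps the forms with at least two tags. The helper `ambiguous_types` and its input pairs are hypothetical and do not reproduce the treebank counts.

```python
from collections import Counter, defaultdict


def ambiguous_types(pairs):
    """Map each form seen with more than one tag to its per-tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    return {form: tags for form, tags in by_form.items() if len(tags) > 1}


# Hypothetical (form, UPOS) observations, not real treebank counts.
pairs = [("ما", "DET"), ("ما", "DET"), ("ما", "PART"), ("قد", "PART")]
result = ambiguous_types(pairs)

for form, tags in result.items():
    # most_common() orders the tags by descending frequency, as in the list above
    print(form, tags.most_common())
```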

Morphology

The form / lemma ratio of PART is 1.260870 (the average of all parts of speech is 1.685281).
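The reported ratio is consistent with dividing the number of PART types by the number of PART lemmas given at the top of this page. The short check below only illustrates that arithmetic; it assumes the ratio is computed this way and printed to six decimal places.

```python
# Counts reported earlier on this page for PART in UD_Arabic.
part_types = 29
part_lemmas = 23

ratio = part_types / part_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # 1.260870
```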

The highest number of forms (3) was observed with the lemma “اَل”: ال, الـ, الـــ.

The 2nd highest number of forms (2) was observed with the lemma “أَمَّا”: أما, اما.

The 3rd highest number of forms (2) was observed with the lemma “إِلَّا”: إلا, الا.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 19 different relations: aux (1553; 41% instances), advmod (1319; 35% instances), advmod:emph (399; 11% instances), cc (215; 6% instances), aux:pass (113; 3% instances), mark (47; 1% instances), nmod (28; 1% instances), conj (27; 1% instances), obj (21; 1% instances), root (20; 1% instances), cop (14; 0% instances), dep (11; 0% instances), fixed (10; 0% instances), parataxis (8; 0% instances), case (5; 0% instances), advcl (2; 0% instances), nsubj (2; 0% instances), obl:arg (2; 0% instances), iobj (1; 0% instances)
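Each percentage in this list is the relation's count divided by the total number of PART tokens. The sketch below recomputes them from the counts above, assuming the shares are rounded to the nearest whole percent; the variable names are illustrative only.

```python
# Relation counts copied from the list above.
relation_counts = {
    "aux": 1553, "advmod": 1319, "advmod:emph": 399, "cc": 215,
    "aux:pass": 113, "mark": 47, "nmod": 28, "conj": 27, "obj": 21,
    "root": 20, "cop": 14, "dep": 11, "fixed": 10, "parataxis": 8,
    "case": 5, "advcl": 2, "nsubj": 2, "obl:arg": 2, "iobj": 1,
}

total = sum(relation_counts.values())  # should equal the PART token count
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
print(shares["aux"], shares["advmod"], shares["cop"])
```

The counts sum to 3797, the number of PART tokens, so every PART node is covered by exactly one incoming relation.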

Parents of PART nodes belong to 12 different parts of speech: VERB (3086; 81% instances), NOUN (201; 5% instances), ADJ (158; 4% instances), X (152; 4% instances), CCONJ (62; 2% instances), NUM (49; 1% instances), ADV (24; 1% instances), no POS tag, i.e. the artificial root node (20; 1% instances), PART (18; 0% instances), PRON (13; 0% instances), DET (12; 0% instances), AUX (2; 0% instances)

3384 (89%) PART nodes are leaves.

238 (6%) PART nodes have one child.

117 (3%) PART nodes have two children.

58 (2%) PART nodes have three or more children.

The highest child degree of a PART node is 10.
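The leaf and child-degree figures above come from counting, for every PART node, how many other nodes name it as their head. A minimal sketch of that bucketing for a single sentence follows. The function `child_degree_buckets` and the toy sentence are hypothetical, and nodes are simplified to (id, head, UPOS) triples.

```python
from collections import Counter


def child_degree_buckets(nodes, tag):
    """Bucket nodes with the given UPOS tag by number of children.

    nodes: list of (id, head, upos) triples for one sentence, head 0 = root.
    """
    n_children = Counter(head for _, head, _ in nodes)
    buckets = Counter()
    for node_id, _, upos in nodes:
        if upos != tag:
            continue
        degree = n_children[node_id]  # 0 for nodes that are never a head
        buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


# Hypothetical toy sentence, not taken from UD_Arabic.
toy = [
    (1, 2, "PART"),  # leaf attached to the verb
    (2, 0, "VERB"),  # root of the sentence
    (3, 2, "PART"),  # has one child (node 4)
    (4, 3, "NOUN"),
]
buckets = child_degree_buckets(toy, "PART")
print(dict(buckets))
```

On the figures reported above, the four buckets (3384, 238, 117 and 58) add up to the 3797 PART tokens.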

Children of PART nodes are attached using 25 different relations: nsubj (232; 33% instances), cc (123; 17% instances), appos (76; 11% instances), punct (60; 8% instances), obl (46; 7% instances), mark (30; 4% instances), conj (24; 3% instances), obl:arg (17; 2% instances), case (16; 2% instances), fixed (14; 2% instances), parataxis (14; 2% instances), advcl (12; 2% instances), nmod (12; 2% instances), advmod (6; 1% instances), dep (6; 1% instances), amod (4; 1% instances), advmod:emph (3; 0% instances), obj (3; 0% instances), csubj (2; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances), nummod (1; 0% instances)

Children of PART nodes belong to 12 different parts of speech: NOUN (277; 39% instances), CCONJ (141; 20% instances), PUNCT (60; 8% instances), X (60; 8% instances), PRON (40; 6% instances), VERB (37; 5% instances), ADJ (29; 4% instances), DET (21; 3% instances), PART (18; 3% instances), ADP (16; 2% instances), ADV (4; 1% instances), NUM (4; 1% instances)