
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PART

There are 23 PART lemmas (0%), 30 PART types (0%) and 3854 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 12 in number of lemmas, 13 in number of types and 11 in number of tokens.

The 10 most frequent PART lemmas: سَ، لَا، قَد، لَم، إِنَّ، إِلَّا، لَن، سَوفَ، أَمَّا، مَا

The 10 most frequent PART types: س، لا، قد، لم، إن، لن، إلا، سوف، الا، ما

The most frequent ambiguous lemmas (only 4 are listed): إِنَّ (CCONJ 945, PART 212), مَا (DET 1021, PART 67, INTJ 1), أَ (X 48, PART 10), لِ (ADP 6946, CCONJ 210, PART 1)

The 10 most frequent ambiguous types: س (PART 1074, X 2), لم (PART 483, X 3), إن (CCONJ 582, PART 182, X 3), إلا (PART 87, X 17), الا (PART 78, X 12, CCONJ 1), ما (DET 1025, PART 67, X 4, INTJ 1), ان (CCONJ 1956, PART 30, X 11, VERB 1), اما (PART 26, CCONJ 5), ال (PART 24, X 9), ل (ADP 6805, CCONJ 210, PART 24, X 2)

Morphology

The form / lemma ratio of PART is 1.304348 (the average of all parts of speech is 1.761701).
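
The PART ratio can be reproduced from the counts reported earlier on this page (30 types, 23 lemmas). The sketch below assumes the ratio is simply distinct forms divided by distinct lemmas; the page does not state the formula, and the function name is illustrative only.

```python
# Counts taken from the statistics on this page.
part_types = 30   # distinct PART word forms
part_lemmas = 23  # distinct PART lemmas


def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(part_types, part_lemmas)
print(f"{ratio:.6f}")  # 1.304348
```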

The highest number of forms (3) was observed with the lemma “اَل”: ال, الـ, الـــ.

The 2nd highest number of forms (2) was observed with the lemma “أَمَّا”: أما, اما.

The 3rd highest number of forms (2) was observed with the lemma “إِلَّا”: إلا, الا.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 19 different relations: aux (1589; 41% instances), advmod (1320; 34% instances), advmod:emph (410; 11% instances), cc (198; 5% instances), aux:pass (113; 3% instances), mark (50; 1% instances), conj (29; 1% instances), nmod (29; 1% instances), fixed (23; 1% instances), obj (21; 1% instances), root (20; 1% instances), cop (17; 0% instances), dep (11; 0% instances), parataxis (9; 0% instances), case (8; 0% instances), advcl (2; 0% instances), nsubj (2; 0% instances), obl:arg (2; 0% instances), iobj (1; 0% instances)
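
A relation distribution of this kind can be tallied from a CoNLL-U file by counting the DEPREL column for every word whose UPOS is PART. The following is a minimal sketch, not the script that generated this page; the function name `relation_counts` and the two-clause toy sentence (transliterated placeholder tokens with invented annotation) are assumptions for illustration.

```python
from collections import Counter


def relation_counts(conllu_text: str, upos: str = "PART") -> Counter:
    """Count the incoming relation (DEPREL) of every word with the given UPOS."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:       # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts


# Toy data, NOT taken from UD_Arabic-PADT.
rows = [
    ("1", "lam", "lam", "PART", "_", "_", "2", "advmod", "_", "_"),
    ("2", "yaktub", "kataba", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "sa", "sa", "PART", "_", "_", "4", "aux", "_", "_"),
    ("4", "yaktubu", "kataba", "VERB", "_", "_", "2", "conj", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

counts = relation_counts(sample)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {100 * n / total:.0f}% instances)")
```

Run over the real treebank files, the same loop would be expected to yield the figures listed above, provided the page counts words the same way (regular words only, no multiword ranges).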

Parents of PART nodes belong to 13 different parts of speech: VERB (3152; 82% instances), NOUN (198; 5% instances), ADJ (164; 4% instances), X (124; 3% instances), CCONJ (64; 2% instances), NUM (49; 1% instances), ADV (24; 1% instances), none, i.e. the artificial root (20; 1% instances), PART (18; 0% instances), ADP (13; 0% instances), DET (13; 0% instances), PRON (13; 0% instances), AUX (2; 0% instances)

3424 (89%) PART nodes are leaves.

249 (6%) PART nodes have one child.

122 (3%) PART nodes have two children.

59 (2%) PART nodes have three or more children.

The highest child degree of a PART node is 10.
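
The leaf and child-degree figures above amount to a histogram of how many dependents each PART node has. A minimal sketch of that computation over CoNLL-U text follows; the function name `child_degree_histogram`, the bucketing of "three or more" under key 3, and the toy sentence are assumptions for illustration, not the tooling behind this page.

```python
from collections import Counter


def child_degree_histogram(conllu_text: str, upos: str = "PART") -> Counter:
    """Histogram of child counts for nodes with the given UPOS.

    Keys 0, 1 and 2 are exact child counts; key 3 means "three or more".
    """
    hist = Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        tags = {}             # word ID -> UPOS
        children = Counter()  # head ID -> number of dependents
        for line in block.splitlines():
            if line.startswith("#"):
                continue
            cols = line.split("\t")
            if len(cols) != 10 or not cols[0].isdigit():
                continue  # skip multiword token ranges and empty nodes
            tags[cols[0]] = cols[3]
            children[cols[6]] += 1  # column 7 is HEAD
        for node_id, tag in tags.items():
            if tag == upos:
                hist[min(children[node_id], 3)] += 1
    return hist


# Toy data, NOT taken from UD_Arabic-PADT.
rows = [
    ("1", "amma", "amma", "PART", "_", "_", "0", "root", "_", "_"),
    ("2", "kitab", "kitab", "NOUN", "_", "_", "1", "nsubj", "_", "_"),
    ("3", "la", "la", "PART", "_", "_", "4", "advmod", "_", "_"),
    ("4", "yaktub", "kataba", "VERB", "_", "_", "1", "parataxis", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

hist = child_degree_histogram(sample)
print(hist[0], hist[1], hist[2], hist[3])  # leaves, one child, two children, three or more
```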

Children of PART nodes are attached using 25 different relations: nsubj (246; 34% instances), cc (126; 17% instances), appos (76; 10% instances), punct (60; 8% instances), obl (50; 7% instances), mark (30; 4% instances), conj (24; 3% instances), fixed (18; 2% instances), obl:arg (18; 2% instances), case (16; 2% instances), parataxis (14; 2% instances), advcl (12; 2% instances), nmod (12; 2% instances), dep (6; 1% instances), amod (4; 1% instances), obj (4; 1% instances), advmod:emph (3; 0% instances), advmod (2; 0% instances), csubj (2; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), cop (1; 0% instances), nummod (1; 0% instances)

Children of PART nodes belong to 12 different parts of speech: NOUN (289; 40% instances), CCONJ (146; 20% instances), PUNCT (60; 8% instances), PRON (53; 7% instances), X (46; 6% instances), VERB (38; 5% instances), ADJ (29; 4% instances), DET (22; 3% instances), ADP (20; 3% instances), PART (18; 2% instances), ADV (5; 1% instances), NUM (4; 1% instances)