
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PRON

There is 1 PRON lemma (0%), with 51 PRON types (0%) and 9991 PRON tokens (4%). Out of 16 observed tags, the rank of PRON is: 16 in number of lemmas, 9 in number of types and 8 in number of tokens.

The most frequent PRON lemmas (only one is observed): هُوَ

The 10 most frequent PRON types: ه، ها، هم، هو، هي، ك، هما، نا، هن، كم

The 10 most frequent ambiguous lemmas: (none)

The 10 most frequent ambiguous types: ها (PRON 3751, PART 2), هم (PRON 1100, NOUN 2, X 1), هو (PRON 442, X 10), ك (ADP 170, PRON 143, X 3), هما (PRON 89, X 13), كم (PRON 26, DET 14, X 5), نحن (X 23, PRON 11), أنا (X 5, PRON 3), أعضائها (NOUN 1, PRON 1), أهدافها (NOUN 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 51.000000 (the average of all parts of speech is 1.685281).
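The ratio above can be reproduced approximately from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: it assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas for one UPOS tag, and that forms are compared without case folding. The function name `form_lemma_ratio` and the toy `SAMPLE` sentence are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(conllu_text, upos):
    """Distinct forms per lemma for one UPOS tag (assumed definition)."""
    forms_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines: 10 columns and a plain integer ID.
        # This skips comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


# Toy data (hypothetical): three PRON tokens, two distinct forms, one lemma.
SAMPLE = "\n".join([
    "# text = toy example",
    "\t".join(["1", "هو", "هُوَ", "PRON", "_", "_", "0", "root", "_", "_"]),
    "\t".join(["2", "هي", "هُوَ", "PRON", "_", "_", "1", "conj", "_", "_"]),
    "\t".join(["3", "هو", "هُوَ", "PRON", "_", "_", "1", "conj", "_", "_"]),
])

ratio = form_lemma_ratio(SAMPLE, "PRON")
```

Under this definition, the treebank figures give 51 forms / 1 lemma = 51, which matches the reported value.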

The highest number of forms (51) was observed with the lemma “هُوَ”: أعضائها, أنا, أنت, أنتم, أهدافها, إدانته, إليها, استبعادهم, استعداداته, انا, انتشاره, انتم, بأنفسهم, بضمانها, بفقدانها, بلاده, بلادهم, بهم, بهويتها, تجارتها, تجميدها, تجهيزه, تخصيصها, حكومته, زنزانته, شفائهم, طائرته, ك, كم, لاراضيه, لمساعدتنا, لهم, مستشفياتها, مستقبله, مواجهتها, نا, نحن, نهايتها, ني, ه, ها, هم, هما, هن, هو, هى, هي, والده, وغربه, وهي, ي.

PRON occurs with 5 features: Case (9991; 100% instances), Gender (9991; 100% instances), Number (9991; 100% instances), Person (9991; 100% instances), PronType (9991; 100% instances)

PRON occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Prs

PRON occurs with 31 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs (2871 tokens). Examples: ها، أعضائها، أهدافها، إليها، بضمانها، بفقدانها، بهويتها، تجارتها، تجميدها، تخصيصها
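Feature combinations like the ones above can be tallied from the FEATS column of a CoNLL-U file. This is a hedged sketch under the assumption that a "feature combination" is the full FEATS string of a token; `feature_combinations` and `SAMPLE` are hypothetical names, and the sample rows are invented for illustration.

```python
from collections import Counter


def feature_combinations(conllu_text, upos):
    """Count full FEATS strings for tokens with the given UPOS tag."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[5]] += 1  # column 6 (index 5) is FEATS
    return counts


GEN_FEM = "Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs"
NOM_MASC = "Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs"

# Toy data (hypothetical): two genitive feminine pronouns, one nominative.
SAMPLE = "\n".join([
    "\t".join(["1", "ها", "هُوَ", "PRON", "_", GEN_FEM, "0", "root", "_", "_"]),
    "\t".join(["2", "ها", "هُوَ", "PRON", "_", GEN_FEM, "1", "conj", "_", "_"]),
    "\t".join(["3", "هو", "هُوَ", "PRON", "_", NOM_MASC, "1", "conj", "_", "_"]),
])

combos = feature_combinations(SAMPLE, "PRON")
most_frequent = combos.most_common(1)[0]
```

Here `len(combos)` gives the number of distinct combinations and `most_common(1)` the most frequent one, which correspond to the two figures reported in the paragraph above.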

Relations

PRON nodes are attached to their parents using 23 different relations: nmod (5055; 51% instances), obj (1340; 13% instances), nsubj (1204; 12% instances), obl:arg (1013; 10% instances), obl (490; 5% instances), fixed (365; 4% instances), cc (305; 3% instances), conj (59; 1% instances), iobj (46; 0% instances), advmod:emph (18; 0% instances), case (15; 0% instances), cop (15; 0% instances), advmod (14; 0% instances), nsubj:pass (14; 0% instances), mark (13; 0% instances), root (9; 0% instances), nummod (5; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), dep (2; 0% instances), det (2; 0% instances), parataxis (2; 0% instances), amod (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: NOUN (5592; 56% instances), VERB (3040; 30% instances), ADJ (469; 5% instances), CCONJ (387; 4% instances), X (169; 2% instances), DET (111; 1% instances), NUM (104; 1% instances), ADP (50; 1% instances), PART (40; 0% instances), PRON (12; 0% instances), [no tag; artificial root node] (9; 0% instances), ADV (6; 0% instances), AUX (2; 0% instances)

7493 (75%) PRON nodes are leaves.

2060 (21%) PRON nodes have one child.

253 (3%) PRON nodes have two children.

185 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.
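The child-degree figures above can be derived by counting, for each PRON node, how many tokens in the same sentence name it as their HEAD. The sketch below assumes sentences are separated by blank lines, as in CoNLL-U, and ignores multiword ranges and empty nodes; `child_degree_histogram` and `SAMPLE` are hypothetical.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes with the given UPOS tag."""
    histogram = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        tags = {}             # token ID -> UPOS
        children = Counter()  # head ID -> number of dependents
        for line in sentence.splitlines():
            cols = line.split("\t")
            if len(cols) != 10 or not cols[0].isdigit():
                continue
            tags[cols[0]] = cols[3]
            children[cols[6]] += 1  # column 7 (index 6) is HEAD
        for token_id, tag in tags.items():
            if tag == upos:
                histogram[children[token_id]] += 1
    return histogram


# Toy data (hypothetical): one leaf pronoun and one pronoun with two children.
SAMPLE = "\n\n".join([
    "\n".join([
        "\t".join(["1", "كتاب", "كِتَاب", "NOUN", "_", "_", "0", "root", "_", "_"]),
        "\t".join(["2", "ها", "هُوَ", "PRON", "_", "_", "1", "nmod", "_", "_"]),
    ]),
    "\n".join([
        "\t".join(["1", "في", "فِي", "ADP", "_", "_", "2", "case", "_", "_"]),
        "\t".join(["2", "ها", "هُوَ", "PRON", "_", "_", "0", "root", "_", "_"]),
        "\t".join(["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"]),
    ]),
])

degrees = child_degree_histogram(SAMPLE, "PRON")
```

As a consistency check on the reported figures, the four buckets sum to the PRON token count: 7493 + 2060 + 253 + 185 = 9991.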

Children of PRON nodes are attached using 27 different relations: case (2204; 67% instances), punct (353; 11% instances), nsubj (256; 8% instances), cc (100; 3% instances), cop (58; 2% instances), nmod (55; 2% instances), obl (52; 2% instances), mark (45; 1% instances), conj (37; 1% instances), amod (28; 1% instances), xcomp (17; 1% instances), fixed (16; 0% instances), acl (13; 0% instances), obl:arg (13; 0% instances), advmod:emph (11; 0% instances), advcl (10; 0% instances), appos (10; 0% instances), csubj (8; 0% instances), advmod (5; 0% instances), det (5; 0% instances), nummod (5; 0% instances), ccomp (1; 0% instances), dep (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (2191; 66% instances), PUNCT (353; 11% instances), NOUN (335; 10% instances), CCONJ (147; 4% instances), NUM (80; 2% instances), X (67; 2% instances), ADJ (45; 1% instances), VERB (45; 1% instances), PART (13; 0% instances), DET (12; 0% instances), PRON (12; 0% instances), ADV (4; 0% instances), AUX (3; 0% instances)