
This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PRON

There is 1 PRON lemma (0%), 52 PRON types (0%) and 10877 PRON tokens (4%). Out of 17 observed tags, PRON ranks 17th in number of lemmas, 10th in number of types and 8th in number of tokens.
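The lemma, type and token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page; it assumes that a "type" is a distinct word form and that multiword token ranges and empty nodes are skipped. The function name `pos_counts` and the sample rows are illustrative only (the noun row uses placeholder strings).

```python
def pos_counts(conllu_text, upos):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        if cols[3] == upos:
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens


# Illustrative rows: ID, FORM, LEMMA, UPOS (remaining columns are filled with "_").
ROWS = [
    ("1", "noun_form", "noun_lemma", "NOUN"),
    ("2", "ه", "هُوَ", "PRON"),
    ("3", "ها", "هُوَ", "PRON"),
    ("4", "ه", "هُوَ", "PRON"),
]
SAMPLE = "\n".join("\t".join(row + ("_",) * 6) for row in ROWS)

print(pos_counts(SAMPLE, "PRON"))
```

On the real treebank the same call would be expected to give one lemma, 52 types and 10877 tokens for PRON, provided the counting rules assumed here match those of the original script.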

The only PRON lemma: هُوَ

The 10 most frequent PRON types: ه، ها، هم، هو، نا، هي، هما، ك، ي، ني

The 10 most frequent ambiguous lemmas: هُوَ (PRON 10877, DET 1)

The 10 most frequent ambiguous types: ه (PRON 4088, DET 7), ها (PRON 3864, DET 4, PART 2), هم (PRON 1159, NOUN 2, X 1), هو (PRON 442, X 10), نا (PRON 392, DET 1), هما (PRON 219, X 13), ك (ADP 193, PRON 144, X 3), ي (PRON 84, DET 1), نحن (PRON 36, X 23), كم (PRON 27, DET 14, X 5)

Morphology

The form / lemma ratio of PRON is 52.000000 (the average of all parts of speech is 1.761966).
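For PRON the ratio is simply 52 forms divided by 1 lemma. A minimal sketch of the computation follows, assuming the ratio is the number of distinct (form, lemma) pairs divided by the number of distinct lemmas; the exact definition used to generate this page may differ, and `form_lemma_ratio` is a hypothetical name.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma.

    pairs: iterable of (form, lemma) tuples for one part of speech.
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Three tokens, two distinct forms, one lemma.
pairs = [("ه", "هُوَ"), ("ها", "هُوَ"), ("ه", "هُوَ")]
print(form_lemma_ratio(pairs))
```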

The highest number of forms (52) was observed with the lemma “هُوَ”: أعضائها, أنا, أنت, أنتم, أهدافها, إدانته, إليها, استبعادهم, استعداداته, انا, انتشاره, انتم, بأنفسهم, بضمانها, بفقدانها, بلاده, بلادهم, بهم, بهويتها, تجارتها, تجميدها, تجهيزه, تخصيصها, حكومته, زنزانته, شفائهم, طائرته, ك, كم, كما, لاراضيه, لمساعدتنا, لهم, مستشفياتها, مستقبله, مواجهتها, نا, نحن, نهايتها, ني, ه, ها, هم, هما, هن, هو, هى, هي, والده, وغربه, وهي, ي.

PRON occurs with 5 features: Case (10877; 100% instances), Gender (10877; 100% instances), Number (10877; 100% instances), Person (10877; 100% instances), PronType (10877; 100% instances)

PRON occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Prs

PRON occurs with 32 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs (2966 tokens). Examples: ها، أعضائها، أهدافها، إليها، بضمانها، بفقدانها، بهويتها، تجارتها، تجميدها، تخصيصها
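The three feature counts above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of CoNLL-U. The sketch below assumes the standard `Name=Value|Name=Value` encoding with `_` for "no features"; `feature_stats` and the sample values are illustrative, not taken from the generating script.

```python
from collections import Counter


def feature_stats(feats_values):
    """Count feature names, feature-value pairs and full combinations.

    feats_values: iterable of FEATS strings, one per token.
    """
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1
        if feats == "_":
            continue  # token carries no features
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


feats_values = [
    "Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs",
    "Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
]
names, pairs, combos = feature_stats(feats_values)
print(len(names), len(pairs), len(combos))
print(combos.most_common(1))
```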

Relations

PRON nodes are attached to their parents using 18 different relations: nmod (5866; 54% instances), obj (1326; 12% instances), nsubj (1290; 12% instances), obl:arg (1202; 11% instances), obl (686; 6% instances), fixed (342; 3% instances), conj (67; 1% instances), iobj (56; 1% instances), nsubj:pass (15; 0% instances), root (8; 0% instances), dislocated (4; 0% instances), det (3; 0% instances), parataxis (3; 0% instances), xcomp (3; 0% instances), appos (2; 0% instances), dep (2; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances)
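As a consistency check, the relation counts listed above add up to the 10877 PRON tokens, and the percentages follow from rounding each share to the nearest whole percent. The sketch below uses only the counts from this page; the variable names are illustrative.

```python
# Relation counts for PRON nodes, copied from the list above.
relation_counts = {
    "nmod": 5866, "obj": 1326, "nsubj": 1290, "obl:arg": 1202, "obl": 686,
    "fixed": 342, "conj": 67, "iobj": 56, "nsubj:pass": 15, "root": 8,
    "dislocated": 4, "det": 3, "parataxis": 3, "xcomp": 3, "appos": 2,
    "dep": 2, "amod": 1, "ccomp": 1,
}

total = sum(relation_counts.values())

# Share of each relation, rounded to a whole percent.
percent = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)
print(percent["nmod"], percent["obj"], percent["nsubj"], percent["obl:arg"])
```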

Parents of PRON nodes belong to 13 different parts of speech: NOUN (6242; 57% instances), VERB (3283; 30% instances), ADJ (493; 5% instances), SCONJ (340; 3% instances), X (206; 2% instances), NUM (108; 1% instances), DET (82; 1% instances), PART (52; 0% instances), ADP (24; 0% instances), CCONJ (20; 0% instances), PRON (13; 0% instances), no part of speech, presumably the artificial root node (8; 0% instances), ADV (6; 0% instances)

8202 (75%) PRON nodes are leaves.

2196 (20%) PRON nodes have one child.

279 (3%) PRON nodes have two children.

200 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.
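The leaf and child-degree figures above come from counting, for each PRON node, how many other nodes name it as their head. A minimal sketch for a single sentence follows, assuming nodes are given as (ID, HEAD, UPOS) triples; `child_degree_histogram` and the sample sentence are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """Map number of children -> number of nodes with that UPOS tag.

    sentence: list of (node_id, head_id, upos) triples; head_id 0 is the root.
    """
    # How many nodes point to each head.
    n_children = Counter(head for _, head, _ in sentence)
    hist = Counter()
    for node_id, _, tag in sentence:
        if tag == upos:
            hist[n_children[node_id]] += 1  # missing key counts as 0 children
    return hist


# Made-up sentence: node 2 is a leaf pronoun, node 4 is a pronoun with one ADP child.
sentence = [(1, 0, "NOUN"), (2, 1, "PRON"), (3, 4, "ADP"), (4, 1, "PRON")]
hist = child_degree_histogram(sentence, "PRON")
print(dict(hist), max(hist))
```

Summing such histograms over all sentences of the treebank would give the leaf count (degree 0), the one-child and two-child counts, and the highest child degree.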

Children of PRON nodes are attached using 27 different relations: case (2331; 66% instances), punct (409; 12% instances), nsubj (265; 7% instances), nmod (115; 3% instances), cc (105; 3% instances), obl (59; 2% instances), conj (46; 1% instances), mark (44; 1% instances), amod (32; 1% instances), xcomp (31; 1% instances), nummod (22; 1% instances), obl:arg (14; 0% instances), advcl (10; 0% instances), advmod:emph (9; 0% instances), csubj (9; 0% instances), acl (8; 0% instances), appos (7; 0% instances), fixed (6; 0% instances), det (5; 0% instances), acl:relcl (4; 0% instances), advmod (3; 0% instances), dep (3; 0% instances), dislocated (3; 0% instances), cop (2; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: ADP (2323; 66% instances), PUNCT (409; 12% instances), NOUN (370; 10% instances), CCONJ (112; 3% instances), NUM (78; 2% instances), X (64; 2% instances), VERB (50; 1% instances), ADJ (49; 1% instances), SCONJ (43; 1% instances), DET (13; 0% instances), PART (13; 0% instances), PRON (13; 0% instances), ADV (7; 0% instances), AUX (2; 0% instances)