This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PRON

There is 1 PRON lemma (0%), 52 PRON types (0%) and 10877 PRON tokens (4%). Out of 16 observed tags, PRON ranks 16th in number of lemmas, 11th in number of types and 8th in number of tokens.

The 10 most frequent PRON lemmas: هُوَ

The 10 most frequent PRON types: ه، ها، هم، هو، نا، هي، هما، ك، ي، ني

The 10 most frequent ambiguous lemmas: هُوَ (PRON 10877, DET 1)

The 10 most frequent ambiguous types: ه (PRON 4088, DET 7), ها (PRON 3864, DET 4, PART 2), هم (PRON 1159, NOUN 2, X 1), هو (PRON 442, X 10), نا (PRON 392, DET 1), هما (PRON 219, X 13), ك (ADP 193, PRON 144, X 3), ي (PRON 84, DET 1), نحن (PRON 36, X 23), كم (PRON 27, DET 14, X 5)

Morphology

The form / lemma ratio of PRON is 52.000000 (the average of all parts of speech is 1.761701).
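The ratio reported here is consistent with dividing the 52 PRON types by the single PRON lemma. A minimal sketch of that computation follows, assuming the ratio is the number of distinct word forms divided by the number of distinct lemmas, with no case folding. The function name and the (form, lemma, tag) triple layout are hypothetical and are not taken from the script that generated this page.

```python
def form_lemma_ratio(tokens, upos):
    """Distinct forms divided by distinct lemmas for one POS tag.

    `tokens` is an iterable of (form, lemma, tag) triples (assumed layout).
    Returns 0.0 when the tag does not occur.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        return 0.0
    return len(forms) / len(lemmas)


# Toy input: two distinct PRON forms sharing one lemma, plus one ADP token.
toy_tokens = [
    ("ه", "هُوَ", "PRON"),
    ("ها", "هُوَ", "PRON"),
    ("ه", "هُوَ", "PRON"),
    ("في", "فِي", "ADP"),
]
print(form_lemma_ratio(toy_tokens, "PRON"))
```

With 52 distinct forms and 1 lemma, the same function would give the value reported for PRON.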

The highest number of forms (52) was observed with the lemma “هُوَ”: أعضائها, أنا, أنت, أنتم, أهدافها, إدانته, إليها, استبعادهم, استعداداته, انا, انتشاره, انتم, بأنفسهم, بضمانها, بفقدانها, بلاده, بلادهم, بهم, بهويتها, تجارتها, تجميدها, تجهيزه, تخصيصها, حكومته, زنزانته, شفائهم, طائرته, ك, كم, كما, لاراضيه, لمساعدتنا, لهم, مستشفياتها, مستقبله, مواجهتها, نا, نحن, نهايتها, ني, ه, ها, هم, هما, هن, هو, هى, هي, والده, وغربه, وهي, ي.

PRON occurs with 5 features: Case (10877; 100% instances), Gender (10877; 100% instances), Number (10877; 100% instances), Person (10877; 100% instances), PronType (10877; 100% instances)

PRON occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Prs

PRON occurs with 32 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs (2966 tokens). Examples: ها، أعضائها، أهدافها، إليها، بضمانها، بفقدانها، بهويتها، تجارتها، تجميدها، تخصيصها
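The three counts in this section (features, feature-value pairs, feature combinations) can all be derived from the FEATS strings of the PRON tokens. The sketch below assumes the standard CoNLL-U convention of `Name=Value` pairs joined by `|`, with `_` for an empty set. The helper names are hypothetical, and the sample strings are illustrative rather than drawn from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn 'Case=Gen|Number=Sing' into {'Case': 'Gen', 'Number': 'Sing'}."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def summarize_feats(feats_strings):
    """Count feature names, feature-value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_strings:
        combos[feats] += 1
        for name, value in parse_feats(feats).items():
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos


sample = [
    "Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs",
    "Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs",
    "Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs",
]
features, pairs, combos = summarize_feats(sample)
print(len(features), len(pairs), len(combos))
print(combos.most_common(1))
```

The number of keys in each counter corresponds to the "occurs with N ..." figures, and `most_common(1)` gives the most frequent combination.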

Relations

PRON nodes are attached to their parents using 24 different relations: nmod (5636; 52% instances), obj (1448; 13% instances), nsubj (1289; 12% instances), obl:arg (1079; 10% instances), obl (504; 5% instances), fixed (366; 3% instances), cc (307; 3% instances), conj (67; 1% instances), iobj (56; 1% instances), advmod:emph (18; 0% instances), cop (17; 0% instances), case (16; 0% instances), nsubj:pass (15; 0% instances), advmod (14; 0% instances), mark (13; 0% instances), root (9; 0% instances), appos (6; 0% instances), nummod (5; 0% instances), parataxis (3; 0% instances), ccomp (2; 0% instances), dep (2; 0% instances), det (2; 0% instances), xcomp (2; 0% instances), amod (1; 0% instances)
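The percentages in this list appear to be each relation's count divided by the 10877 PRON tokens, rounded to a whole percent. A short sketch under that assumption follows; the function name is hypothetical, and the rounding rule is inferred from the figures rather than documented.

```python
def percent_of_instances(count, total):
    """Share of `count` in `total`, rounded to the nearest whole percent."""
    return round(100 * count / total)


# Counts copied from the relation list; 10877 is the PRON token total.
total = 10877
for relation, count in [("nmod", 5636), ("obj", 1448), ("conj", 67), ("advmod:emph", 18)]:
    print(relation, percent_of_instances(count, total))
```

Under this reading, rare relations such as advmod:emph show as 0% because their share is below half a percent, not because they are absent.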

Parents of PRON nodes belong to 13 different parts of speech: NOUN (6203; 57% instances), VERB (3228; 30% instances), ADJ (499; 5% instances), CCONJ (402; 4% instances), X (191; 2% instances), DET (114; 1% instances), NUM (105; 1% instances), PART (53; 0% instances), ADP (51; 0% instances), PRON (14; 0% instances), no tag, i.e. the artificial root node (9; 0% instances), ADV (6; 0% instances), AUX (2; 0% instances)

8204 (75%) PRON nodes are leaves.

2200 (20%) PRON nodes have one child.

267 (2%) PRON nodes have two children.

206 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.
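The leaf and child-count figures above can be reproduced by counting, for each PRON node, how many nodes in the same sentence name it as their head. The sketch below assumes tab-separated CoNLL-U input with the standard ten columns (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC). The function name is hypothetical, and the sample sentence uses placeholder forms rather than treebank data.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes with the given UPOS tag."""
    hist = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if "-" in cols[0] or "." in cols[0]:
                continue  # skip multiword token ranges and empty nodes
            rows.append(cols)
        # HEAD is column 7 (index 6); count how often each ID is a head.
        n_children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:
                hist[n_children[cols[0]]] += 1
    return hist


sample = "\n".join("\t".join(row) for row in [
    ["1", "w1", "_", "VERB", "_", "_", "0", "root", "_", "_"],
    ["2", "w2", "_", "ADP", "_", "_", "3", "case", "_", "_"],
    ["3", "w3", "_", "PRON", "_", "_", "1", "obl:arg", "_", "_"],
    ["4", "w4", "_", "PRON", "_", "_", "1", "obj", "_", "_"],
])
hist = child_degree_histogram(sample, "PRON")
print(dict(hist), max(hist))
```

Here `hist[0]` is the number of leaves and `max(hist)` is the highest child degree. The four counts reported for PRON (8204, 2200, 267 and 206) sum to the 10877 PRON tokens.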

Children of PRON nodes are attached using 27 different relations: case (2338; 66% instances), punct (412; 12% instances), nsubj (273; 8% instances), cc (108; 3% instances), nmod (62; 2% instances), cop (59; 2% instances), obl (57; 2% instances), conj (46; 1% instances), mark (46; 1% instances), amod (29; 1% instances), xcomp (21; 1% instances), fixed (16; 0% instances), obl:arg (15; 0% instances), acl (13; 0% instances), advmod:emph (12; 0% instances), advcl (10; 0% instances), appos (10; 0% instances), csubj (9; 0% instances), advmod (7; 0% instances), det (5; 0% instances), nummod (5; 0% instances), ccomp (1; 0% instances), dep (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances), parataxis (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (2325; 65% instances), PUNCT (412; 12% instances), NOUN (368; 10% instances), CCONJ (159; 4% instances), NUM (80; 2% instances), X (67; 2% instances), VERB (50; 1% instances), ADJ (49; 1% instances), PRON (14; 0% instances), DET (13; 0% instances), PART (13; 0% instances), ADV (6; 0% instances), AUX (3; 0% instances)