home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Arabic-PADT: POS Tags: PRON

There are 1 PRON lemmas (0%), 52 PRON types (0%) and 10877 PRON tokens (4%). Out of 16 observed tags, the rank of PRON is: 16 in number of lemmas, 10 in number of types and 8 in number of tokens.

The 10 most frequent PRON lemmas: هُوَ

The 10 most frequent PRON types: ه، ها، هم، هو، نا، هي، هما، ك، ي، ني

The 10 most frequent ambiguous lemmas: هُوَ (PRON 10877, DET 1)

The 10 most frequent ambiguous types: ه (PRON 4088, DET 7), ها (PRON 3864, DET 4, PART 2), هم (PRON 1159, NOUN 2, X 1), هو (PRON 442, X 10), نا (PRON 392, DET 1), هما (PRON 219, X 13), ك (ADP 193, PRON 144, X 3), ي (PRON 84, DET 1), نحن (PRON 36, X 23), كم (PRON 27, DET 14, X 5)

Morphology

The form / lemma ratio of PRON is 52.000000 (the average of all parts of speech is 1.762014).

The 1st highest number of forms (52) was observed with the lemma “هُوَ”: أعضائها, أنا, أنت, أنتم, أهدافها, إدانته, إليها, استبعادهم, استعداداته, انا, انتشاره, انتم, بأنفسهم, بضمانها, بفقدانها, بلاده, بلادهم, بهم, بهويتها, تجارتها, تجميدها, تجهيزه, تخصيصها, حكومته, زنزانته, شفائهم, طائرته, ك, كم, كما, لاراضيه, لمساعدتنا, لهم, مستشفياتها, مستقبله, مواجهتها, نا, نحن, نهايتها, ني, ه, ها, هم, هما, هن, هو, هى, هي, والده, وغربه, وهي, ي.

PRON occurs with 5 features: Case (10877; 100% instances), Gender (10877; 100% instances), Number (10877; 100% instances), Person (10877; 100% instances), PronType (10877; 100% instances)

PRON occurs with 12 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Prs

PRON occurs with 32 feature combinations. The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|Person=3|PronType=Prs (2966 tokens). Examples: ها، أعضائها، أهدافها، إليها، بضمانها، بفقدانها، بهويتها، تجارتها، تجميدها، تخصيصها

Relations

PRON nodes are attached to their parents using 19 different relations: nmod (5976; 55% instances), obj (1334; 12% instances), nsubj (1289; 12% instances), obl:arg (1079; 10% instances), obl (666; 6% instances), fixed (366; 3% instances), conj (67; 1% instances), iobj (56; 1% instances), nsubj:pass (15; 0% instances), root (9; 0% instances), dislocated (4; 0% instances), parataxis (3; 0% instances), xcomp (3; 0% instances), appos (2; 0% instances), cop (2; 0% instances), dep (2; 0% instances), det (2; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances)

Parents of PRON nodes belong to 12 different parts of speech: NOUN (6224; 57% instances), VERB (3252; 30% instances), ADJ (492; 5% instances), CCONJ (401; 4% instances), X (194; 2% instances), NUM (107; 1% instances), DET (78; 1% instances), PART (52; 0% instances), ADP (48; 0% instances), PRON (14; 0% instances), (9; 0% instances), ADV (6; 0% instances)

8205 (75%) PRON nodes are leaves.

2201 (20%) PRON nodes have one child.

266 (2%) PRON nodes have two children.

205 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 26 different relations: case (2322; 66% instances), punct (412; 12% instances), nsubj (268; 8% instances), nmod (115; 3% instances), cc (105; 3% instances), obl (60; 2% instances), conj (46; 1% instances), mark (42; 1% instances), amod (31; 1% instances), nummod (22; 1% instances), xcomp (21; 1% instances), fixed (16; 0% instances), obl:arg (14; 0% instances), acl (12; 0% instances), advmod:emph (11; 0% instances), advcl (10; 0% instances), csubj (9; 0% instances), appos (7; 0% instances), det (5; 0% instances), advmod (3; 0% instances), cop (3; 0% instances), dep (3; 0% instances), dislocated (3; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (2322; 66% instances), PUNCT (412; 12% instances), NOUN (366; 10% instances), CCONJ (157; 4% instances), NUM (78; 2% instances), X (64; 2% instances), ADJ (49; 1% instances), VERB (49; 1% instances), PRON (14; 0% instances), PART (13; 0% instances), DET (11; 0% instances), ADV (7; 0% instances), AUX (2; 0% instances)