home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Gheg-GPS: POS Tags: PRON

There are 178 PRON lemmas (16%), 269 PRON types (10%) and 2898 PRON tokens (18%). Out of 15 observed tags, the rank of PRON is: 3 in number of lemmas, 4 in number of types and 1 in number of tokens.

The 10 most frequent PRON lemmas: i, e, aj, j, a, ata, që, at, ato, ky

The 10 most frequent PRON types: i, e, aj, j, a, ata, at, qe, ky, që

The 10 most frequent ambiguous lemmas: i (PRON 548, DET 69), e (PRON 422, CCONJ 135, DET 99, INTJ 1), a (PRON 199, CCONJ 33, PART 10, ADV 1, DET 1, INTJ 1, SCONJ 1), (SCONJ 131, PRON 122, ADV 1), vet (PRON 30, ADV 1), (PRON 29, ADV 5), u (AUX 128, PRON 26), do (PRON 25, AUX 3, PART 2), krejt (PRON 23, ADV 1), qysh (PRON 15, ADV 6)

The 10 most frequent ambiguous types: i (PRON 530, DET 61), e (PRON 417, CCONJ 129, DET 92, INTJ 2), a (PRON 184, CCONJ 32, PART 10, ADV 1, AUX 1, DET 1, INTJ 1, SCONJ 1), qe (PRON 64, SCONJ 44, ADV 1), (SCONJ 61, PRON 53), m (PART 45, PRON 29), u (AUX 170, PRON 26), vet (PRON 25, ADV 1, NOUN 1, VERB 1), ati (PRON 22, ADV 5), do (VERB 45, PRON 22, AUX 3, PART 1)

Morphology

The form / lemma ratio of PRON is 1.511236 (the average of all parts of speech is 2.539450).

The 1st highest number of forms (8) was observed with the lemma “ai”: a:i, a:i:, ai, ai:, ati:j, atij, ti:j, tij.

The 2nd highest number of forms (7) was observed with the lemma “ajo”: ajo, ajo/, ajo:, asa:j, asaj, atë, atë:.

The 3rd highest number of forms (5) was observed with the lemma “ato”: ato, ato:, aty:re, atyre, to.

PRON occurs with 7 features: Number (2492; 86% instances), Case (2483; 86% instances), Person (1786; 62% instances), PronType (1219; 42% instances), Gender (966; 33% instances), Reflex (55; 2% instances), Foreign (5; 0% instances)

PRON occurs with 19 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, Reflex=Yes

PRON occurs with 164 feature combinations. The most frequent feature combination is Case=Acc|Number=Sing|Person=3 (619 tokens). Examples: e, a, i, a:, u, e:, a/, v, a:/, ëh

Relations

PRON nodes are attached to their parents using 18 different relations: expl (800; 28% instances), obj (566; 20% instances), det (555; 19% instances), nsubj (514; 18% instances), iobj (215; 7% instances), nmod (96; 3% instances), obl (72; 2% instances), reparandum (41; 1% instances), amod (9; 0% instances), root (9; 0% instances), conj (5; 0% instances), ccomp (4; 0% instances), xcomp (4; 0% instances), acl (3; 0% instances), orphan (2; 0% instances), appos (1; 0% instances), dislocated (1; 0% instances), fixed (1; 0% instances)

Parents of PRON nodes belong to 12 different parts of speech: VERB (2130; 73% instances), NOUN (648; 22% instances), PRON (50; 2% instances), AUX (17; 1% instances), NUM (13; 0% instances), ADJ (12; 0% instances), ADV (12; 0% instances), (9; 0% instances), INTJ (3; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances)

2617 (90%) PRON nodes are leaves.

231 (8%) PRON nodes have one child.

26 (1%) PRON nodes have two children.

24 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 9.

Children of PRON nodes are attached using 22 different relations: det (99; 27% instances), case (66; 18% instances), nmod (39; 11% instances), reparandum (30; 8% instances), cc (19; 5% instances), discourse (17; 5% instances), acl (15; 4% instances), advmod (12; 3% instances), amod (9; 2% instances), punct (9; 2% instances), conj (8; 2% instances), mark (8; 2% instances), nummod (8; 2% instances), cop (7; 2% instances), nsubj (5; 1% instances), parataxis (5; 1% instances), obj (4; 1% instances), aux (3; 1% instances), advcl (2; 1% instances), expl (2; 1% instances), orphan (2; 1% instances), ccomp (1; 0% instances)

Children of PRON nodes belong to 14 different parts of speech: DET (98; 26% instances), ADP (62; 17% instances), PRON (50; 14% instances), NOUN (31; 8% instances), VERB (27; 7% instances), ADV (19; 5% instances), CCONJ (19; 5% instances), INTJ (16; 4% instances), AUX (10; 3% instances), NUM (10; 3% instances), ADJ (9; 2% instances), PUNCT (9; 2% instances), PART (7; 2% instances), SCONJ (3; 1% instances)