home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-TSA: POS Tags: PRON

There are 14 PRON lemmas (3%), 28 PRON types (6%) and 53 PRON tokens (6%). Out of 14 observed tags, the rank of PRON is: 7 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PRON lemmas: ai, ky, kjo, disa, i, cili, ajo, ata, gjithë, tjetër

The 10 most frequent PRON types: disa, këto, Ata, e, gjitha, i, këtë, tij, Kjo, Ky

The 10 most frequent ambiguous lemmas: i (DET 99, PRON 5, CCONJ 1)

The 10 most frequent ambiguous types: e (DET 32, PRON 3, CCONJ 1), i (DET 18, PRON 3), u (AUX 7, PRON 1)

Morphology

The form / lemma ratio of PRON is 2.000000 (the average of all parts of speech is 1.167464).

The 1st highest number of forms (5) was observed with the lemma “ai”: Ata, ai, ato, e, tij.

The 2nd highest number of forms (4) was observed with the lemma “ky”: Ky, këto, këtyre, këtë.

The 3rd highest number of forms (3) was observed with the lemma “ata”: atyre, tyre, u.

PRON occurs with 6 features: Case (53; 100% instances), Gender (52; 98% instances), Number (52; 98% instances), PronType (48; 91% instances), Poss (6; 11% instances), Person (3; 6% instances)

PRON occurs with 18 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=3, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot

PRON occurs with 39 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Plur|PronType=Prs (4 tokens). Examples: Ata, Këto

Relations

PRON nodes are attached to their parents using 10 different relations: det (23; 43% instances), nsubj (9; 17% instances), expl (6; 11% instances), nmod:poss (6; 11% instances), iobj (2; 4% instances), nmod (2; 4% instances), obl (2; 4% instances), conj (1; 2% instances), obj (1; 2% instances), root (1; 2% instances)

Parents of PRON nodes belong to 5 different parts of speech: NOUN (31; 58% instances), VERB (19; 36% instances), ADJ (1; 2% instances), PRON (1; 2% instances), (1; 2% instances)

35 (66%) PRON nodes are leaves.

13 (25%) PRON nodes have one child.

3 (6%) PRON nodes have two children.

2 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 10 different relations: det:pron (12; 46% instances), case (3; 12% instances), cop (2; 8% instances), det (2; 8% instances), nsubj (2; 8% instances), cc (1; 4% instances), conj (1; 4% instances), nmod:poss (1; 4% instances), nummod (1; 4% instances), punct (1; 4% instances)

Children of PRON nodes belong to 8 different parts of speech: DET (14; 54% instances), ADP (3; 12% instances), NOUN (3; 12% instances), AUX (2; 8% instances), CCONJ (1; 4% instances), NUM (1; 4% instances), PRON (1; 4% instances), PUNCT (1; 4% instances)