home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Albanian-STAF: POS Tags: PROPN

There are 24 PROPN lemmas (2%), 30 PROPN types (2%) and 39 PROPN tokens (1%). Out of 15 observed tags, the rank of PROPN is: 9 in number of lemmas, 7 in number of types and 13 in number of tokens.

The 10 most frequent PROPN lemmas: Ernest, Shqipëri, Vedat, margë, Hadi, Bamit, Berti, Dizi, Dizin, Ernesti

The 10 most frequent PROPN types: Ernesti, Ernestit, Shqipëri, Linda, Vedati, Bamit, Berti, Dizi, Dizin, Dizit

The 10 most frequent ambiguous lemmas: Hadi (PROPN 2, NOUN 1)

The 10 most frequent ambiguous types: Hadi (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.250000 (the average of all parts of speech is 1.223770).

The 1st highest number of forms (3) was observed with the lemma “Ernest”: Ernest, Ernesti, Ernestit.

The 2nd highest number of forms (3) was observed with the lemma “Vedat”: Vedat, Vedati, Vedatit.

The 3rd highest number of forms (2) was observed with the lemma “Hadi”: Hadi, Hadin.

PROPN occurs with 4 features: Case (32; 82% instances), Gender (31; 79% instances), Number (31; 79% instances), Definite (30; 77% instances)

PROPN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing

PROPN occurs with 14 feature combinations. The most frequent feature combination is Case=Nom|Definite=Def|Gender=Masc|Number=Sing (9 tokens). Examples: Ernesti, Vedati, Hadi, Linda, Parku, xhepi

Relations

PROPN nodes are attached to their parents using 10 different relations: obl (11; 28% instances), nsubj (10; 26% instances), nmod:poss (5; 13% instances), appos (3; 8% instances), obj (3; 8% instances), iobj (2; 5% instances), root (2; 5% instances), conj (1; 3% instances), flat (1; 3% instances), xcomp (1; 3% instances)

Parents of PROPN nodes belong to 6 different parts of speech: VERB (22; 56% instances), NOUN (9; 23% instances), ADJ (3; 8% instances), PRON (2; 5% instances), (2; 5% instances), PROPN (1; 3% instances)

18 (46%) PROPN nodes are leaves.

15 (38%) PROPN nodes have one child.

3 (8%) PROPN nodes have two children.

3 (8%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 8.

Children of PROPN nodes are attached using 13 different relations: case (14; 39% instances), det (6; 17% instances), punct (5; 14% instances), cop (2; 6% instances), acl:relcl (1; 3% instances), advmod (1; 3% instances), cc (1; 3% instances), conj (1; 3% instances), det:poss (1; 3% instances), flat (1; 3% instances), nmod:poss (1; 3% instances), obl:tmod (1; 3% instances), parataxis (1; 3% instances)

Children of PROPN nodes belong to 11 different parts of speech: ADP (13; 36% instances), DET (6; 17% instances), PUNCT (5; 14% instances), VERB (3; 8% instances), AUX (2; 6% instances), NOUN (2; 6% instances), ADV (1; 3% instances), CCONJ (1; 3% instances), PART (1; 3% instances), PRON (1; 3% instances), PROPN (1; 3% instances)