Treebank Statistics: UD_Albanian-TSA: POS Tags: PROPN
There are 18 PROPN lemmas (4%), 19 PROPN types (4%) and 20 PROPN tokens (2%).
Out of 14 observed tags, the rank of PROPN is: 6 in number of lemmas, 6 in number of types and 12 in number of tokens.
The 10 most frequent PROPN lemmas: Dju, Shqipëri, Britani, Evropë, Japoni, Kinë, Kore, Mani, Nors, Ruso
The 10 most frequent PROPN types: Shqipëri, Bashkimit, Britania, Djui, Djuin, Evropës, Homo, Japoninë, Kinës, Korenë
The 10 most frequent ambiguous lemmas: bashkim (NOUN 1, PROPN 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN is 1.055556 (the average of all parts of speech is 1.167464).
The highest number of forms (2) was observed with the lemma “Dju”: Djui, Djuin.
The 2nd highest number of forms (1) was observed with the lemma “Britani”: Britania.
The 3rd highest number of forms (1) was observed with the lemma “Evropë”: Evropës.
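The ratio above can be reproduced with a short sketch. It assumes the ratio is simply the number of distinct forms divided by the number of distinct lemmas, which agrees with the reported 19 types over 18 lemmas. The function name `form_lemma_ratio` is illustrative, and the sample contains only the four form/lemma pairs named in this section, not the whole treebank.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Return (ratio, forms_by_lemma) for an iterable of (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    # Count distinct forms per lemma, then divide by the number of lemmas.
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), dict(forms_by_lemma)

# Subsample only: the four (form, lemma) pairs listed in this section.
sample = [
    ("Djui", "Dju"),
    ("Djuin", "Dju"),
    ("Britania", "Britani"),
    ("Evropës", "Evropë"),
]
sample_ratio, sample_forms = form_lemma_ratio(sample)

# Reported totals for all PROPN tokens: 19 types over 18 lemmas.
reported_ratio = round(19 / 18, 6)
```

On the subsample the ratio is higher than the reported figure because “Dju”, the only lemma with two forms, makes up a third of the sample.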
PROPN occurs with 4 features: Case (15; 75% instances), Definite (15; 75% instances), Gender (15; 75% instances), Number (15; 75% instances)
PROPN occurs with 9 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing
PROPN occurs with 9 feature combinations.
The most frequent feature combination is _ (5 tokens).
Examples: Homo, Shakespeare, Shpëtim, William, Çuçka
Relations
PROPN nodes are attached to their parents using 7 different relations: nmod:poss (5; 25% instances), flat (4; 20% instances), nsubj (4; 20% instances), nmod (3; 15% instances), obl (2; 10% instances), appos (1; 5% instances), conj (1; 5% instances)
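The percentages in this list are each relation's count divided by the 20 PROPN tokens. A minimal sketch of that arithmetic follows, using the counts exactly as reported above; the helper name `shares` is illustrative.

```python
# Counts of incoming relations for PROPN nodes, copied from the list above.
relation_counts = {
    "nmod:poss": 5,
    "flat": 4,
    "nsubj": 4,
    "nmod": 3,
    "obl": 2,
    "appos": 1,
    "conj": 1,
}

def shares(counts):
    """Convert absolute counts into whole-number percentages of the total."""
    total = sum(counts.values())
    return {label: round(100 * n / total) for label, n in counts.items()}

relation_shares = shares(relation_counts)
```

The counts sum to 20, so every PROPN token is accounted for by exactly one incoming relation.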
Parents of PROPN nodes belong to 5 different parts of speech: NOUN (8; 40% instances), PROPN (5; 25% instances), VERB (3; 15% instances), ADJ (2; 10% instances), DET (2; 10% instances)
6 (30%) PROPN nodes are leaves.
10 (50%) PROPN nodes have one child.
3 (15%) PROPN nodes have two children.
1 (5%) PROPN node has three or more children.
The highest child degree of a PROPN node is 4.
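Child degrees like these can be derived from the head column of a dependency tree. The sketch below is one possible way to do it, not the tool that generated this page. It assumes each token is a hypothetical `(id, form, upos, head)` tuple within a single sentence, and the two-token tree is an illustrative toy, not a sentence from the treebank.

```python
from collections import Counter

def child_degree_distribution(tokens, upos="PROPN"):
    """Map child degree -> number of nodes with that degree, for one sentence."""
    # Count how many tokens point at each head id.
    children = Counter(head for _id, _form, _tag, head in tokens)
    # Look up the child count of every node carrying the requested tag.
    return Counter(
        children[tok_id] for tok_id, _form, tag, _head in tokens if tag == upos
    )

# Toy tree (assumed structure): "Shakespeare" attached to "William" via flat.
toy_sentence = [
    (1, "William", "PROPN", 0),
    (2, "Shakespeare", "PROPN", 1),
]
toy_distribution = child_degree_distribution(toy_sentence)

# Reported distribution: the single node with three or more children must be
# the one with the highest degree, 4.
reported = {0: 6, 1: 10, 2: 3, 4: 1}
total_children = sum(degree * n for degree, n in reported.items())
```

The reported distribution implies 20 children in total, which matches the sum of the child relation counts listed for PROPN nodes.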
Children of PROPN nodes are attached using 8 different relations: case (5; 25% instances), det (5; 25% instances), flat (4; 20% instances), amod (2; 10% instances), cc (1; 5% instances), conj (1; 5% instances), nmod (1; 5% instances), punct (1; 5% instances)
Children of PROPN nodes belong to 7 different parts of speech: ADP (5; 25% instances), DET (5; 25% instances), PROPN (5; 25% instances), ADJ (2; 10% instances), CCONJ (1; 5% instances), NOUN (1; 5% instances), PUNCT (1; 5% instances)