
This page pertains to UD version 2.

Treebank Statistics: UD_Azerbaijani-TueCL: POS Tags: PROPN

There are 17 PROPN lemmas (6%), 19 PROPN types (4%) and 88 PROPN tokens (10%). Out of 15 observed tags, the rank of PROPN is: 5 in number of lemmas, 6 in number of types and 4 in number of tokens.

The 10 most frequent PROPN lemmas: Deniz, Ayhan, Peter, Ayşə, Mary, Fəransə, Paris, Brown, Fərvərdin, Hakan

The 10 most frequent PROPN types: Deniz, Ayhana, Denizin, Ayşə, Mary, Fəransənin, Parisdə, Peter, Brown, Fərvərdin

The 10 most frequent ambiguous lemmas: Deniz (PROPN 58, NOUN 2), _ (PUNCT 3, NOUN 2, PROPN 1, VERB 1), ev (NOUN 25, PROPN 1)

The 10 most frequent ambiguous types: Deniz (PROPN 55, NOUN 2), Ayşə (PROPN 3, NOUN 1)

Morphology

The form / lemma ratio of PROPN is 1.117647 (the average of all parts of speech is 1.486014).
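The reported ratio matches the number of distinct PROPN types divided by the number of distinct PROPN lemmas given above (19 and 17). A minimal sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical, and reading the ratio as types over lemmas is an assumption that happens to reproduce the published figure.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma (assumed definition)."""
    return n_types / n_lemmas


# Counts taken from the PROPN summary: 19 types, 17 lemmas.
ratio = form_lemma_ratio(19, 17)
print(f"{ratio:.6f}")
```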

The highest number of forms (4) was observed with the lemma “Peter”: Peter, Peterdendir, Peterin, Peterinən.

The second-highest number of forms (2) was observed with the lemma “Deniz”: Deniz, Denizin.

The third-highest number of forms (1) was observed with the lemma “Ayhan”: Ayhana.

PROPN occurs with 4 features: Case (87; 99% instances), Number (87; 99% instances), Number[psor] (1; 1% instances), Person[psor] (1; 1% instances)

PROPN occurs with 8 feature-value pairs: Case=Com, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Number=Sing, Number[psor]=Sing, Person[psor]=3

PROPN occurs with 7 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (70 tokens). Examples: Deniz, Ayşə, Mary, Peter, Brown, Fərvərdin, Jane, Ordibeheşt, Sam, Smith
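Feature combinations such as Case=Nom|Number=Sing follow the CoNLL-U FEATS convention: pairs of the form Feature=Value joined by "|", with "_" standing for an empty feature set. The sketch below shows one way such strings could be parsed and tallied. The helper `parse_feats` and the `sample` list are hypothetical and purely illustrative; they are not the treebank's actual tokens or the script that produced this page.

```python
from collections import Counter


def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    if feats == "_":  # "_" marks a token with no features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Illustrative FEATS values only, not real counts from the treebank.
sample = [
    "Case=Nom|Number=Sing",
    "Case=Gen|Number=Sing",
    "Case=Nom|Number=Sing",
    "_",
]

combos = Counter(sample)
most_common = combos.most_common(1)[0]
print(most_common)
print(parse_feats("Case=Gen|Number=Sing|Number[psor]=Sing|Person[psor]=3"))
```

Layered features such as Number[psor] are kept as ordinary keys, bracket included, because the sketch splits each pair only at its first "=".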

Relations

PROPN nodes are attached to their parents using 9 different relations: nsubj (64; 73% instances), nmod (8; 9% instances), obl (5; 6% instances), conj (4; 5% instances), appos (2; 2% instances), flat (2; 2% instances), ccomp (1; 1% instances), root (1; 1% instances), vocative (1; 1% instances)
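The percentages above can be reproduced from the raw counts: each count is divided by the total number of PROPN tokens (88) and rounded to a whole percent. The snippet is a sketch of that calculation, assuming simple rounding to the nearest integer; the variable names are hypothetical.

```python
# Relation counts for PROPN nodes, copied from the list above.
relation_counts = {
    "nsubj": 64, "nmod": 8, "obl": 5, "conj": 4, "appos": 2,
    "flat": 2, "ccomp": 1, "root": 1, "vocative": 1,
}

total = sum(relation_counts.values())  # should equal the 88 PROPN tokens

# Share of each relation as a whole-number percentage.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in percentages.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

Because each share is rounded independently, the rounded percentages need not add up to exactly 100.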

Parents of PROPN nodes belong to 6 different parts of speech: VERB (57; 65% instances), NOUN (21; 24% instances), ADJ (4; 5% instances), PROPN (4; 5% instances), PRON (1; 1% instances), and no part of speech, i.e. the artificial root node (1; 1% instances)

80 (91%) PROPN nodes are leaves.

4 (5%) PROPN nodes have one child.

2 (2%) PROPN nodes have two children.

2 (2%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 3.
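The degree figures above can be cross-checked against each other. Since the highest child degree is 3, the two nodes with "three or more" children must each have exactly three. The sketch below, with hypothetical variable names, confirms that the four groups cover all 88 PROPN tokens and that they imply 14 children in total, which agrees with the sum of the child relation counts reported on this page.

```python
# Number of PROPN nodes by child degree, from the figures above.
# Degree 3 stands for "three or more"; the maximum observed degree is 3.
degree_counts = {0: 80, 1: 4, 2: 2, 3: 2}

total_nodes = sum(degree_counts.values())
total_children = sum(degree * n for degree, n in degree_counts.items())

# Whole-number share of nodes at each degree.
shares = {degree: round(100 * n / total_nodes) for degree, n in degree_counts.items()}

print(total_nodes, total_children, shares)
```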

Children of PROPN nodes are attached using 7 different relations: conj (3; 21% instances), punct (3; 21% instances), cc (2; 14% instances), flat (2; 14% instances), orphan (2; 14% instances), advmod:emph (1; 7% instances), nsubj (1; 7% instances)

Children of PROPN nodes belong to 6 different parts of speech: PROPN (4; 29% instances), NOUN (3; 21% instances), PUNCT (3; 21% instances), CCONJ (2; 14% instances), ADV (1; 7% instances), VERB (1; 7% instances)