home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tatar-NMCTT: POS Tags: PROPN

There are 83 PROPN lemmas (9%), 96 PROPN types (8%) and 167 PROPN tokens (7%). Out of 14 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 5 in number of tokens.

The 10 most frequent PROPN lemmas: Татарстан, Кама, Казан, Төмән, Марат, Россия, Чиләбе, рамил, Әхмәтов, Василий

The 10 most frequent PROPN types: Татарстан, Кама, Төмән, Марат, Татарстанның, Казан, Рамил, Татарстанда, Чиләбе, Әхмәтов

The 10 most frequent ambiguous lemmas: Россия (PROPN 4, NOUN 1, PRON 1), Курск (PROPN 2, NOUN 1), татар (NOUN 11, PROPN 1)

The 10 most frequent ambiguous types: Россия (PROPN 3, PRON 1), Татар (NOUN 3, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.156627 (the average of all parts of speech is 1.414579).

The 1st highest number of forms (3) was observed with the lemma “Казан”: Казан, Казанда, Казанның.

The 2nd highest number of forms (3) was observed with the lemma “Кама”: Кама, Камага, Каманың.

The 3rd highest number of forms (3) was observed with the lemma “Кырым”: Кырым, Кырымда, Кырымнан.

PROPN occurs with 3 features: Case (165; 99% instances), Number (165; 99% instances), Person[psor] (2; 1% instances)

PROPN occurs with 7 feature-value pairs: Case=Abl, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Number=Sing, Person[psor]=3

PROPN occurs with 8 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing (137 tokens). Examples: Татарстан, Кама, Төмән, Марат, Казан, Рамил, Чиләбе, Әхмәтов, Василий, Россия

Relations

PROPN nodes are attached to their parents using 7 different relations: nmod (75; 45% instances), flat (43; 26% instances), appos (18; 11% instances), obl (11; 7% instances), nsubj (10; 6% instances), compound (7; 4% instances), conj (3; 2% instances)

Parents of PROPN nodes belong to 4 different parts of speech: NOUN (107; 64% instances), PROPN (33; 20% instances), VERB (20; 12% instances), ADJ (7; 4% instances)

117 (70%) PROPN nodes are leaves.

40 (24%) PROPN nodes have one child.

7 (4%) PROPN nodes have two children.

3 (2%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 5.

Children of PROPN nodes are attached using 10 different relations: flat (31; 48% instances), punct (14; 22% instances), amod (6; 9% instances), conj (6; 9% instances), compound (3; 5% instances), acl (1; 2% instances), case (1; 2% instances), ccomp (1; 2% instances), nmod (1; 2% instances), parataxis (1; 2% instances)

Children of PROPN nodes belong to 6 different parts of speech: PROPN (33; 51% instances), PUNCT (14; 22% instances), ADJ (10; 15% instances), VERB (4; 6% instances), NOUN (3; 5% instances), ADP (1; 2% instances)