home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: PROPN

There are 374 PROPN lemmas (14%), 569 PROPN types (11%) and 1184 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Москва, Ивановичь, Иванъ, Петръ, Григорей, Борисъ, Фалькъ, Василей, Новгородъ, Павловское

The 10 most frequent PROPN types: Ивановичю, Борису, Петр, Григорья, Москве, Фальк, Москвѣ, Григорей, Иван, Петра

The 10 most frequent ambiguous lemmas: гора (NOUN 5, PROPN 1)

The 10 most frequent ambiguous types: Августа (NOUN 1, PROPN 1), Петрова (ADJ 2, PROPN 1), Петрову (ADJ 2, PROPN 1), Юрьев (ADJ 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.521390 (the average of all parts of speech is 1.900114).

The 1st highest number of forms (10) was observed with the lemma “Василей”: Васил[ь]е, Васил[ь]ем, Васил[ь]емъ, Васил[ь]ю, Васил[ь]ѧ, Васил[і]ю, Василеи, Василей, Василью, Василья.

The 2nd highest number of forms (8) was observed with the lemma “Новгородъ”: Новагорода, Новгородъ, Новегороде, Новугороду, Новъгород, Новъгородъ, Новѣгороди, Новѣгородѣ.

The 3rd highest number of forms (8) was observed with the lemma “Семенъ”: Семен, Семена, Семену, Семенъ, Семенѣ, Семенꙋ, Семѣна, Семѣнꙋ.

PROPN occurs with 4 features: Case (1184; 100% instances), Gender (1184; 100% instances), Number (1184; 100% instances), Animacy (25; 2% instances)

PROPN occurs with 13 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 27 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (398 tokens). Examples: Петр, Фальк, Григорей, Иван, Дмитрей, Дементьев, Михайло, Олешка, Тихановичь, Вавила

Relations

PROPN nodes are attached to their parents using 18 different relations: flat:name (392; 33% instances), appos (326; 28% instances), obl (165; 14% instances), conj (81; 7% instances), nmod (79; 7% instances), nsubj (67; 6% instances), obj (27; 2% instances), iobj (17; 1% instances), root (8; 1% instances), amod (4; 0% instances), orphan (4; 0% instances), nsubj:pass (3; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), acl:relcl (2; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances)

Parents of PROPN nodes belong to 7 different parts of speech: PROPN (448; 38% instances), NOUN (400; 34% instances), VERB (280; 24% instances), PRON (36; 3% instances), ADJ (10; 1% instances), (8; 1% instances), DET (2; 0% instances)

488 (41%) PROPN nodes are leaves.

452 (38%) PROPN nodes have one child.

173 (15%) PROPN nodes have two children.

71 (6%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 14.

Children of PROPN nodes are attached using 19 different relations: flat:name (389; 36% instances), case (242; 22% instances), punct (127; 12% instances), conj (68; 6% instances), cc (65; 6% instances), appos (50; 5% instances), det (40; 4% instances), nmod (33; 3% instances), amod (23; 2% instances), nsubj (13; 1% instances), parataxis (12; 1% instances), advmod (10; 1% instances), orphan (7; 1% instances), acl:relcl (3; 0% instances), obl (3; 0% instances), mark (2; 0% instances), acl (1; 0% instances), cop (1; 0% instances), discourse (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (448; 41% instances), ADP (242; 22% instances), PUNCT (127; 12% instances), NOUN (117; 11% instances), CCONJ (65; 6% instances), DET (44; 4% instances), ADJ (22; 2% instances), PART (7; 1% instances), VERB (7; 1% instances), ADV (5; 0% instances), PRON (2; 0% instances), SCONJ (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances)