This page pertains to UD version 2.

Treebank Statistics: UD_Old_Russian-RNC: POS Tags: PROPN

There are 276 PROPN lemmas (12%), 434 PROPN types (10%) and 862 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 7 in number of tokens.

The 10 most frequent PROPN lemmas: Петръ, Иванъ, Григорей, Фалькъ, Василей, Москва, Новгородъ, Васильевичь, Дмитрей, Тихановичь

The 10 most frequent PROPN types: Петр, Григорья, Фальк, Москвѣ, Григорей, Иван, Петра, Дмитрей, Михайло, Русии

The most frequent ambiguous lemmas (only 2 observed): августъ (NOUN 1, PROPN 1), Богъ (NOUN 27, PROPN 1)

The most frequent ambiguous types (only 4 observed): Бог (NOUN 1, PROPN 1), Петрова (ADJ 2, PROPN 1), Петрову (ADJ 2, PROPN 1), Юрьев (ADJ 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.572464 (the average of all parts of speech is 1.860579).
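The reported ratio is consistent with dividing the number of PROPN types (434) by the number of PROPN lemmas (276), both taken from the counts at the top of this page. The sketch below reproduces that arithmetic; the function name `form_lemma_ratio` is hypothetical, and reading "forms" as "types" is an assumption based on the numbers matching.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms (types) per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts reported on this page for PROPN.
propn_ratio = form_lemma_ratio(n_types=434, n_lemmas=276)
print(f"{propn_ratio:.6f}")  # 1.572464
```

A ratio below the all-tags average (1.860579) means that a PROPN lemma appears in fewer distinct forms, on average, than lemmas of other parts of speech.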

The 1st highest number of forms (10) was observed with the lemma “Василей”: Васил[ь]е, Васил[ь]ем, Васил[ь]емъ, Васил[ь]ю, Васил[ь]ѧ, Васил[і]ю, Василеи, Василей, Василью, Василья.

The 2nd highest number of forms (8) was observed with the lemma “Новгородъ”: Новагорода, Новгородъ, Новегороде, Новугороду, Новъгород, Новъгородъ, Новѣгороди, Новѣгородѣ.

The 3rd highest number of forms (7) was observed with the lemma “Иванъ”: ИВАНА, Иван, Ивана, Ивану, Иванъ, Иванѣ, Иванꙋ.

PROPN occurs with 4 features: Case (862; 100% instances), Gender (862; 100% instances), Number (862; 100% instances), Animacy (25; 3% instances)

PROPN occurs with 13 feature-value pairs: Animacy=Anim, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 23 feature combinations. The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (298 tokens). Examples: Петр, Фальк, Григорей, Иван, Дмитрей, Михайло, Тихановичь, Вавила, Василей, Томосов

Relations

PROPN nodes are attached to their parents using 17 different relations: flat:name (278; 32% instances), appos (239; 28% instances), obl (105; 12% instances), conj (69; 8% instances), nmod (62; 7% instances), nsubj (52; 6% instances), obj (21; 2% instances), iobj (15; 2% instances), root (5; 1% instances), orphan (4; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), amod (2; 0% instances), nsubj:pass (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances)
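The percentages above can be checked against the raw counts. The following sketch assumes each share is the count divided by the total number of PROPN tokens, rounded to the nearest whole percent; the rounding rule is an assumption, not something this page states.

```python
# Relation counts for PROPN nodes, copied from the list above.
parent_relations = {
    "flat:name": 278, "appos": 239, "obl": 105, "conj": 69, "nmod": 62,
    "nsubj": 52, "obj": 21, "iobj": 15, "root": 5, "orphan": 4,
    "vocative": 3, "acl": 2, "amod": 2, "nsubj:pass": 2,
    "acl:relcl": 1, "ccomp": 1, "compound": 1,
}

# Every PROPN token has exactly one incoming relation,
# so the counts should add up to the PROPN token count.
total = sum(parent_relations.values())

# Share of each relation as a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in parent_relations.items()}

print(total)                # 862
print(shares["flat:name"])  # 32
print(shares["appos"])      # 28
```

The total of 862 matches the PROPN token count, and relations with fewer than about 4 instances round down to 0%.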

Parents of PROPN nodes belong to 7 different parts of speech: PROPN (324; 38% instances), NOUN (297; 34% instances), VERB (196; 23% instances), PRON (32; 4% instances), ADJ (6; 1% instances), no part of speech, i.e. the artificial root (5; 1% instances), DET (2; 0% instances)

348 (40%) PROPN nodes are leaves.

324 (38%) PROPN nodes have one child.

135 (16%) PROPN nodes have two children.

55 (6%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 14.
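Child-degree figures like these can be derived from a CoNLL-U file by counting, for each PROPN node, how many tokens in the same sentence name it as their head. The sketch below is a minimal illustration, not the script used to generate this page. The function name `propn_child_degrees` is hypothetical, and the sample fragment is a toy invented for the example, not taken from UD_Old_Russian-RNC.

```python
from collections import Counter

# Toy CoNLL-U fragment, invented for illustration only.
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = (
    "1\tПетр\t_\tPROPN\t_\t_\t3\tnsubj\t_\t_\n"
    "2\tИван\t_\tPROPN\t_\t_\t1\tflat:name\t_\t_\n"
    "3\tverb\t_\tVERB\t_\t_\t0\troot\t_\t_\n"
    "4\tprep\t_\tADP\t_\t_\t5\tcase\t_\t_\n"
    "5\tМосквѣ\t_\tPROPN\t_\t_\t3\tobl\t_\t_\n"
)


def propn_child_degrees(conllu_text: str) -> Counter:
    """Map each child count to the number of PROPN nodes that have it."""
    degrees = Counter()
    sentence = []  # (id, upos, head) for each token of the current sentence

    # The trailing "" forces the last sentence to be flushed.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if not line.strip():
            if sentence:
                # Count how often each token id is used as a head.
                children = Counter(head for _, _, head in sentence)
                for tok_id, upos, _ in sentence:
                    if upos == "PROPN":
                        degrees[children[tok_id]] += 1
                sentence = []
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees


degrees = propn_child_degrees(SAMPLE)
print(degrees[0], degrees[1])  # 1 2
```

In the toy fragment, one PROPN node is a leaf and two have a single child each. Run over the full treebank, the same tally would be expected to give the leaf, one-child, two-children and three-or-more counts listed above.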

Children of PROPN nodes are attached using 19 different relations: flat:name (277; 34% instances), case (169; 20% instances), punct (100; 12% instances), conj (55; 7% instances), cc (54; 7% instances), appos (45; 5% instances), det (39; 5% instances), nmod (31; 4% instances), amod (22; 3% instances), nsubj (11; 1% instances), advmod (8; 1% instances), acl:relcl (3; 0% instances), obl (3; 0% instances), mark (2; 0% instances), orphan (2; 0% instances), parataxis (2; 0% instances), acl (1; 0% instances), cop (1; 0% instances), discourse (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (324; 39% instances), ADP (169; 20% instances), PUNCT (100; 12% instances), NOUN (95; 12% instances), CCONJ (54; 7% instances), DET (43; 5% instances), ADJ (21; 3% instances), PART (6; 1% instances), VERB (5; 1% instances), ADV (4; 0% instances), SCONJ (2; 0% instances), AUX (1; 0% instances), INTJ (1; 0% instances), PRON (1; 0% instances)