
This page pertains to UD version 2.

Treebank Statistics: UD_Russian-PUD: POS Tags: PROPN

There are 832 PROPN lemmas (16%), 969 PROPN types (12%) and 1209 PROPN tokens (6%). Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: США, Китай, Америка, Трамп, Великобритания, Европа, Австралия, Гонконг, Италия, Франция

The 10 most frequent PROPN types: США, Великобритании, Америки, Италии, Китай, Клинтон, Австралии, Европы, Онтарио, BBC

Only one PROPN lemma is ambiguous between tags: де (PART 3, PROPN 1)

Only two PROPN types are ambiguous between tags: По (ADP 15, PROPN 1), Сад (NOUN 1, PROPN 1)

Morphology

The form / lemma ratio of PROPN is 1.164663 (the average of all parts of speech is 1.496727).
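The ratio above appears to be the number of PROPN types divided by the number of PROPN lemmas; that reading is an assumption, but it reproduces the reported figure from the counts given at the top of this page. A minimal sketch (the function name `form_lemma_ratio` is illustrative, not part of any UD tool):

```python
# Counts taken from the PROPN summary on this page.
propn_types = 969   # distinct word forms tagged PROPN
propn_lemmas = 832  # distinct lemmas tagged PROPN


def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


ratio = form_lemma_ratio(propn_types, propn_lemmas)
print(f"{ratio:.6f}")  # prints 1.164663
```

A ratio close to 1 means most proper nouns in this treebank occur in a single inflected form.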

The highest number of forms (5) was observed with the lemma “Трамп”: Трамп, Трампа, Трампе, Трампом, Трампу.

The second highest number of forms (4) was observed with the lemma “Великобритания”: Великобританией, Великобритании, Великобританию, Великобритания.

The third highest number of forms (also 4) was observed with the lemma “Гонконг”: Гонконг, Гонконга, Гонконге, Гонконгу.

PROPN occurs with 6 features: Animacy (1101; 91% instances), Case (1101; 91% instances), Gender (1101; 91% instances), Number (1101; 91% instances), Foreign (108; 9% instances), Abbr (53; 4% instances)
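The percentages in the feature list can be reproduced by dividing each count by the 1209 PROPN tokens reported at the top of this page and rounding to a whole percent. The sketch below assumes that is how the figures were derived; the helper name `share_percent` is hypothetical.

```python
# Total PROPN tokens and per-feature counts, copied from this page.
total_propn_tokens = 1209
feature_counts = {
    "Animacy": 1101,
    "Case": 1101,
    "Gender": 1101,
    "Number": 1101,
    "Foreign": 108,
    "Abbr": 53,
}


def share_percent(count: int, total: int) -> int:
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)


shares = {name: share_percent(count, total_propn_tokens)
          for name, count in feature_counts.items()}

for name, pct in shares.items():
    print(f"{name}: {pct}%")
```

The same calculation applies to the relation and part-of-speech lists further down, with the total replaced by the relevant number of nodes.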

PROPN occurs with 15 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 52 feature combinations. The most frequent feature combination is Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing (254 tokens). Examples: Джон, Джордж, Мисима, Рафферти, Сигал, Трамп, Уинстон, Шэнь, Август, Анайя
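A feature combination is written as feature-value pairs joined with `|`, and in CoNLL-U an underscore stands for "no features". A minimal sketch of splitting such a string into a dictionary, with a hypothetical helper name `parse_feats`:

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a FEATS string such as 'Case=Nom|Number=Sing' into a dict.

    An underscore (or an empty string) means the token has no features.
    """
    if not feats or feats == "_":
        return {}
    # Split each pair on the first '=' only, so values are kept intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent PROPN feature combination reported above.
combo = parse_feats("Animacy=Anim|Case=Nom|Gender=Masc|Number=Sing")
print(combo["Case"])    # prints Nom
print(sorted(combo))    # feature names in alphabetical order
```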

Relations

PROPN nodes are attached to their parents using 18 different relations: nmod (379; 31% instances), nsubj (214; 18% instances), flat:name (200; 17% instances), obl (128; 11% instances), conj (72; 6% instances), obj (47; 4% instances), appos (40; 3% instances), flat (37; 3% instances), flat:foreign (35; 3% instances), iobj (27; 2% instances), nsubj:pass (20; 2% instances), parataxis (3; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), obl:agent (1; 0% instances), orphan (1; 0% instances)

Parents of PROPN nodes belong to 12 different parts of speech: NOUN (542; 45% instances), VERB (402; 33% instances), PROPN (222; 18% instances), ADJ (14; 1% instances), AUX (6; 0% instances), X (6; 0% instances), DET (5; 0% instances), ADP (4; 0% instances), ADV (4; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)

641 (53%) PROPN nodes are leaves.

358 (30%) PROPN nodes have one child.

137 (11%) PROPN nodes have two children.

73 (6%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 7.
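The child-degree figures above count, for every PROPN node, how many other nodes name it as their head. The sketch below shows one way to compute such a distribution from CoNLL-U text. It is not the script that generated this page, and the sample sentence is invented for illustration rather than taken from UD_Russian-PUD.

```python
from collections import Counter


def child_degrees(conllu_text: str, upos: str = "PROPN") -> Counter:
    """Map child count -> number of nodes with the given UPOS tag."""
    degrees = Counter()
    sentence = []  # token rows of the sentence currently being read
    # The extra "" guarantees the last sentence is flushed.
    for line in conllu_text.splitlines() + [""]:
        if line.startswith("#"):
            continue  # sentence-level comment
        if line.strip():
            cols = line.split("\t")
            # Keep plain word lines; skip ranges (1-2) and empty nodes (1.1).
            if cols[0].isdigit():
                sentence.append(cols)
            continue
        # Blank line: sentence boundary. Column 7 (index 6) is HEAD.
        children = Counter(cols[6] for cols in sentence)
        for cols in sentence:
            if cols[3] == upos:  # column 4 (index 3) is UPOS
                degrees[children[cols[0]]] += 1
        sentence = []
    return degrees


# Invented five-token sentence in CoNLL-U column order.
rows = [
    ("1", "В", "в", "ADP", "_", "_", "2", "case", "_", "_"),
    ("2", "США", "США", "PROPN", "_", "_", "3", "obl", "_", "_"),
    ("3", "живёт", "жить", "VERB", "_", "_", "0", "root", "_", "_"),
    ("4", "Трамп", "Трамп", "PROPN", "_", "_", "3", "nsubj", "_", "_"),
    ("5", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows) + "\n\n"

result = child_degrees(sample)
print(dict(result))  # one PROPN leaf (Трамп), one PROPN with one child (США)
```

In the sample, США has one child (the preposition) and Трамп is a leaf, which mirrors the two most common cases reported for this treebank.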

Children of PROPN nodes are attached using 25 different relations: case (239; 27% instances), punct (164; 18% instances), flat:name (117; 13% instances), conj (83; 9% instances), amod (70; 8% instances), cc (59; 7% instances), appos (40; 4% instances), flat:foreign (32; 4% instances), acl:relcl (15; 2% instances), nmod (14; 2% instances), acl (11; 1% instances), flat (10; 1% instances), advmod (9; 1% instances), det (6; 1% instances), parataxis (6; 1% instances), nsubj (4; 0% instances), cop (3; 0% instances), mark (3; 0% instances), orphan (3; 0% instances), nummod (2; 0% instances), advcl (1; 0% instances), nummod:gov (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), xcomp (1; 0% instances)

Children of PROPN nodes belong to 15 different parts of speech: ADP (227; 25% instances), PROPN (222; 25% instances), PUNCT (164; 18% instances), ADJ (86; 10% instances), NOUN (61; 7% instances), CCONJ (58; 6% instances), VERB (23; 3% instances), SCONJ (16; 2% instances), ADV (14; 2% instances), DET (6; 1% instances), NUM (6; 1% instances), PART (4; 0% instances), X (4; 0% instances), AUX (3; 0% instances), PRON (1; 0% instances)