home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian: POS Tags: PROPN

There are 214 PROPN lemmas (10%), 259 PROPN types (8%) and 588 PROPN tokens (7%). Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.

The 10 most frequent PROPN lemmas: беларусь, ес, тэлеграф, еўрасаюз, нацбанк, мінск, літва, сірыя, уладзіслав, Кавалёв

The 10 most frequent PROPN types: Беларусі, ЕС, Беларусь, Тэлеграф, Кавалёва, Уладзіслава, АЭС, Еўрасаюза, МЗС, ВВД

The 10 most frequent ambiguous lemmas: м (NOUN 1, PROPN 1), мінскі (ADJ 6, PROPN 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of PROPN is 1.210280 (the average of all parts of speech is 1.397401).

The 1st highest number of forms (5) was observed with the lemma “літва”: Літва, Літве, Літвой, Літву, Літвы.

The 2nd highest number of forms (4) was observed with the lemma “нацбанк”: Нацбанк, Нацбанка, Нацбанкам, Нацбанку.

The 3rd highest number of forms (3) was observed with the lemma “беларусь”: Беларуссю, Беларусь, Беларусі.

PROPN occurs with 4 features: Animacy (583; 99% instances), Case (583; 99% instances), Number (583; 99% instances), Gender (582; 99% instances)

PROPN occurs with 13 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

PROPN occurs with 31 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing (88 tokens). Examples: Беларусі, АЭС, Літвы, Сірыі, Еўропы, ААН, ДАІ, Белавія, Расіі, Белтэлерадыёкампаніі

Relations

PROPN nodes are attached to their parents using 13 different relations: nmod (188; 32% instances), flat (124; 21% instances), nsubj (86; 15% instances), conj (79; 13% instances), obl (51; 9% instances), appos (17; 3% instances), flat:name (13; 2% instances), obj (13; 2% instances), root (8; 1% instances), iobj (3; 1% instances), parataxis (3; 1% instances), obl:agent (2; 0% instances), orphan (1; 0% instances)

Parents of PROPN nodes belong to 6 different parts of speech: NOUN (248; 42% instances), PROPN (178; 30% instances), VERB (145; 25% instances), (8; 1% instances), ADJ (7; 1% instances), ADV (2; 0% instances)

268 (46%) PROPN nodes are leaves.

238 (40%) PROPN nodes have one child.

53 (9%) PROPN nodes have two children.

29 (5%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 17.

Children of PROPN nodes are attached using 16 different relations: punct (126; 27% instances), case (95; 20% instances), flat (89; 19% instances), conj (76; 16% instances), cc (21; 4% instances), nmod (18; 4% instances), amod (13; 3% instances), flat:name (13; 3% instances), parataxis (10; 2% instances), appos (4; 1% instances), orphan (4; 1% instances), acl:relcl (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), det (1; 0% instances), nummod (1; 0% instances)

Children of PROPN nodes belong to 14 different parts of speech: PROPN (178; 37% instances), PUNCT (126; 27% instances), ADP (93; 20% instances), CCONJ (20; 4% instances), NOUN (16; 3% instances), NUM (13; 3% instances), ADJ (12; 3% instances), VERB (11; 2% instances), ADV (1; 0% instances), DET (1; 0% instances), PART (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)