Treebank Statistics: UD_Belarusian: POS Tags: PROPN
There are 214 PROPN lemmas (10%), 259 PROPN types (8%) and 588 PROPN tokens (7%).
Out of 16 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.
The 10 most frequent PROPN lemmas: беларусь, ес, тэлеграф, еўрасаюз, нацбанк, мінск, літва, сірыя, уладзіслав, Кавалёв
The 10 most frequent PROPN types: Беларусі, ЕС, Беларусь, Тэлеграф, Кавалёва, Уладзіслава, АЭС, Еўрасаюза, МЗС, ВВД
The 10 most frequent ambiguous lemmas: м (NOUN 1, PROPN 1), мінскі (ADJ 6, PROPN 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PROPN is 1.210280 (the average of all parts of speech is 1.397401).
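The reported ratio can be reproduced from the counts at the top of the page. The sketch below assumes that the ratio is the number of PROPN types divided by the number of PROPN lemmas; that reading is an inference from the figures, not something the page states, and the variable names are illustrative.

```python
# Counts copied from the summary at the top of this page.
propn_types = 259   # distinct word forms tagged PROPN
propn_lemmas = 214  # distinct lemmas tagged PROPN

# Assumption: form / lemma ratio = types / lemmas.
ratio = propn_types / propn_lemmas
formatted = f"{ratio:.6f}"
print(formatted)
```

Under that assumption the result agrees with the value shown above to six decimal places.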
The 1st highest number of forms (5) was observed with the lemma “літва”: Літва, Літве, Літвой, Літву, Літвы.
The 2nd highest number of forms (4) was observed with the lemma “нацбанк”: Нацбанк, Нацбанка, Нацбанкам, Нацбанку.
The 3rd highest number of forms (3) was observed with the lemma “беларусь”: Беларуссю, Беларусь, Беларусі.
PROPN occurs with 4 features: Animacy (583; 99% instances), Case (583; 99% instances), Number (583; 99% instances), Gender (582; 99% instances)
PROPN occurs with 13 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing
PROPN occurs with 31 feature combinations.
The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing (88 tokens).
Examples: Беларусі, АЭС, Літвы, Сірыі, Еўропы, ААН, ДАІ, Белавія, Расіі, Белтэлерадыёкампаніі
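A feature combination such as the one above is a single string of `Feature=Value` pairs joined by `|`, which is how the FEATS column of a CoNLL-U file stores it. The following is a minimal sketch of splitting such a string into individual pairs; `parse_feats` is a hypothetical helper name, not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict.

    An underscore means the token carries no features.
    """
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# The most frequent PROPN combination reported on this page.
combo = "Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing"
parsed = parse_feats(combo)
print(parsed["Case"], parsed["Gender"])
```

Counting the distinct dictionary keys over all PROPN tokens would give the feature list, and counting the distinct full strings would give the number of feature combinations.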
Relations
PROPN nodes are attached to their parents using 13 different relations: nmod (188; 32% instances), flat (124; 21% instances), nsubj (86; 15% instances), conj (79; 13% instances), obl (51; 9% instances), appos (17; 3% instances), flat:name (13; 2% instances), obj (13; 2% instances), root (8; 1% instances), iobj (3; 1% instances), parataxis (3; 1% instances), obl:agent (2; 0% instances), orphan (1; 0% instances)
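Relation counts of this kind can be gathered by reading the UPOS and DEPREL columns of a CoNLL-U file. The sketch below is one possible way to do it, not the script that generated this page; `count_deprels` is a hypothetical name, and the sample sentence is a toy example invented for illustration rather than taken from the treebank.

```python
from collections import Counter


def count_deprels(conllu_text, upos="PROPN"):
    """Count DEPREL values of tokens whose UPOS equals `upos`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:       # column 4 is UPOS
            counts[cols[7]] += 1  # column 8 is DEPREL
    return counts


# Toy sentence (invented, not from UD_Belarusian).
rows = [
    ("1", "Нацбанк", "нацбанк", "PROPN", "_", "_", "2", "nsubj", "_", "_"),
    ("2", "знаходзіцца", "знаходзіцца", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "ў", "у", "ADP", "_", "_", "4", "case", "_", "_"),
    ("4", "Мінску", "мінск", "PROPN", "_", "_", "2", "obl", "_", "_"),
]
sample = "\n".join("\t".join(r) for r in rows)

counts = count_deprels(sample)
total = sum(counts.values())
for rel, n in counts.most_common():
    print(f"{rel} ({n}; {round(100 * n / total)}% instances)")
```

Run over the whole treebank, the per-relation counts should sum to the number of PROPN tokens, as the 13 figures above do (588).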
Parents of PROPN nodes belong to 6 different parts of speech: NOUN (248; 42% instances), PROPN (178; 30% instances), VERB (145; 25% instances), (8; 1% instances), ADJ (7; 1% instances), ADV (2; 0% instances)
268 (46%) PROPN nodes are leaves.
238 (40%) PROPN nodes have one child.
53 (9%) PROPN nodes have two children.
29 (5%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 17.
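The child-degree breakdown above groups PROPN nodes by how many tokens name them as head. A minimal sketch of that grouping follows, assuming each sentence is available as a list of `(id, upos, head)` tuples; the function name and the toy sentence are illustrative and not taken from the treebank. The last lines check that the four reported counts add up to the 588 PROPN tokens.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="PROPN"):
    """Bucket nodes tagged `upos` by number of children: 0, 1, 2 or '3+'."""
    buckets = Counter()
    for sent in sentences:
        # How many tokens in this sentence point at each head id.
        n_children = Counter(head for _, _, head in sent)
        for tok_id, tag, _ in sent:
            if tag == upos:
                degree = n_children[tok_id]
                buckets[degree if degree < 3 else "3+"] += 1
    return buckets


# Toy sentence (invented): token 4 is a PROPN with one child (token 3).
toy = [[(1, "PROPN", 2), (2, "VERB", 0), (3, "ADP", 4), (4, "PROPN", 2)]]
buckets = child_degree_buckets(toy)
print(dict(buckets))

# Counts reported on this page for UD_Belarusian.
reported = {0: 268, 1: 238, 2: 53, "3+": 29}
reported_total = sum(reported.values())
print(reported_total)
```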
Children of PROPN nodes are attached using 16 different relations: punct (126; 27% instances), case (95; 20% instances), flat (89; 19% instances), conj (76; 16% instances), cc (21; 4% instances), nmod (18; 4% instances), amod (13; 3% instances), flat:name (13; 3% instances), parataxis (10; 2% instances), appos (4; 1% instances), orphan (4; 1% instances), acl:relcl (2; 0% instances), acl (1; 0% instances), advmod (1; 0% instances), det (1; 0% instances), nummod (1; 0% instances)
Children of PROPN nodes belong to 14 different parts of speech: PROPN (178; 37% instances), PUNCT (126; 27% instances), ADP (93; 20% instances), CCONJ (20; 4% instances), NOUN (16; 3% instances), NUM (13; 3% instances), ADJ (12; 3% instances), VERB (11; 2% instances), ADV (1; 0% instances), DET (1; 0% instances), PART (1; 0% instances), PRON (1; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)