home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian: POS Tags: NOUN

There are 825 NOUN lemmas (37%), 1206 NOUN types (39%) and 2088 NOUN tokens (26%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: сакавік, год, мова, краіна, чалавек, красавік, санкцыя, бок, камітэт, конкурс

The 10 most frequent NOUN types: сакавіка, года, рублёў, красавіка, млн, дачыненні, мова, санкцыі, выкананне, даляраў

The 10 most frequent ambiguous lemmas: імя (NOUN 3, ADP 1), м (NOUN 1, PROPN 1), рускі (ADJ 1, NOUN 1)

The 10 most frequent ambiguous types: стане (NOUN 4, VERB 3), дадзеных (NOUN 3, ADJ 1), асноўным (ADJ 1, NOUN 1), разам (ADV 5, NOUN 1), імя (ADP 1, NOUN 1)

Morphology

The form / lemma ratio of NOUN is 1.461818 (the average of all parts of speech is 1.397401).

The 1st highest number of forms (7) was observed with the lemma “год”: Год, гадах, гадоў, гады, года, годзе, году.

The 2nd highest number of forms (6) was observed with the lemma “бок”: Бакі, бакоў, баку, бок, бокам, боку.

The 3rd highest number of forms (6) was observed with the lemma “кампанія”: кампаній, кампанію, кампанія, кампаніяй, кампаніямі, кампаніі.

NOUN occurs with 4 features: Animacy (2088; 100% instances), Case (2088; 100% instances), Number (2088; 100% instances), Gender (2087; 100% instances)

NOUN occurs with 13 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 55 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing (245 tokens). Examples: сакавіка, года, млн, красавіка, конкурсу, ўніверсітэта, бульбы, камітэта, боку, аналізу

Relations

NOUN nodes are attached to their parents using 24 different relations: nmod (748; 36% instances), obl (419; 20% instances), nsubj (317; 15% instances), obj (276; 13% instances), conj (140; 7% instances), fixed (48; 2% instances), parataxis (34; 2% instances), appos (19; 1% instances), root (15; 1% instances), nummod:gov (12; 1% instances), iobj (11; 1% instances), nummod (9; 0% instances), orphan (9; 0% instances), advcl (6; 0% instances), flat (6; 0% instances), ccomp (5; 0% instances), nsubj:pass (4; 0% instances), obl:agent (3; 0% instances), advmod (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), case (1; 0% instances), csubj (1; 0% instances), flat:name (1; 0% instances)

Parents of NOUN nodes belong to 15 different parts of speech: VERB (977; 47% instances), NOUN (965; 46% instances), ADP (47; 2% instances), ADJ (46; 2% instances), PROPN (16; 1% instances), (15; 1% instances), ADV (5; 0% instances), CCONJ (3; 0% instances), DET (3; 0% instances), SYM (3; 0% instances), NUM (2; 0% instances), PRON (2; 0% instances), X (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)

311 (15%) NOUN nodes are leaves.

741 (35%) NOUN nodes have one child.

666 (32%) NOUN nodes have two children.

370 (18%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 10.

Children of NOUN nodes are attached using 27 different relations: nmod (927; 27% instances), amod (742; 22% instances), case (643; 19% instances), punct (284; 8% instances), conj (141; 4% instances), det (119; 3% instances), cc (89; 3% instances), flat (58; 2% instances), nummod:gov (54; 2% instances), acl:relcl (52; 2% instances), nummod (41; 1% instances), appos (36; 1% instances), acl (34; 1% instances), obl (29; 1% instances), parataxis (29; 1% instances), advmod (28; 1% instances), nsubj (28; 1% instances), compound (20; 1% instances), xcomp (16; 0% instances), cop (13; 0% instances), advmod:discourse (9; 0% instances), mark (8; 0% instances), orphan (8; 0% instances), obj (4; 0% instances), advcl (3; 0% instances), iobj (3; 0% instances), flat:name (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (965; 28% instances), ADJ (740; 22% instances), ADP (647; 19% instances), PUNCT (284; 8% instances), PROPN (248; 7% instances), VERB (128; 4% instances), DET (124; 4% instances), NUM (98; 3% instances), CCONJ (86; 3% instances), PRON (29; 1% instances), ADV (23; 1% instances), PART (15; 0% instances), AUX (13; 0% instances), X (10; 0% instances), SCONJ (8; 0% instances), SYM (1; 0% instances)