Treebank Statistics: UD_Belarusian: POS Tags: NOUN
There are 825 NOUN
lemmas (37%), 1206 NOUN
types (39%) and 2088 NOUN
tokens (26%).
Out of 16 observed tags, the rank of NOUN
is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
The 10 most frequent NOUN
lemmas: сакавік, год, мова, краіна, чалавек, красавік, санкцыя, бок, камітэт, конкурс
The 10 most frequent NOUN
types: сакавіка, года, рублёў, красавіка, млн, дачыненні, мова, санкцыі, выкананне, даляраў
The 10 most frequent ambiguous lemmas: імя (NOUN 3, ADP 1), м (NOUN 1, PROPN 1), рускі (ADJ 1, NOUN 1)
The 10 most frequent ambiguous types: стане (NOUN 4, VERB 3), дадзеных (NOUN 3, ADJ 1), асноўным (ADJ 1, NOUN 1), разам (ADV 5, NOUN 1), імя (ADP 1, NOUN 1)
- стане
- дадзеных
- асноўным
- разам
- імя
Morphology
The form / lemma ratio of NOUN
is 1.461818 (the average of all parts of speech is 1.397401).
The 1st highest number of forms (7) was observed with the lemma “год”: Год, гадах, гадоў, гады, года, годзе, году.
The 2nd highest number of forms (6) was observed with the lemma “бок”: Бакі, бакоў, баку, бок, бокам, боку.
The 3rd highest number of forms (6) was observed with the lemma “кампанія”: кампаній, кампанію, кампанія, кампаніяй, кампаніямі, кампаніі.
NOUN
occurs with 4 features: Animacy (2088; 100% instances), Case (2088; 100% instances), Number (2088; 100% instances), Gender (2087; 100% instances)
NOUN
occurs with 13 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
NOUN
occurs with 55 feature combinations.
The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing
(245 tokens).
Examples: сакавіка, года, млн, красавіка, конкурсу, ўніверсітэта, бульбы, камітэта, боку, аналізу
Relations
NOUN
nodes are attached to their parents using 24 different relations: nmod (748; 36% instances), obl (419; 20% instances), nsubj (317; 15% instances), obj (276; 13% instances), conj (140; 7% instances), fixed (48; 2% instances), parataxis (34; 2% instances), appos (19; 1% instances), root (15; 1% instances), nummod:gov (12; 1% instances), iobj (11; 1% instances), nummod (9; 0% instances), orphan (9; 0% instances), advcl (6; 0% instances), flat (6; 0% instances), ccomp (5; 0% instances), nsubj:pass (4; 0% instances), obl:agent (3; 0% instances), advmod (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), case (1; 0% instances), csubj (1; 0% instances), flat:name (1; 0% instances)
Parents of NOUN
nodes belong to 15 different parts of speech: VERB (977; 47% instances), NOUN (965; 46% instances), ADP (47; 2% instances), ADJ (46; 2% instances), PROPN (16; 1% instances), (15; 1% instances), ADV (5; 0% instances), CCONJ (3; 0% instances), DET (3; 0% instances), SYM (3; 0% instances), NUM (2; 0% instances), PRON (2; 0% instances), X (2; 0% instances), AUX (1; 0% instances), PART (1; 0% instances)
311 (15%) NOUN
nodes are leaves.
741 (35%) NOUN
nodes have one child.
666 (32%) NOUN
nodes have two children.
370 (18%) NOUN
nodes have three or more children.
The highest child degree of a NOUN
node is 10.
Children of NOUN
nodes are attached using 27 different relations: nmod (927; 27% instances), amod (742; 22% instances), case (643; 19% instances), punct (284; 8% instances), conj (141; 4% instances), det (119; 3% instances), cc (89; 3% instances), flat (58; 2% instances), nummod:gov (54; 2% instances), acl:relcl (52; 2% instances), nummod (41; 1% instances), appos (36; 1% instances), acl (34; 1% instances), obl (29; 1% instances), parataxis (29; 1% instances), advmod (28; 1% instances), nsubj (28; 1% instances), compound (20; 1% instances), xcomp (16; 0% instances), cop (13; 0% instances), advmod:discourse (9; 0% instances), mark (8; 0% instances), orphan (8; 0% instances), obj (4; 0% instances), advcl (3; 0% instances), iobj (3; 0% instances), flat:name (1; 0% instances)
Children of NOUN
nodes belong to 16 different parts of speech: NOUN (965; 28% instances), ADJ (740; 22% instances), ADP (647; 19% instances), PUNCT (284; 8% instances), PROPN (248; 7% instances), VERB (128; 4% instances), DET (124; 4% instances), NUM (98; 3% instances), CCONJ (86; 3% instances), PRON (29; 1% instances), ADV (23; 1% instances), PART (15; 0% instances), AUX (13; 0% instances), X (10; 0% instances), SCONJ (8; 0% instances), SYM (1; 0% instances)