This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: ADJ

There are 4520 ADJ lemmas (15%), 9443 ADJ types (18%) and 26816 ADJ tokens (9%). Out of 17 observed tags, the rank of ADJ is: 3 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent ADJ lemmas: беларускі, новы, першы, вялікі, расейскі, вядомы, менскі, добры, апошні, дзяржаўны

The 10 most frequent ADJ types: беларускай, новы, надзвычайных, беларускіх, беларускую, беларускі, беларуская, беларускія, першы, беларускага

The 10 most frequent ambiguous lemmas: родны (ADJ 111, NOUN 1), 2019 (ADJ 101, NUM 6), стары (ADJ 86, NOUN 1), 2018 (ADJ 78, NUM 4), 23 (ADJ 69, NUM 28), 12 (ADJ 68, NUM 45), 1 (NUM 133, ADJ 66), 18 (NUM 89, ADJ 63), 25 (ADJ 63, NUM 61), 3 (NUM 122, ADJ 62, X 1)

The 10 most frequent ambiguous types: беларускай (ADJ 282, NOUN 2), беларускі (ADJ 82, NOUN 3), 2019 (ADJ 101, NUM 6), вядома (ADJ 78, ADV 2), 2018 (ADJ 78, NUM 4), 12 (ADJ 68, NUM 45), 23 (ADJ 67, NUM 28), 1 (NUM 133, ADJ 66), 18 (NUM 89, ADJ 63), 25 (ADJ 63, NUM 62)

Morphology

The form / lemma ratio of ADJ is 2.089159 (the average of all parts of speech is 1.756638).
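
The ratio above is consistent with dividing the number of ADJ types by the number of ADJ lemmas reported earlier (9443 / 4520 ≈ 2.089159). The following is a minimal Python sketch of that computation, assuming the treebank is available as CoNLL-U text and that "forms" means case-sensitive distinct word forms. The function name `form_lemma_ratio` and the two-sentence sample are hypothetical illustrations, not part of the UD tooling or the treebank.

```python
def form_lemma_ratio(conllu_text, upos="ADJ"):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    Assumption: the ratio is (number of types) / (number of lemmas).
    """
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Keep only regular tokens: skip multiword ranges (1-2)
        # and empty nodes (1.1), whose IDs are not plain integers.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(forms) / len(lemmas) if lemmas else 0.0


# Invented toy data: two forms of one adjective lemma.
SAMPLE = (
    "# text = новага дома\n"
    "1\tновага\tновы\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tдома\tдом\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "# text = новы дом\n"
    "1\tновы\tновы\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tдом\tдом\tNOUN\t_\t_\t0\troot\t_\t_\n"
)

ratio = form_lemma_ratio(SAMPLE)
```

On the toy sample the single lemma has two forms, so the ratio is 2.0. The page's own figure may be computed differently in detail (for example in how capitalization is treated), so this is a sanity check rather than a reimplementation.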

The 1st highest number of forms (17) was observed with the lemma “уласны”: уласнай, уласнаму, уласная, уласную, уласны, уласных, уласныя, ўласнага, ўласнае, ўласнай, ўласнаму, ўласную, ўласны, ўласным, ўласнымі, ўласных, ўласныя.

The 2nd highest number of forms (17) was observed with the lemma “унікальны”: Унікальнае, Унікальная, унікальнага, унікальнай, унікальную, унікальны, унікальным, унікальных, унікальныя, ўнікальнае, ўнікальнай, ўнікальная, ўнікальную, ўнікальны, ўнікальным, ўнікальных, ўнікальныя.

The 3rd highest number of forms (16) was observed with the lemma “украінскі”: украінскага, украінскай, украінскі, украінскім, украінскімі, украінскіх, украінскія, ўкраінскага, ўкраінскае, ўкраінскай, ўкраінскую, ўкраінскі, ўкраінскім, ўкраінскімі, ўкраінскіх, ўкраінскія.

ADJ occurs with 10 features: Degree (23536; 88% instances), Number (23498; 88% instances), Case (22796; 85% instances), Gender (17068; 64% instances), Animacy (3041; 11% instances), Variant (688; 3% instances), Abbr (96; 0% instances), Typo (7; 0% instances), Foreign (2; 0% instances), VerbForm (1; 0% instances)

ADJ occurs with 21 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Typo=Yes, Variant=Short, VerbForm=Part

ADJ occurs with 100 feature combinations. The most frequent feature combination is _ (3179 tokens). Examples: 2019, 2018, 12, 23, 1, 18, 25, 3, 9, 29

Relations

ADJ nodes are attached to their parents using 27 different relations: amod (21023; 78% instances), root (1186; 4% instances), conj (1034; 4% instances), obl (1021; 4% instances), nmod (848; 3% instances), parataxis (362; 1% instances), xcomp (247; 1% instances), nsubj (191; 1% instances), acl (166; 1% instances), appos (164; 1% instances), list (154; 1% instances), obj (106; 0% instances), ccomp (79; 0% instances), acl:relcl (62; 0% instances), advcl (51; 0% instances), orphan (25; 0% instances), iobj (22; 0% instances), compound (19; 0% instances), csubj (13; 0% instances), advmod (12; 0% instances), fixed (9; 0% instances), vocative (7; 0% instances), nsubj:pass (6; 0% instances), flat (3; 0% instances), obl:agent (3; 0% instances), dep (2; 0% instances), flat:name (1; 0% instances)
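
The relation counts above can be approximated by tallying the DEPREL column for every token whose UPOS is ADJ. This is a minimal Python sketch under that assumption; `deprel_distribution` and the sample sentences are hypothetical, and percentages are rounded to whole numbers in the same style as the list above.

```python
from collections import Counter


def deprel_distribution(conllu_text, upos="ADJ"):
    """Return (relation, count, rounded percent) tuples, most frequent first."""
    counts = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges and empty nodes.
        if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            counts[cols[7]] += 1  # DEPREL column
    total = sum(counts.values())
    return [(rel, n, round(100 * n / total)) for rel, n in counts.most_common()]


# Invented toy data: two attributive adjectives and one predicative root.
SAMPLE = (
    "1\tновы\tновы\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tдом\tдом\tNOUN\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\tдом\tдом\tNOUN\t_\t_\t2\tnsubj\t_\t_\n"
    "2\tстары\tстары\tADJ\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\tстары\tстары\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tдом\tдом\tNOUN\t_\t_\t0\troot\t_\t_\n"
)

distribution = deprel_distribution(SAMPLE)
```

Applied to the figures on this page, the same rounding gives 21023 / 26816 ≈ 78% for amod.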

Parents of ADJ nodes belong to 16 different parts of speech: NOUN (21258; 79% instances), VERB (1847; 7% instances), ADJ (1251; 5% instances), no part of speech, i.e. the ADJ is attached to the artificial root (1186; 4% instances), PROPN (625; 2% instances), X (206; 1% instances), PRON (140; 1% instances), ADV (123; 0% instances), NUM (92; 0% instances), DET (45; 0% instances), SYM (33; 0% instances), AUX (3; 0% instances), INTJ (3; 0% instances), ADP (2; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)

20518 (77%) ADJ nodes are leaves.

2998 (11%) ADJ nodes have one child.

1465 (5%) ADJ nodes have two children.

1835 (7%) ADJ nodes have three or more children.

The highest child degree of an ADJ node is 16.
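
The four child-degree counts above add up to the total number of ADJ tokens (20518 + 2998 + 1465 + 1835 = 26816). A minimal Python sketch of how such a distribution could be derived from CoNLL-U text follows. It assumes sentences are separated by a single empty line and that a node's degree is the number of regular tokens naming it in their HEAD column; `child_degree_buckets` and the sample are hypothetical.

```python
from collections import Counter


def child_degree_buckets(conllu_text, upos="ADJ"):
    """Count nodes of one UPOS by number of children (0, 1, 2, 3 = three or more).

    Returns (buckets, highest child degree seen).
    """
    buckets = Counter()
    max_degree = 0
    for sentence in conllu_text.strip().split("\n\n"):
        pos_of = {}             # token ID -> UPOS
        n_children = Counter()  # head ID -> number of dependents
        for line in sentence.splitlines():
            cols = line.split("\t")
            # Skip comments, multiword ranges and empty nodes.
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            pos_of[cols[0]] = cols[3]
            n_children[cols[6]] += 1  # HEAD column
        for node_id, pos in pos_of.items():
            if pos != upos:
                continue
            degree = n_children[node_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return dict(buckets), max_degree


# Invented toy data: one predicative ADJ with three children, one leaf ADJ.
SAMPLE = (
    "1\tдом\tдом\tNOUN\t_\t_\t3\tnsubj\t_\t_\n"
    "2\tвельмі\tвельмі\tADV\t_\t_\t3\tadvmod\t_\t_\n"
    "3\tстары\tстары\tADJ\t_\t_\t0\troot\t_\t_\n"
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_\n"
    "\n"
    "1\tновы\tновы\tADJ\t_\t_\t2\tamod\t_\t_\n"
    "2\tдом\tдом\tNOUN\t_\t_\t0\troot\t_\t_\n"
)

buckets, highest = child_degree_buckets(SAMPLE)
```

Capping the bucket key at 3 mirrors the "three or more children" grouping used above, while the true maximum is tracked separately.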

Children of ADJ nodes are attached using 38 different relations: punct (3656; 28% instances), flat (1250; 9% instances), conj (1102; 8% instances), nsubj (959; 7% instances), advmod (907; 7% instances), cc (741; 6% instances), case (722; 5% instances), obl (701; 5% instances), nmod (695; 5% instances), parataxis (383; 3% instances), list (359; 3% instances), det (330; 2% instances), cop (248; 2% instances), csubj (223; 2% instances), mark (168; 1% instances), xcomp (161; 1% instances), iobj (124; 1% instances), dep (82; 1% instances), amod (74; 1% instances), advcl (63; 0% instances), appos (44; 0% instances), compound (35; 0% instances), acl:relcl (26; 0% instances), discourse (23; 0% instances), obj (23; 0% instances), nummod:gov (20; 0% instances), expl (15; 0% instances), orphan (14; 0% instances), ccomp (12; 0% instances), nummod (12; 0% instances), acl (11; 0% instances), aux (11; 0% instances), vocative (10; 0% instances), nsubj:pass (8; 0% instances), obl:agent (7; 0% instances), aux:pass (5; 0% instances), fixed (2; 0% instances), nsubj:outer (1; 0% instances)

Children of ADJ nodes belong to 17 different parts of speech: PUNCT (3656; 28% instances), NOUN (3071; 23% instances), ADJ (1251; 9% instances), CCONJ (736; 6% instances), ADV (732; 6% instances), VERB (717; 5% instances), ADP (707; 5% instances), PRON (452; 3% instances), DET (370; 3% instances), NUM (288; 2% instances), AUX (267; 2% instances), PART (254; 2% instances), PROPN (243; 2% instances), X (184; 1% instances), SCONJ (182; 1% instances), SYM (116; 1% instances), INTJ (1; 0% instances)