home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: POS Tags: ADJ

There are 3772 ADJ lemmas (20%), 6685 ADJ types (22%) and 12379 ADJ tokens (12%). Out of 16 observed tags, the rank of ADJ is: 3 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: ПЕРВЫЙ, Й, НОВЫЙ, ДРУГОЙ, ВТОРОЙ, БОЛЬШОЙ, САМЫЙ, ИЗВЕСТНЫЙ, РОССИЙСКИЙ, ОСНОВНОЙ

The 10 most frequent ADJ types: второй, й, 2008, 2010, х, других, 2004, первый, 2012, 1

The 10 most frequent ambiguous lemmas: ДРУГОЙ (ADJ 100, NOUN 10), РУССКИЙ (ADJ 57, NOUN 11), ВОЕННЫЙ (ADJ 52, NOUN 2), ПОСЛЕДНИЙ (ADJ 46, NOUN 8), 2008 (ADJ 41, NUM 2, ADV 1), ЛУЧШИЙ (ADJ 40, NOUN 1), 2010 (ADJ 39, ADV 1, NUM 1), НЕМЕЦКИЙ (ADJ 35, NOUN 1), 2012 (ADJ 34, ADV 1, NUM 1), 1 (NUM 43, ADJ 33, ADV 19)

The 10 most frequent ambiguous types: 2008 (ADJ 41, NUM 2, ADV 1), 2010 (ADJ 39, ADV 1, NUM 1), х (ADJ 39, NUM 5, NOUN 1), 2012 (ADJ 34, ADV 1, NUM 1), 1 (NUM 43, ADJ 33, ADV 19), 2011 (ADJ 33, NUM 1), 2007 (ADJ 32, NUM 1), II (ADJ 30, NUM 3), 12 (ADJ 29, NUM 17, ADV 6), 2005 (ADJ 25, NUM 4)

Morphology

The form / lemma ratio of ADJ is 1.772269 (the average of all parts of speech is 1.592402).

The 1st highest number of forms (14) was observed with the lemma “ИЗВЕСТНЫЙ”: известен, известная, известно, известного, известное, известной, известному, известную, известны, известные, известный, известным, известными, известных.

The 2nd highest number of forms (13) was observed with the lemma “БОЛЬШОЙ”: Большом, большая, большие, большим, большими, больших, большого, большое, большой, большому, большую, велика, велико.

The 3rd highest number of forms (12) was observed with the lemma “ПЕРВЫЙ”: первая, первого, первое, первой, первом, первому, первую, первые, первый, первым, первыми, первых.

ADJ occurs with 6 features: Number (12346; 100% instances), Animacy (12314; 99% instances), Case (12314; 99% instances), Gender (9623; 78% instances), Variant (274; 2% instances), Degree (165; 1% instances)

ADJ occurs with 17 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Variant=Short

ADJ occurs with 81 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing (1549 tokens). Examples: го, 2010, 2012, 2009, 2011, 1941, государственного, 2004, 2006, 2008

Relations

ADJ nodes are attached to their parents using 27 different relations: amod (10342; 84% instances), conj (489; 4% instances), goeswith (428; 3% instances), nmod (294; 2% instances), root (229; 2% instances), appos (130; 1% instances), obl (91; 1% instances), parataxis (76; 1% instances), xcomp (55; 0% instances), acl (44; 0% instances), obj (34; 0% instances), acl:relcl (29; 0% instances), nsubj (28; 0% instances), ccomp (22; 0% instances), list (19; 0% instances), advcl (16; 0% instances), advmod (14; 0% instances), orphan (13; 0% instances), flat (5; 0% instances), compound (4; 0% instances), nsubj:pass (4; 0% instances), case (3; 0% instances), fixed (3; 0% instances), nummod (3; 0% instances), iobj (2; 0% instances), det (1; 0% instances), discourse (1; 0% instances)

Parents of ADJ nodes belong to 11 different parts of speech: NOUN (10298; 83% instances), ADJ (584; 5% instances), ADV (476; 4% instances), PROPN (420; 3% instances), VERB (306; 2% instances), (229; 2% instances), AUX (24; 0% instances), PRON (23; 0% instances), NUM (9; 0% instances), DET (7; 0% instances), PUNCT (3; 0% instances)

10382 (84%) ADJ nodes are leaves.

1150 (9%) ADJ nodes have one child.

331 (3%) ADJ nodes have two children.

516 (4%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 11.

Children of ADJ nodes are attached using 35 different relations: punct (1079; 27% instances), conj (484; 12% instances), nmod (415; 11% instances), cc (350; 9% instances), advmod (333; 8% instances), nsubj (293; 7% instances), case (255; 6% instances), xcomp (92; 2% instances), obl (84; 2% instances), cop (80; 2% instances), amod (71; 2% instances), mark (52; 1% instances), iobj (44; 1% instances), parataxis (44; 1% instances), appos (37; 1% instances), advcl (36; 1% instances), aux (19; 0% instances), ccomp (17; 0% instances), orphan (17; 0% instances), nsubj:pass (14; 0% instances), det (13; 0% instances), list (13; 0% instances), obj (13; 0% instances), acl (10; 0% instances), aux:pass (10; 0% instances), nummod (10; 0% instances), discourse (9; 0% instances), nummod:gov (9; 0% instances), goeswith (8; 0% instances), acl:relcl (4; 0% instances), obl:agent (4; 0% instances), flat (2; 0% instances), compound (1; 0% instances), dep (1; 0% instances), fixed (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: PUNCT (1181; 30% instances), NOUN (682; 17% instances), ADJ (584; 15% instances), CCONJ (343; 9% instances), ADV (316; 8% instances), VERB (226; 6% instances), ADP (157; 4% instances), AUX (113; 3% instances), PRON (84; 2% instances), PART (71; 2% instances), PROPN (62; 2% instances), SCONJ (52; 1% instances), NUM (30; 1% instances), DET (17; 0% instances), SYM (6; 0% instances)