Treebank Statistics: UD_Russian-Taiga: POS Tags: ADJ
There are 10961 ADJ lemmas (19%), 34899 ADJ types (23%) and 155075 ADJ tokens (9%).
Out of 17 observed tags, the rank of ADJ is: 2 in number of lemmas, 3 in number of types and 4 in number of tokens.
The 10 most frequent ADJ lemmas: русский, первый, новый, большой, хороший, художественный, должен, литературный, маленький, разный
The 10 most frequent ADJ types: русского, XIX, нужно, большой, русской, первый, русском, разных, п., хорошо
The 10 most frequent ambiguous lemmas: русский (ADJ 2470, NOUN 77), простой (ADJ 420, NOUN 2), милый (ADJ 302, NOUN 1), золотой (ADJ 280, NOUN 1), близкий (ADJ 266, NOUN 2), старший (ADJ 241, NOUN 33), богатый (ADJ 235, NOUN 2), больной (ADJ 224, NOUN 83), малый (ADJ 214, NOUN 6), возможный (ADJ 211, ADV 1)
The 10 most frequent ambiguous types: п. (ADJ 399, NOUN 43, X 1), хорошо (ADV 311, ADJ 267, PART 23), лучше (ADJ 258, ADV 230), русский (ADJ 229, NOUN 10), русских (ADJ 226, NOUN 26), равно (ADJ 220, ADV 19), трудно (ADJ 142, ADV 19), видно (ADJ 161, ADV 17), I (ADJ 160, X 4, NUM 2), древних (ADJ 160, NOUN 2)
- п.
- хорошо
- лучше
- русский
- русских
- равно
- трудно
- видно
- I
- древних
Morphology
The form / lemma ratio of ADJ is 3.183925 (the average of all parts of speech is 2.706171).
The 1st highest number of forms (31) was observed with the lemma “черный”: черна, черная, чернее, черно, черного, черное, черноей, черной, черном, черному, черною, черную, черны, черные, черный, черным, черными, черных, чёрен, чёрная, чёрного, чёрное, чёрной, чёрном, чёрному, чёрную, чёрные, чёрный, чёрным, чёрными, чёрных.
The 2nd highest number of forms (30) was observed with the lemma “зеленый”: Зелен, Зелё-о-оные, Зелё-оные, зелена, зеленая, зелене, зеленее, зелено, зеленого, зеленое, зеленой, зеленом, зеленому, зеленую, зелены, зеленые, зеленый, зеленым, зелеными, зеленых, зелёная, зелёного, зелёное, зелёной, зелёную, зелёные, зелёный, зелёным, зелёными, зелёных.
The 3rd highest number of forms (30) was observed with the lemma “темный”: темна, темная, темнее, темней, темно, темного, темное, темной, темном, темному, темною, темную, темны, темные, темный, темным, темными, темных, тёмная, тёмного, тёмное, тёмной, тёмном, тёмному, тёмную, тёмные, тёмный, тёмным, тёмными, тёмных.
ADJ occurs with 14 features: Number (146580; 95% instances), Degree (144401; 93% instances), Case (133624; 86% instances), Gender (106254; 69% instances), Variant (13128; 8% instances), Animacy (12748; 8% instances), NumForm (9711; 6% instances), NumType (9711; 6% instances), Abbr (871; 1% instances), Typo (179; 0% instances), Poss (93; 0% instances), ExtPos (31; 0% instances), InflClass (3; 0% instances), Foreign (2; 0% instances)
ADJ occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Degree=Pos, Degree=Sup, ExtPos=ADJ, ExtPos=ADV, ExtPos=CCONJ, ExtPos=VERB, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, InflClass=Ind, NumForm=Combi, NumForm=Digit, NumForm=Roman, NumForm=Word, NumType=Frac, NumType=Ord, Number=Plur, Number=Sing, Poss=Yes, Typo=Yes, Variant=Short
ADJ occurs with 215 feature combinations.
The most frequent feature combination is Case=Nom|Degree=Pos|Gender=Masc|Number=Sing (14587 tokens).
Examples: хороший, большой, русский, добрый, старый, железный, новый, маленький, великий, отличный
Relations
ADJ nodes are attached to their parents using 36 different relations: amod (113685; 73% instances), conj (13852; 9% instances), root (10417; 7% instances), parataxis (2565; 2% instances), xcomp (2479; 2% instances), nmod (2156; 1% instances), acl (1963; 1% instances), obl (1528; 1% instances), nsubj (982; 1% instances), fixed (780; 1% instances), ccomp (758; 0% instances), advcl (716; 0% instances), obj (568; 0% instances), acl:relcl (530; 0% instances), appos (459; 0% instances), list (266; 0% instances), parataxis:discourse (253; 0% instances), iobj (183; 0% instances), orphan (167; 0% instances), obl:tmod (144; 0% instances), vocative (112; 0% instances), obl:float (103; 0% instances), obl:pronmod (100; 0% instances), csubj (79; 0% instances), compound (51; 0% instances), obl:depict (51; 0% instances), nsubj:pass (50; 0% instances), flat:name (41; 0% instances), obl:agent (15; 0% instances), cc (12; 0% instances), advmod (3; 0% instances), dislocated (2; 0% instances), flat (2; 0% instances), csubj:outer (1; 0% instances), flat:goeswith (1; 0% instances), nsubj:outer (1; 0% instances)
Parents of ADJ nodes belong to 17 different parts of speech: NOUN (115620; 75% instances), ADJ (12919; 8% instances), (10417; 7% instances), VERB (9317; 6% instances), PROPN (3216; 2% instances), PRON (1806; 1% instances), X (472; 0% instances), DET (421; 0% instances), ADV (396; 0% instances), NUM (239; 0% instances), ADP (161; 0% instances), PART (50; 0% instances), INTJ (17; 0% instances), AUX (10; 0% instances), SYM (8; 0% instances), CCONJ (4; 0% instances), SCONJ (2; 0% instances)
105684 (68%) ADJ nodes are leaves.
23197 (15%) ADJ nodes have one child.
9086 (6%) ADJ nodes have two children.
17108 (11%) ADJ nodes have three or more children.
The highest child degree of a ADJ node is 19.
Children of ADJ nodes are attached using 47 different relations: punct (34255; 31% instances), conj (14754; 13% instances), advmod (12215; 11% instances), nsubj (10102; 9% instances), cc (8494; 8% instances), obl (5936; 5% instances), cop (3371; 3% instances), parataxis (2732; 2% instances), case (2451; 2% instances), csubj (2438; 2% instances), iobj (2271; 2% instances), det (2123; 2% instances), mark (1933; 2% instances), nmod (1825; 2% instances), xcomp (1322; 1% instances), parataxis:discourse (1069; 1% instances), advcl (827; 1% instances), obl:tmod (470; 0% instances), discourse (284; 0% instances), vocative (265; 0% instances), ccomp (239; 0% instances), amod (196; 0% instances), flat (191; 0% instances), aux (186; 0% instances), acl (167; 0% instances), orphan (166; 0% instances), compound (156; 0% instances), list (144; 0% instances), appos (136; 0% instances), obj (134; 0% instances), acl:relcl (128; 0% instances), nsubj:pass (119; 0% instances), nummod:gov (78; 0% instances), expl (59; 0% instances), obl:float (46; 0% instances), fixed (42; 0% instances), nummod (37; 0% instances), aux:pass (16; 0% instances), obl:agent (16; 0% instances), goeswith (9; 0% instances), dislocated (4; 0% instances), flat:goeswith (4; 0% instances), flat:name (4; 0% instances), csubj:outer (2; 0% instances), dep (2; 0% instances), nsubj:outer (1; 0% instances), obl:depict (1; 0% instances)
Children of ADJ nodes belong to 17 different parts of speech: PUNCT (34255; 31% instances), NOUN (14797; 13% instances), ADJ (12919; 12% instances), ADV (9627; 9% instances), VERB (9028; 8% instances), CCONJ (8332; 7% instances), PRON (6096; 5% instances), PART (3836; 3% instances), AUX (3578; 3% instances), DET (2908; 3% instances), ADP (2284; 2% instances), SCONJ (2004; 2% instances), PROPN (1051; 1% instances), NUM (325; 0% instances), X (143; 0% instances), INTJ (139; 0% instances), SYM (98; 0% instances)