Treebank Statistics: UD_Russian-Taiga: POS Tags: ADV
There are 921 ADV lemmas (4%), 1033 ADV types (3%) and 10859 ADV tokens (6%).
Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 7 in number of tokens.
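Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: `SAMPLE` is a hypothetical two-sentence fragment, `pos_counts` is an illustrative helper name, and word forms are compared as written (case-sensitive), which is an assumption about how types are counted.

```python
# Tiny hypothetical CoNLL-U sample (not taken from the treebank).
SAMPLE = """\
1\tОчень\tочень\tADV\t_\tDegree=Pos\t2\tadvmod\t_\t_
2\tхорошо\tхорошо\tADV\t_\tDegree=Pos\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_

1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_
2\tочень\tочень\tADV\t_\tDegree=Pos\t3\tadvmod\t_\t_
3\tустал\tустать\tVERB\t_\t_\t0\troot\t_\t_
"""


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])   # FORM column, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "ADV"))
```

Run over the full treebank for every tag, the three numbers can be sorted to obtain ranks like those reported above.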
The 10 most frequent ADV lemmas: очень, так, как, еще, там, уже, где, всегда, сейчас, здесь
The 10 most frequent ADV types: очень, так, как, там, уже, где, всегда, еще, сейчас, здесь
The 10 most frequent ambiguous lemmas: так (ADV 633, SCONJ 5, PART 3, DET 1), как (SCONJ 732, ADV 476, CCONJ 1, PART 1), там (ADV 380, PART 2), где (ADV 256, SCONJ 9), вроде (ADV 76, ADP 14), конечно (ADV 73, PART 1), хорошо (ADV 71, PART 4, ADJ 1), всё (PRON 750, ADV 62, DET 1), пока (ADV 62, SCONJ 45), также (ADV 51, PART 16, CCONJ 8)
The 10 most frequent ambiguous types: так (ADV 480, SCONJ 4, PART 2), как (SCONJ 657, ADV 320, X 3, CCONJ 1, DET 1, PART 1), там (ADV 328, PART 2), где (ADV 210, SCONJ 8, X 1), потом (ADV 118, NOUN 1), больше (ADV 102, NUM 30, ADJ 16, X 1), рядом (ADV 68, NOUN 1), быстро (ADV 66, ADJ 9), вроде (ADV 56, ADP 11), конечно (ADV 60, PART 1)
- так
- как
  - SCONJ 657: как красиво )))
  - ADV 320: А как туда добраться лучше ?
  - X 3: поэтому кое как на 4 !!!
  - CCONJ 1: Цвет росписей мог варьироваться от чёрного до коричневатого , фиолетового и тёмно-зеленого ; в зависимости от обжига керамика могла приобретать как цвет бычьей кожи , так и красно-коричневый или даже зелено-коричневый оттенки .
  - DET 1: Но никогда ни при как их условии никогда что там у россиян ?
  - PART 1: дай - как , я свечу задую …
- там
- где
- потом
- больше
- рядом
- быстро
- вроде
- конечно
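The ambiguity lists above pair each word with the tags it receives and their frequencies. A minimal sketch of how such a list could be built, assuming the input is a sequence of (word, UPOS) pairs; the helper name `ambiguous` and the toy input are hypothetical:

```python
from collections import Counter, defaultdict


def ambiguous(pairs):
    """Map each word seen with more than one tag to its (tag, count) list."""
    by_word = defaultdict(Counter)
    for word, upos in pairs:
        by_word[word][upos] += 1
    # Keep only words with at least two distinct tags, most frequent tag first.
    return {w: c.most_common() for w, c in by_word.items() if len(c) > 1}


# Toy input for illustration only; not treebank data.
pairs = [("как", "SCONJ"), ("как", "ADV"), ("как", "SCONJ"), ("там", "ADV")]
print(ambiguous(pairs))
```

Passing lemmas instead of word forms as the first element of each pair gives the lemma-level variant of the same list.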
Morphology
The form / lemma ratio of ADV is 1.121607 (the average of all parts of speech is 1.879397).
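The reported ratio is consistent with dividing the number of ADV types by the number of ADV lemmas given at the top of this page. This reading is inferred from the figures rather than stated by the source, so treat it as an assumption:

```python
adv_lemmas = 921   # ADV lemmas reported on this page
adv_types = 1033   # ADV types (distinct word forms) reported on this page

ratio = adv_types / adv_lemmas
print(f"{ratio:.6f}")  # 1.121607
```

A value close to 1 means that most ADV lemmas occur in a single form, which fits a largely non-inflecting word class.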
The 1st highest number of forms (9) was observed with the lemma “очень”: О-оочень, ООО-очень, Ооо-очень, ооооочень, ооочень, оочень, оч, оч., очень.
The 2nd highest number of forms (6) was observed with the lemma “быстро”: б[ы]стро, быстрее, быстрей, быстро, бытрее, побыстрее.
The 3rd highest number of forms (5) was observed with the lemma “как-то”: Как-, как, как-то, как_-то, както.
ADV occurs with 6 features: Degree (10685; 98% instances), PronType (4292; 40% instances), Abbr (171; 2% instances), Typo (127; 1% instances), Polarity (6; 0% instances), Foreign (1; 0% instances)
ADV occurs with 14 feature-value pairs: Abbr=Yes, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Polarity=Neg, PronType=Dem, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot, Typo=Yes
ADV occurs with 26 feature combinations.
The most frequent feature combination is Degree=Pos (6081 tokens).
Examples: очень, уже, так, как, еще, там, ещё, где, часто, вообще
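Features, feature-value pairs and feature combinations are three views of the same FEATS column. A minimal sketch of the distinction, assuming FEATS strings in the standard CoNLL-U `Name=Value|Name=Value` form; the helper name `feature_stats` and the toy input are hypothetical:

```python
from collections import Counter


def feature_stats(feats_values):
    """Count feature names, feature=value pairs and whole combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1          # the full FEATS string is one combination
        if feats == "_":            # "_" marks a token without features
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Toy FEATS values for illustration only; not treebank data.
toy = ["Degree=Pos", "Degree=Pos|PronType=Dem", "Degree=Pos", "_"]
features, pairs, combos = feature_stats(toy)
print(features, pairs, combos.most_common(1))
```

In this toy input `Degree` occurs on three tokens, while the combination `Degree=Pos` on its own occurs on only two, which is why the per-feature and per-combination counts on this page differ.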
Relations
ADV nodes are attached to their parents using 23 different relations: advmod (9192; 85% instances), parataxis (436; 4% instances), conj (369; 3% instances), root (306; 3% instances), mark (180; 2% instances), fixed (126; 1% instances), orphan (49; 0% instances), advcl (39; 0% instances), ccomp (25; 0% instances), case (24; 0% instances), cc (22; 0% instances), acl:relcl (16; 0% instances), amod (16; 0% instances), appos (12; 0% instances), acl (11; 0% instances), compound (9; 0% instances), nmod (8; 0% instances), csubj (6; 0% instances), obl (5; 0% instances), xcomp (4; 0% instances), dislocated (2; 0% instances), iobj (1; 0% instances), obj (1; 0% instances)
Parents of ADV nodes belong to 17 different parts of speech: VERB (6434; 59% instances), ADJ (1429; 13% instances), NOUN (1173; 11% instances), ADV (698; 6% instances), [no parent: root] (306; 3% instances), NUM (219; 2% instances), PRON (179; 2% instances), PART (106; 1% instances), DET (98; 1% instances), PROPN (93; 1% instances), AUX (86; 1% instances), CCONJ (12; 0% instances), X (10; 0% instances), INTJ (6; 0% instances), SYM (4; 0% instances), ADP (3; 0% instances), SCONJ (3; 0% instances)
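Relation and parent statistics like these come from the HEAD and DEPREL columns of a CoNLL-U file. A minimal sketch, not the script behind this page: `SAMPLE` is a hypothetical fragment, `attachment_stats` is an illustrative helper name, and a node whose HEAD is 0 is recorded with an empty parent tag, matching the 306 root-attached nodes counted among the parents.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U sample (not taken from the treebank).
SAMPLE = """\
1\tОн\tон\tPRON\t_\t_\t3\tnsubj\t_\t_
2\tочень\tочень\tADV\t_\tDegree=Pos\t3\tadvmod\t_\t_
3\tустал\tустать\tVERB\t_\t_\t0\troot\t_\t_

1\tХорошо\tхорошо\tADV\t_\tDegree=Pos\t0\troot\t_\t_
2\t!\t!\tPUNCT\t_\t_\t1\tpunct\t_\t_
"""


def attachment_stats(conllu_text, upos):
    """Count incoming relations and parent UPOS tags for nodes tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        rows = [r for r in rows if r[0].isdigit()]   # plain word lines only
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1                   # DEPREL column
                # HEAD 0 has no entry, so the root's parent tag is "".
                parent_pos[pos_by_id.get(r[6], "")] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(SAMPLE, "ADV")
print(relations, parent_pos)
```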
8160 (75%) ADV nodes are leaves.
1821 (17%) ADV nodes have one child.
474 (4%) ADV nodes have two children.
404 (4%) ADV nodes have three or more children.
The highest child degree of an ADV node is 8.
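As a quick consistency check, the four child-count buckets above sum to the 10859 ADV tokens reported at the top of the page, and the rounded shares match the stated percentages. The dictionary keys below are illustrative labels only:

```python
# Number of ADV nodes by number of children, as reported above.
child_counts = {"0": 8160, "1": 1821, "2": 474, "3+": 404}

total = sum(child_counts.values())
shares = {k: round(100 * v / total) for k, v in child_counts.items()}

print(total)   # 10859
print(shares)  # {'0': 75, '1': 17, '2': 4, '3+': 4}
```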
Children of ADV nodes are attached using 33 different relations: punct (1083; 25% instances), advmod (1002; 23% instances), fixed (393; 9% instances), nsubj (339; 8% instances), cc (331; 8% instances), conj (298; 7% instances), obl (288; 7% instances), parataxis (110; 3% instances), mark (94; 2% instances), advcl (76; 2% instances), goeswith (75; 2% instances), cop (40; 1% instances), case (25; 1% instances), discourse (22; 1% instances), iobj (15; 0% instances), nmod (12; 0% instances), orphan (12; 0% instances), acl:relcl (11; 0% instances), acl (9; 0% instances), aux (6; 0% instances), expl (5; 0% instances), vocative (5; 0% instances), ccomp (4; 0% instances), csubj (4; 0% instances), dislocated (4; 0% instances), appos (3; 0% instances), dep (2; 0% instances), det (2; 0% instances), list (2; 0% instances), amod (1; 0% instances), nsubj:outer (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)
Children of ADV nodes belong to 17 different parts of speech: PUNCT (1083; 25% instances), PART (701; 16% instances), ADV (698; 16% instances), NOUN (515; 12% instances), CCONJ (330; 8% instances), SCONJ (261; 6% instances), PRON (210; 5% instances), VERB (183; 4% instances), X (78; 2% instances), ADJ (50; 1% instances), AUX (49; 1% instances), PROPN (45; 1% instances), ADP (40; 1% instances), DET (11; 0% instances), SYM (11; 0% instances), NUM (6; 0% instances), INTJ (5; 0% instances)