home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: ADV

There are 2506 ADV lemmas (4%), 2838 ADV types (2%) and 84005 ADV tokens (5%). Out of 17 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 7 in number of tokens.

The 10 most frequent ADV lemmas: так, еще, как, уже, очень, где, например, там, теперь, здесь

The 10 most frequent ADV types: так, как, уже, очень, еще, где, например, там, теперь, здесь

The 10 most frequent ambiguous lemmas: так (ADV 5062, SCONJ 33, PART 14, CCONJ 8, DET 1), еще (ADV 2925, PART 22), как (SCONJ 6720, ADV 2844, PART 22, CCONJ 1), уже (ADV 2333, PART 7), где (ADV 1622, SCONJ 10), там (ADV 1303, PART 9, X 1), тогда (ADV 809, X 1), тоже (ADV 786, PART 140, PRON 2, X 1), всё (PRON 2778, ADV 754, DET 1), также (ADV 715, PART 16, CCONJ 8)

The 10 most frequent ambiguous types: так (ADV 3480, SCONJ 31, CCONJ 7, PART 2), как (SCONJ 6276, ADV 2043, PART 20, X 3, CCONJ 1, DET 1), уже (ADV 2179, PART 7, ADJ 1), еще (ADV 1862, PART 11), где (ADV 1404, SCONJ 9, X 1), там (ADV 1015, PART 9, X 1), потом (ADV 805, NOUN 6), тоже (ADV 765, PART 138, DET 4, PRON 3, X 1), также (ADV 702, PART 16, CCONJ 1), больше (ADV 599, NUM 161, ADJ 67, X 1)

Morphology

The form / lemma ratio of ADV is 1.132482 (the average of all parts of speech is 2.706171).

The 1st highest number of forms (14) was observed with the lemma “очень”: О-оочень, О-очень, ООО-очень, Ооо-очень, Оч-ч-чень, Очено, о-о-очень, ооооочень, ооочень, оочень, оч, оч., очень, ошень.

The 2nd highest number of forms (7) was observed with the lemma “быстро”: б[ы]стро, быстрее, быстрей, быстро, бытрее, побыстрее, побыстрей.

The 3rd highest number of forms (6) was observed with the lemma “хорошо”: зорошо, карошо, корошо, лучше, получше, хорошо.

ADV occurs with 7 features: Degree (79005; 94% instances), PronType (31536; 38% instances), ExtPos (3194; 4% instances), Abbr (845; 1% instances), Typo (169; 0% instances), Polarity (115; 0% instances), Foreign (1; 0% instances)

ADV occurs with 22 feature-value pairs: Abbr=Yes, Degree=Cmp, Degree=Pos, Degree=Sup, ExtPos=ADJ, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=NOUN, ExtPos=PART, ExtPos=SCONJ, ExtPos=VERB, Foreign=Yes, Polarity=Neg, PronType=Dem, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot, Typo=Yes

ADV occurs with 59 feature combinations. The most frequent feature combination is Degree=Pos (44392 tokens). Examples: уже, так, очень, еще, как, ещё, где, совсем, вдруг, снова

Relations

ADV nodes are attached to their parents using 32 different relations: advmod (69673; 83% instances), parataxis:discourse (4623; 6% instances), conj (2871; 3% instances), root (2299; 3% instances), mark (1145; 1% instances), fixed (1092; 1% instances), parataxis (729; 1% instances), orphan (330; 0% instances), case (242; 0% instances), cc (181; 0% instances), advcl (138; 0% instances), ccomp (128; 0% instances), compound (100; 0% instances), appos (80; 0% instances), obl (68; 0% instances), acl:relcl (48; 0% instances), nmod (47; 0% instances), xcomp (47; 0% instances), obj (31; 0% instances), csubj (29; 0% instances), acl (25; 0% instances), list (21; 0% instances), amod (18; 0% instances), nsubj (18; 0% instances), obl:pronmod (6; 0% instances), dislocated (4; 0% instances), discourse (3; 0% instances), iobj (3; 0% instances), flat:name (2; 0% instances), vocative (2; 0% instances), obl:depict (1; 0% instances), obl:tmod (1; 0% instances)

Parents of ADV nodes belong to 17 different parts of speech: VERB (53993; 64% instances), ADJ (9627; 11% instances), NOUN (7300; 9% instances), ADV (6005; 7% instances), (2299; 3% instances), PRON (1150; 1% instances), PART (973; 1% instances), NUM (912; 1% instances), DET (638; 1% instances), PROPN (519; 1% instances), CCONJ (340; 0% instances), AUX (90; 0% instances), X (87; 0% instances), ADP (32; 0% instances), INTJ (23; 0% instances), SCONJ (11; 0% instances), SYM (6; 0% instances)

59264 (71%) ADV nodes are leaves.

16863 (20%) ADV nodes have one child.

4960 (6%) ADV nodes have two children.

2918 (3%) ADV nodes have three or more children.

The highest child degree of a ADV node is 15.

Children of ADV nodes are attached using 41 different relations: punct (13558; 36% instances), advmod (7858; 21% instances), fixed (3247; 9% instances), conj (2759; 7% instances), cc (2432; 6% instances), obl (2321; 6% instances), nsubj (1654; 4% instances), advcl (1164; 3% instances), parataxis (887; 2% instances), mark (367; 1% instances), cop (277; 1% instances), obl:tmod (214; 1% instances), parataxis:discourse (180; 0% instances), discourse (147; 0% instances), iobj (141; 0% instances), acl:relcl (136; 0% instances), vocative (113; 0% instances), case (107; 0% instances), goeswith (78; 0% instances), obl:pronmod (61; 0% instances), orphan (45; 0% instances), nmod (40; 0% instances), expl (27; 0% instances), csubj (24; 0% instances), aux (20; 0% instances), amod (16; 0% instances), appos (14; 0% instances), list (13; 0% instances), det (12; 0% instances), obj (11; 0% instances), dislocated (10; 0% instances), acl (9; 0% instances), xcomp (9; 0% instances), ccomp (6; 0% instances), obl:float (6; 0% instances), compound (5; 0% instances), flat:name (3; 0% instances), dep (2; 0% instances), flat:goeswith (1; 0% instances), nsubj:outer (1; 0% instances), nummod:gov (1; 0% instances)

Children of ADV nodes belong to 17 different parts of speech: PUNCT (13558; 36% instances), ADV (6005; 16% instances), PART (5620; 15% instances), NOUN (3391; 9% instances), CCONJ (2413; 6% instances), VERB (2126; 6% instances), PRON (1615; 4% instances), SCONJ (1591; 4% instances), ADJ (396; 1% instances), PROPN (376; 1% instances), AUX (313; 1% instances), ADP (262; 1% instances), DET (116; 0% instances), X (110; 0% instances), INTJ (38; 0% instances), NUM (31; 0% instances), SYM (15; 0% instances)