home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: VERB

There are 4372 VERB lemmas (21%), 9935 VERB types (26%) and 24747 VERB tokens (13%). Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: быть, мочь, можно, сказать, нет, хотеть, знать, делать, работать, говорить

The 10 most frequent VERB types: есть, можно, нет, может, надо, могу, делать, хочу, здравствуйте, нравится

The 10 most frequent ambiguous lemmas: быть (AUX 1316, VERB 789), мочь (VERB 590, NOUN 2), нет (VERB 297, PART 69), знать (VERB 248, NOUN 1), стать (VERB 199, NOUN 1), надо (VERB 174, ADP 2), пропасть (VERB 11, NOUN 2), некогда (ADV 4, VERB 4), пора (NOUN 67, VERB 4), пасть (VERB 3, NOUN 2)

The 10 most frequent ambiguous types: есть (VERB 403, AUX 144), нет (VERB 271, PART 46), надо (VERB 156, ADP 1), было (AUX 230, VERB 74, PART 2), быть (AUX 112, VERB 57), был (AUX 182, VERB 32), была (AUX 117, VERB 27), начала (VERB 18, NOUN 15), были (AUX 119, VERB 15), дали (VERB 18, NOUN 3)

Morphology

The form / lemma ratio of VERB is 2.272415 (the average of all parts of speech is 1.879397).

The 1st highest number of forms (21) was observed with the lemma “идти”: идем, идет, иди, идите, идти, иду, идут, идущая, идущей, идущие, идущий, идя, идём, идёт, идёшь, шедшая, шел, шла, шли, шло, шёл.

The 2nd highest number of forms (20) was observed with the lemma “иметь”: Имей, имеем, имеет, имееть, имеешь, имееют, имейте, имел, имела, имели, имело, иметь, имею, имеют, имеющее, имеющий, имеющим, имеющих, имея, имут.

The 3rd highest number of forms (20) was observed with the lemma “работать”: работавший, работаем, работает, работаешь, работал, работала, работали, работало, работать, работаю, работают, работающей, работающие, работающии, работающий, работающим, работающих, работая, работют, рботаю.

VERB occurs with 15 features: VerbForm (23645; 96% instances), Voice (23645; 96% instances), Aspect (23624; 95% instances), Number (18407; 74% instances), Tense (17934; 72% instances), Mood (16769; 68% instances), Person (10203; 41% instances), Gender (5884; 24% instances), Case (1011; 4% instances), Variant (641; 3% instances), Polarity (356; 1% instances), Typo (304; 1% instances), Animacy (116; 0% instances), Abbr (24; 0% instances), Foreign (1; 0% instances)

VERB occurs with 35 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 291 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (2928 tokens). Examples: есть, может, стоит, работает, говорит, отвечает, знает, хочет, хватает, бывает

Relations

VERB nodes are attached to their parents using 26 different relations: root (9576; 39% instances), conj (5284; 21% instances), parataxis (2171; 9% instances), xcomp (2019; 8% instances), advcl (1503; 6% instances), csubj (1280; 5% instances), acl (962; 4% instances), ccomp (821; 3% instances), acl:relcl (609; 2% instances), amod (318; 1% instances), fixed (59; 0% instances), nmod (29; 0% instances), appos (27; 0% instances), obl (17; 0% instances), obj (14; 0% instances), nsubj (11; 0% instances), flat (9; 0% instances), case (7; 0% instances), vocative (7; 0% instances), list (6; 0% instances), iobj (5; 0% instances), orphan (5; 0% instances), dep (4; 0% instances), reparandum (2; 0% instances), csubj:pass (1; 0% instances), discourse (1; 0% instances)

Parents of VERB nodes belong to 17 different parts of speech: VERB (10159; 41% instances), (9576; 39% instances), NOUN (2601; 11% instances), ADJ (1315; 5% instances), PRON (464; 2% instances), ADV (183; 1% instances), DET (116; 0% instances), PROPN (105; 0% instances), NUM (71; 0% instances), AUX (51; 0% instances), X (46; 0% instances), PART (41; 0% instances), INTJ (10; 0% instances), CCONJ (4; 0% instances), SCONJ (2; 0% instances), SYM (2; 0% instances), ADP (1; 0% instances)

1142 (5%) VERB nodes are leaves.

3061 (12%) VERB nodes have one child.

5195 (21%) VERB nodes have two children.

15349 (62%) VERB nodes have three or more children.

The highest child degree of a VERB node is 15.

Children of VERB nodes are attached using 41 different relations: punct (16918; 22% instances), obl (10281; 14% instances), nsubj (9313; 12% instances), advmod (9271; 12% instances), obj (7028; 9% instances), conj (4952; 7% instances), cc (3674; 5% instances), parataxis (2469; 3% instances), xcomp (2321; 3% instances), mark (2230; 3% instances), iobj (1996; 3% instances), advcl (1451; 2% instances), ccomp (1101; 1% instances), csubj (817; 1% instances), nsubj:pass (582; 1% instances), discourse (526; 1% instances), aux (366; 0% instances), vocative (211; 0% instances), aux:pass (152; 0% instances), obl:agent (80; 0% instances), acl (66; 0% instances), cop (56; 0% instances), expl (37; 0% instances), det (36; 0% instances), case (34; 0% instances), orphan (17; 0% instances), dep (11; 0% instances), fixed (11; 0% instances), flat (10; 0% instances), appos (8; 0% instances), nummod (8; 0% instances), dislocated (7; 0% instances), nummod:gov (6; 0% instances), flat:foreign (5; 0% instances), goeswith (5; 0% instances), list (4; 0% instances), nmod (4; 0% instances), reparandum (4; 0% instances), acl:relcl (3; 0% instances), amod (3; 0% instances), csubj:pass (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (20393; 27% instances), PUNCT (16918; 22% instances), VERB (10159; 13% instances), PRON (8128; 11% instances), ADV (6433; 8% instances), CCONJ (3671; 5% instances), PART (3539; 5% instances), SCONJ (2109; 3% instances), PROPN (1397; 2% instances), ADJ (1317; 2% instances), AUX (615; 1% instances), DET (436; 1% instances), SYM (347; 0% instances), NUM (275; 0% instances), X (140; 0% instances), ADP (109; 0% instances), INTJ (89; 0% instances)