This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: VERB

There are 687 VERB lemmas (14%), 2050 VERB types (17%) and 2641 VERB tokens (9%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 3 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: взѧти, прислати, дати, послати, быти, вдати, ити, даꙗти, бити, хотѣти

The 10 most frequent VERB types: возми, даи, далъ, пришли, шло, възьми, посли, присъли, възѧле, дале

The 10 most frequent ambiguous lemmas: дати (VERB 82, SCONJ 9), быти (AUX 372, VERB 68), _ (X 26, NUM 3, VERB 3, PROPN 2, PUNCT 2, DET 1), дѣти (NOUN 51, VERB 1), на… (VERB 1, X 1), ни (PART 44, CCONJ 11, X 2, VERB 1)

The 10 most frequent ambiguous types: дати (VERB 6, SCONJ 2), нѣ (VERB 5, CCONJ 2, PART 2), покланѧю (VERB 5, PRON 1), буде (AUX 4, VERB 4), бѹде (VERB 4, AUX 1), бѹдѹ (VERB 4, AUX 2), бꙑло (AUX 6, VERB 4), въ (ADP 88, VERB 4), дать (SCONJ 4, VERB 4), хотѧ (PART 5, VERB 4)

Morphology

The form / lemma ratio of VERB is 2.983988 (the average of all parts of speech is 2.421872).
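The reported ratio is consistent with dividing the number of VERB types by the number of VERB lemmas quoted earlier on this page. The following is a minimal sketch of that arithmetic; the function name is hypothetical, and the assumption that the statistic is computed exactly this way is inferred from the numbers rather than stated by the page.

```python
# Counts quoted in the statistics above.
verb_types = 2050   # distinct VERB word forms (types)
verb_lemmas = 687   # distinct VERB lemmas


def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct forms per lemma (hypothetical helper)."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(verb_types, verb_lemmas)
print(f"{ratio:.6f}")  # prints 2.983988
```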

The 1st highest number of forms (112) was observed with the lemma “взѧти”: (в)[ъ]зми, (в)[ъ]зьми, (в)зѧло, (в)озьми, (въ)[зъм]и, (въ)зѧ[т]и, (въз)ем[и], (възьми), -т[ое], [в]ъзѧла, [во]зьми, [въз]ѧ[ле, [възьми], в)зѧт[и, взми, в[озѧти], в[ъ]земи, в[ъз]ьми, в[ъзь]ми, в, взьмъ, взѧ, взѧ)[л]ѣ, взѧлъ, взѧв·ъ, взѧл, взѧл[и], взѧла, взѧле, взѧли, взѧло, взѧлъ, взѧлѣ, взѧл, взѧти, взѧтъ, взѧть, вз…, во)з…, возми, возмѧ, возѧти, во[зм]и, воз(мите), возми, воз[е]ми, воземеше, воземи, воземо, возмете, возметъ, возми, возмь, возмѹ, возьм, возѧ, возѧти, возѧле, возѧло, возѧлъ, возѧль, возѧти, возѧто, возѧтъ, восми, во…, всѧло, въ(зѧ)ль, въземи, въ·змуть, възмѧ, въз[ь]ми, въз[ѧти], възалъ, въземи, въземо, въземѹ, възимить, възме, възмете, възми, възмите, възмї, възм…, възмꙋ, възъми, възъмъ, възь(ми), възь[ми], възьль, възьми, възьму, възьмъши, възьмѧ, възѣми, възѧти, възѧ[ти, възѧл[е, възѧле, възѧлъ, възѧль, възѧль], възѧти, възѧто, вь:зѧлъ, вьз[ьми], ѹвзѧлъ, ѹзѧле, ѹзѧлѣ, ѹзѧти, …мите, ꙋзѧле.

The 2nd highest number of forms (58) was observed with the lemma “прислати”: (п)[рисл]а[в]о, (п)ришлю, (пр)[и]съли, ——–шли, [п]рис[ъ]…, [прис]ли[те], [присъ]л[е]ши, прислать, п[р]и[с]ъ[л](и), п[ри]сълале, пр, при:съ:ле:ши, при:съ:ли, при(с)[л]ати, прис[ъли, прислалъ, присли, присоли, пришли, пришьлить, при[с]ли, при[ш]и, при·сли, присли, присъ, прис[ъ]лали, присл{л}авъши, прислаль, приславъ, прислале, прислало, прислалъ, прислати, прислеши, присли, прислите, прислꙑ, присолеши, присоли, присълале, присълана, присъле, присълеши, присъли, присълѹ, присъте, пришли, приш[ли], приши·ли, пришл(и), пришл[и], пришле, пришлете, пришли, пришлите, пришльши, пришлю, …сълеть.

The 3rd highest number of forms (53) was observed with the lemma “вдати”: (в)[ъда]ж[ъ], (въда)и, [в]ъдамъ, [въда](ти), в(да)ль, вдали, вдало, во[д]…, во[дад]ѧт[ь], во·да·ти, вод[ал-, вода, вода)ти, вода…, вода[д]ить, вода[н]о, водавоше, водадѧ, водала, водале, водамо, водано, водаси, водасте, водат(и), водати, въдажь, въдаль, въ[д]а[л]ь, въд)[адѧ], въд[а]ле, въд]але, въда, въда…, въда[д]ѧ, въда[но], въдавоше, въдадите, въдади…, въдадѧть, въдаже, въдал[ь], въдала, въдале, въдало, въдалъ, въдаль, въдамъ, въдаси, въдасть, въдати, въделе, ꙋдасте.

VERB occurs with 13 features: Voice (2578; 98% instances), VerbForm (2555; 97% instances), Number (2342; 89% instances), Tense (1610; 61% instances), Person (1473; 56% instances), Gender (827; 31% instances), Mood (779; 29% instances), Case (191; 7% instances), Analyt (63; 2% instances), Variant (20; 1% instances), Fragment (9; 0% instances), Polarity (3; 0% instances), Typo (1; 0% instances)

VERB occurs with 34 feature-value pairs: Analyt=Yes, Case=Acc, Case=Dat, Case=Gen, Case=Loc, Case=Nom, Fragment=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pqp, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=PartRes, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 157 feature combinations. The most frequent feature combination is Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Act (592 tokens). Examples: возми, даи, възьми, пришли, посли, присъли, иди, молови, присли, въдаи
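A feature combination such as the one above is a string of `Feature=Value` pairs joined by `|`, in the style of the CoNLL-U FEATS column. As an illustration only, it can be split into a dictionary as sketched below; `parse_feats` is a hypothetical helper, not part of any UD tool.

```python
def parse_feats(feats):
    """Split a 'Feature=Value|Feature=Value' string into a dict.

    An underscore (the CoNLL-U placeholder for 'no features') or an
    empty string yields an empty dict.
    """
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Mood=Imp|Number=Sing|Person=2|VerbForm=Fin|Voice=Act"
parsed = parse_feats(combo)
print(parsed["Mood"], parsed["Person"])  # prints: Imp 2
```

Each of the five keys in `parsed` is one of the 13 features listed above, and each key-value pair is one of the 34 feature-value pairs.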

Relations

VERB nodes are attached to their parents using 22 different relations: root (1471; 56% instances), conj (457; 17% instances), advcl (315; 12% instances), parataxis (116; 4% instances), xcomp (91; 3% instances), ccomp (50; 2% instances), dislocated (35; 1% instances), dep (27; 1% instances), acl:relcl (22; 1% instances), acl (20; 1% instances), amod (10; 0% instances), csubj (10; 0% instances), orphan (5; 0% instances), list (3; 0% instances), iobj (2; 0% instances), appos (1; 0% instances), case (1; 0% instances), fixed (1; 0% instances), nsubj (1; 0% instances), obj (1; 0% instances), obl (1; 0% instances), reparandum (1; 0% instances)

Parents of VERB nodes belong to 15 different parts of speech: [no parent; root] (1471; 56% instances), VERB (928; 35% instances), NOUN (117; 4% instances), X (45; 2% instances), ADJ (20; 1% instances), PRON (15; 1% instances), PROPN (15; 1% instances), DET (14; 1% instances), NUM (7; 0% instances), ADV (2; 0% instances), AUX (2; 0% instances), CCONJ (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances), SYM (1; 0% instances)
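Both breakdowns above classify every VERB token exactly once, so each should add up to the 2641 VERB tokens reported at the top of the page. The sketch below checks this using only counts copied from the two lists; the variable names are illustrative. It also shows that the first entry of the parent list (1471) equals the number of root attachments, since a root node has no parent word.

```python
verb_tokens = 2641  # total VERB tokens, from the summary at the top

# Incoming relation counts, in the order listed above (22 relations).
relation_counts = {
    "root": 1471, "conj": 457, "advcl": 315, "parataxis": 116,
    "xcomp": 91, "ccomp": 50, "dislocated": 35, "dep": 27,
    "acl:relcl": 22, "acl": 20, "amod": 10, "csubj": 10,
    "orphan": 5, "list": 3, "iobj": 2, "appos": 1, "case": 1,
    "fixed": 1, "nsubj": 1, "obj": 1, "obl": 1, "reparandum": 1,
}

# Parent part-of-speech counts, in the order listed above (15 entries).
# The first entry has no tag on the page; it is keyed here as "(none)".
parent_pos_counts = {
    "(none)": 1471, "VERB": 928, "NOUN": 117, "X": 45, "ADJ": 20,
    "PRON": 15, "PROPN": 15, "DET": 14, "NUM": 7, "ADV": 2,
    "AUX": 2, "CCONJ": 2, "ADP": 1, "PART": 1, "SYM": 1,
}

relation_total = sum(relation_counts.values())
parent_total = sum(parent_pos_counts.values())
print(relation_total, parent_total)  # prints: 2641 2641
```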

88 (3%) VERB nodes are leaves.

248 (9%) VERB nodes have one child.

500 (19%) VERB nodes have two children.

1805 (68%) VERB nodes have three or more children.

The highest child degree of a VERB node is 19.
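The four child-count groups above partition the 2641 VERB tokens, and the percentages follow from dividing each count by that total. A short sketch of the calculation, with illustrative variable names:

```python
verb_tokens = 2641  # total VERB tokens, from the summary at the top

# Number of VERB nodes by number of children, as listed above.
child_buckets = {
    "leaves": 88,
    "one child": 248,
    "two children": 500,
    "three or more children": 1805,
}

# The buckets should cover every VERB token exactly once.
bucket_total = sum(child_buckets.values())

# Percentage share of each bucket, rounded to a whole percent.
shares = {
    label: round(100 * count / verb_tokens)
    for label, count in child_buckets.items()
}

for label, pct in shares.items():
    print(f"{label}: {child_buckets[label]} ({pct}%)")
```

The rounded shares come out as 3%, 9%, 19% and 68%, matching the figures on the page; because of rounding they add up to 99 rather than 100.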

Children of VERB nodes are attached using 33 different relations: punct (1195; 14% instances), obj (1132; 13% instances), obl (1044; 12% instances), cc (917; 11% instances), advmod (804; 9% instances), nsubj (641; 7% instances), iobj (538; 6% instances), conj (482; 6% instances), dep (370; 4% instances), advcl (314; 4% instances), mark (273; 3% instances), aux (264; 3% instances), parataxis (170; 2% instances), vocative (148; 2% instances), xcomp (100; 1% instances), expl (80; 1% instances), ccomp (57; 1% instances), dislocated (43; 0% instances), nsubj:pass (28; 0% instances), case (11; 0% instances), det (7; 0% instances), reparandum (7; 0% instances), aux:pass (5; 0% instances), csubj (5; 0% instances), fixed (4; 0% instances), orphan (4; 0% instances), appos (3; 0% instances), acl (2; 0% instances), cop (2; 0% instances), list (2; 0% instances), nmod (2; 0% instances), obl:agent (2; 0% instances), amod (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: NOUN (1908; 22% instances), PUNCT (1195; 14% instances), PRON (980; 11% instances), VERB (928; 11% instances), CCONJ (916; 11% instances), PROPN (598; 7% instances), PART (528; 6% instances), X (337; 4% instances), ADV (296; 3% instances), AUX (279; 3% instances), SCONJ (277; 3% instances), DET (219; 3% instances), ADJ (106; 1% instances), NUM (37; 0% instances), ADP (29; 0% instances), SYM (24; 0% instances)