
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: VERB

There are 2300 VERB lemmas (18%), 6741 VERB types (21%) and 14852 VERB tokens (9%). Out of 17 observed tags, the rank of VERB is: 4 in number of lemmas, 2 in number of types and 5 in number of tokens.

The 10 most frequent VERB lemmas: велѣти, быти, взяти, дати, послати, писати, бити, пожаловати, приити, сказати

The 10 most frequent VERB types: смерено, взято, велел, пожаловал, дал, писал, сказал, велѣлъ, принето, послал

The 10 most frequent ambiguous lemmas: быти (AUX 1200, VERB 379), писати (VERB 260, NOUN 1), нѣтъ (VERB 79, NOUN 1), яти (VERB 13, AUX 3), не (PART 1443, VERB 1)

The 10 most frequent ambiguous types: бысть (AUX 64, VERB 45), будет (AUX 101, VERB 34, SCONJ 11), было (AUX 121, VERB 33, PART 2), есть (AUX 91, VERB 28), быть (AUX 28, VERB 26), были (AUX 84, VERB 19), быти (AUX 40, VERB 17), будетъ (AUX 47, SCONJ 18, VERB 16, PART 2), был (AUX 35, VERB 16), была (AUX 39, VERB 14)

Morphology

The form / lemma ratio of VERB is 2.930870 (the average of all parts of speech is 2.481645).
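The ratio above can be reproduced from the counts reported at the top of this page. The sketch below assumes that "forms" here means distinct word types, so the ratio is types divided by lemmas; that assumption matches the reported figure. The function name `form_lemma_ratio` is illustrative and is not part of any UD tool.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms (types) per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts for VERB as reported on this page: 6741 types, 2300 lemmas.
verb_ratio = form_lemma_ratio(6741, 2300)
print(f"{verb_ratio:.6f}")  # 2.930870
```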

The 1st highest number of forms (58) was observed with the lemma “быти”: [е]сть, Ест, Сый, будемъ, будет, будетъ, будеть, будешъ, буди, буду, будут, будутъ, будучи, будущаго, будущей, будущий, будущими, будь, будяше, быв, бывшемъ, бывший, бывшим, бывших, бывшихъ, бывшу, бывыи, был, была, были, было, былъ, бысть, быт, быт(ь), быти, быто, быть, быша, бышя, бяше, бѣ, бѣаху, бѣста, бѣша, бꙋдет, бꙋдетъ, бꙋдꙋщіи, естъ, есть, есь, есьтя, суть, сущая, сущии, сущиим, сущимъ, єсть.

The 2nd highest number of forms (58) was observed with the lemma “взяти”: взат, взат(о), взато, взатъ, взаты, взела, взем, вземше, вземъ, вземь, взета, взя, взя[ти], взя[то], взяв, взявши, взявъ, взял, взяла, взяли, взялъ, взят, взят(ь), взята, взяти, взято, взятъ, взяты, взятыхъ, взять, взяша, взѧл, взѧла, взѧт[и], взѧт[о], взѧти, взѧто, взѧтое, взꙗ(т), взꙗтыꙗ, возмет, возметъ, возмеши, возмешъ, возми, возмут, возмꙋт, возъмутъ, возьмем, возьмутъ, въземше, възмут, възялъ, възять, възяѳъ, вѕяв, вѕял, вѕят.

The 3rd highest number of forms (48) was observed with the lemma “видѣти”: Видевъ, Видѣвъше, Видѣхо(м), вид[ѣ], виде, видев, видевше, видел, видела, видели, видети, видеть, види(м), види(т), видим, видима, видимое, видимою, видимъ, видимый, видимыя, видит, видите, видить, видиши, видишъ, видишь, видя, видялъ, видят, видяще, видящи, видѣ, видѣвше, видѣвши, видѣвъ, видѣл, видѣли, видѣлъ, видѣти, видѣх, видѣша, видꙗ, видꙗщи, виж(д), виждь, виждꙋ, вижу.

VERB occurs with 15 features: VerbForm (14677; 99% instances), Voice (14677; 99% instances), Tense (11261; 76% instances), Number (11211; 75% instances), Aspect (8392; 57% instances), Gender (5058; 34% instances), Mood (5012; 34% instances), Person (5001; 34% instances), Case (2209; 15% instances), Variant (1706; 11% instances), Reflex (236; 2% instances), Polarity (140; 1% instances), Animacy (42; 0% instances), Analyt (28; 0% instances), Typo (2; 0% instances)

VERB occurs with 39 feature-value pairs: Analyt=Yes, Animacy=Anim, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Reflex=Yes, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=PartRes, VerbForm=Sup, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 492 feature combinations. The most frequent feature combination is VerbForm=Inf|Voice=Act (1305 tokens). Examples: имати, ѣхати, быть, писать, жити, велеть, говорить, платить, ѣздити, быти
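A feature combination such as VerbForm=Inf|Voice=Act is written in the CoNLL-U FEATS notation: feature-value pairs joined by `|`, with `_` standing for an empty feature set. A minimal sketch of splitting such a string into its pairs follows; the helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats: str) -> dict[str, str]:
    """Split a CoNLL-U FEATS string into a feature -> value mapping."""
    # "_" marks a token without any morphological features.
    if not feats or feats == "_":
        return {}
    # Each pair looks like "Name=Value"; split only on the first "=".
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = parse_feats("VerbForm=Inf|Voice=Act")
print(combo)  # {'VerbForm': 'Inf', 'Voice': 'Act'}
```

Counting how often each full string occurs (for example with `collections.Counter`) would give the per-combination token counts of the kind listed above.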

Relations

VERB nodes are attached to their parents using 30 different relations: conj (5098; 34% instances), root (4145; 28% instances), advcl (1550; 10% instances), xcomp (1268; 9% instances), parataxis (695; 5% instances), acl (568; 4% instances), acl:relcl (538; 4% instances), ccomp (445; 3% instances), csubj (167; 1% instances), amod (146; 1% instances), parataxis:discourse (40; 0% instances), discourse (25; 0% instances), obj (25; 0% instances), dislocated (22; 0% instances), nmod (20; 0% instances), nsubj (19; 0% instances), obl:depict (18; 0% instances), obl (17; 0% instances), iobj (10; 0% instances), obl:pronmod (7; 0% instances), orphan (7; 0% instances), fixed (5; 0% instances), list (3; 0% instances), nsubj:pass (3; 0% instances), csubj:outer (2; 0% instances), csubj:pass (2; 0% instances), dep (2; 0% instances), flat (2; 0% instances), reparandum (2; 0% instances), appos (1; 0% instances)

Parents of VERB nodes belong to 14 different parts of speech (counting root nodes, which have no parent, as a category of their own): VERB (8302; 56% instances), [no parent: root] (4145; 28% instances), NOUN (1423; 10% instances), ADJ (431; 3% instances), PRON (252; 2% instances), DET (98; 1% instances), PROPN (97; 1% instances), ADV (67; 0% instances), AUX (15; 0% instances), NUM (14; 0% instances), PART (3; 0% instances), X (3; 0% instances), ADP (1; 0% instances), SYM (1; 0% instances)

437 (3%) VERB nodes are leaves.

1012 (7%) VERB nodes have one child.

1773 (12%) VERB nodes have two children.

11630 (78%) VERB nodes have three or more children.

The highest child degree of a VERB node is 19.
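The four child-count buckets above partition the VERB tokens, so their counts should add up to the 14852 VERB tokens reported at the top of the page. The sketch below checks that and recomputes the percentages, assuming they are rounded to whole numbers; the names `buckets` and `bucket_shares` are illustrative only.

```python
# Child-count buckets for VERB nodes, as reported on this page.
buckets = {
    "leaf": 437,
    "one child": 1012,
    "two children": 1773,
    "three or more children": 11630,
}
total_verb_tokens = 14852


def bucket_shares(counts: dict[str, int], total: int) -> dict[str, int]:
    """Return each bucket's share of `total` as a whole-number percentage."""
    return {name: round(100 * n / total) for name, n in counts.items()}


shares = bucket_shares(buckets, total_verb_tokens)
print(sum(buckets.values()) == total_verb_tokens)  # True
print(shares)
```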

Children of VERB nodes are attached using 43 different relations: punct (11415; 19% instances), obl (10191; 17% instances), cc (6954; 11% instances), obj (5400; 9% instances), conj (5201; 9% instances), nsubj (5116; 8% instances), advmod (3755; 6% instances), iobj (2840; 5% instances), advcl (1572; 3% instances), xcomp (1435; 2% instances), mark (1375; 2% instances), obl:tmod (1089; 2% instances), nsubj:pass (1029; 2% instances), parataxis (750; 1% instances), aux (542; 1% instances), ccomp (528; 1% instances), vocative (472; 1% instances), discourse (330; 1% instances), dep (143; 0% instances), aux:pass (130; 0% instances), obl:agent (88; 0% instances), obl:float (86; 0% instances), expl (69; 0% instances), obl:depict (68; 0% instances), csubj (67; 0% instances), case (52; 0% instances), acl (46; 0% instances), expl:pv (46; 0% instances), nmod (40; 0% instances), cop (39; 0% instances), dislocated (37; 0% instances), parataxis:discourse (36; 0% instances), det (29; 0% instances), appos (10; 0% instances), acl:relcl (8; 0% instances), orphan (8; 0% instances), amod (5; 0% instances), nummod:gov (5; 0% instances), nsubj:outer (4; 0% instances), csubj:pass (2; 0% instances), flat (2; 0% instances), reparandum (2; 0% instances), flat:name (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (17758; 29% instances), PUNCT (11415; 19% instances), VERB (8302; 14% instances), CCONJ (6984; 11% instances), PRON (5686; 9% instances), PROPN (2810; 5% instances), ADV (2188; 4% instances), PART (2050; 3% instances), SCONJ (1299; 2% instances), DET (744; 1% instances), AUX (735; 1% instances), ADJ (727; 1% instances), X (142; 0% instances), NUM (98; 0% instances), ADP (64; 0% instances), INTJ (12; 0% instances), SYM (3; 0% instances)