home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-PUD: POS Tags: VERB

There are 876 VERB lemmas (17%), 1511 VERB types (19%) and 2115 VERB tokens (11%). Out of 17 observed tags, the rank of VERB is: 3 in number of lemmas, 3 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: мочь, стать, являться, сказать, получить, иметь, говорить, находиться, сделать, использовать

The 10 most frequent VERB types: является, может, сказал, говорит, стало, могут, можно, заявил, находится, стал

The 10 most frequent ambiguous lemmas: следовать (VERB 2, ADJ 1), правило (NOUN 7, VERB 1), фиксировать (ADJ 1, VERB 1)

The 10 most frequent ambiguous types: начала (VERB 5, NOUN 3), правил (VERB 2, NOUN 1), смог (VERB 2, NOUN 1), улучшенный (ADJ 1, VERB 1)

Morphology

The form / lemma ratio of VERB is 1.724886 (the average of all parts of speech is 1.496727).

The 1st highest number of forms (11) was observed with the lemma “использовать”: использовала, использовали, использован, использована, использованные, использовано, использованы, использовать, используемые, используют, используя.

The 2nd highest number of forms (10) was observed with the lemma “иметь”: имеет, имел, имела, имели, имело, иметь, имеют, имеющий, имеющих, имеющую.

The 3rd highest number of forms (10) was observed with the lemma “сделать”: сделал, сделала, сделали, сделало, сделан, сделанную, сделанные, сделанных, сделано, сделать.

VERB occurs with 12 features: Aspect (2099; 99% instances), VerbForm (2099; 99% instances), Voice (2099; 99% instances), Tense (1777; 84% instances), Number (1713; 81% instances), Mood (1352; 64% instances), Gender (811; 38% instances), Person (563; 27% instances), Case (203; 10% instances), Variant (158; 7% instances), Animacy (24; 1% instances), Abbr (3; 0% instances)

VERB occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass

VERB occurs with 149 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (201 tokens). Examples: может, говорит, имеет, нет, работает, составляет, делает, помогает, стоит, включает

Relations

VERB nodes are attached to their parents using 16 different relations: root (839; 40% instances), conj (247; 12% instances), acl (236; 11% instances), xcomp (195; 9% instances), advcl (179; 8% instances), acl:relcl (148; 7% instances), ccomp (101; 5% instances), parataxis (101; 5% instances), csubj (47; 2% instances), amod (11; 1% instances), fixed (3; 0% instances), nmod (3; 0% instances), appos (2; 0% instances), obj (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Parents of VERB nodes belong to 13 different parts of speech: (839; 40% instances), VERB (722; 34% instances), NOUN (373; 18% instances), ADJ (78; 4% instances), PRON (44; 2% instances), PROPN (23; 1% instances), AUX (13; 1% instances), ADV (12; 1% instances), DET (5; 0% instances), NUM (3; 0% instances), PART (1; 0% instances), SYM (1; 0% instances), X (1; 0% instances)

55 (3%) VERB nodes are leaves.

157 (7%) VERB nodes have one child.

270 (13%) VERB nodes have two children.

1633 (77%) VERB nodes have three or more children.

The highest child degree of a VERB node is 9.

Children of VERB nodes are attached using 27 different relations: punct (1883; 25% instances), obl (1332; 17% instances), nsubj (1148; 15% instances), obj (740; 10% instances), advmod (534; 7% instances), xcomp (300; 4% instances), conj (252; 3% instances), mark (236; 3% instances), cc (230; 3% instances), nsubj:pass (182; 2% instances), iobj (161; 2% instances), advcl (157; 2% instances), parataxis (134; 2% instances), aux:pass (127; 2% instances), ccomp (113; 1% instances), aux (32; 0% instances), csubj (26; 0% instances), obl:agent (12; 0% instances), discourse (4; 0% instances), det (2; 0% instances), nmod (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances), appos (1; 0% instances), case (1; 0% instances), nummod (1; 0% instances), vocative (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (2599; 34% instances), PUNCT (1883; 25% instances), VERB (722; 9% instances), PRON (645; 8% instances), ADV (420; 6% instances), PROPN (402; 5% instances), CCONJ (253; 3% instances), SCONJ (219; 3% instances), AUX (172; 2% instances), ADJ (111; 1% instances), PART (109; 1% instances), NUM (27; 0% instances), ADP (25; 0% instances), SYM (12; 0% instances), DET (9; 0% instances), X (4; 0% instances), INTJ (1; 0% instances)