Treebank Statistics: UD_Russian-Poetry: POS Tags: VERB
There are 2806 VERB lemmas (28%), 5440 VERB types (30%) and 8234 VERB tokens (13%).
Out of 17 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.
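The lemma, type and token counts above can be reproduced in principle from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the three-word sample are hypothetical, and it assumes that a "type" is a distinct, case-sensitive word form, which is consistent with forms such as "Идёт" and "идет" being listed separately further down.

```python
# Hypothetical two-sentence CoNLL-U sample (10 tab-separated columns per token).
SAMPLE = "\n".join([
    "# text = Я знаю, знаю.",
    "1\tЯ\tя\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tзнаю\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t,\t,\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "4\tзнаю\tзнать\tVERB\t_\t_\t2\tconj\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# text = Он знает.",
    "1\tОн\tон\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tзнает\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:  # column 4 is UPOS
            tokens += 1
            types.add(cols[1])   # column 2 is FORM (kept case-sensitive here)
            lemmas.add(cols[2])  # column 3 is LEMMA
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "VERB"))  # (1, 2, 3)
```

On the sample, the single lemma "знать" appears as two types ("знаю", "знает") across three tokens.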
The 10 most frequent VERB lemmas: знать, быть, нет, идти, любить, мочь, жить, стать, видеть, петь
The 10 most frequent VERB types: нет, знаю, может, надо, стоит, быть, жить, есть, люблю, вижу
The 10 most frequent ambiguous lemmas: знать (VERB 111, NOUN 1), быть (AUX 236, VERB 102), нет (VERB 85, PART 33), мочь (VERB 81, NOUN 1), стать (VERB 66, NOUN 1), надо (VERB 33, ADP 10), пасть (VERB 11, NOUN 5), пора (NOUN 37, VERB 10), пропасть (VERB 4, NOUN 2), лень (NOUN 4, VERB 2)
The 10 most frequent ambiguous types: нет (VERB 73, PART 14), надо (VERB 30, ADP 10), быть (VERB 25, AUX 16), есть (VERB 21, AUX 2), был (AUX 50, VERB 5), будет (AUX 31, VERB 8), жил (VERB 9, NOUN 1), пора (VERB 9, NOUN 5), было (AUX 27, VERB 8), стали (VERB 7, NOUN 5)
Morphology
The form / lemma ratio of VERB is 1.938703 (the average of all parts of speech is 1.831021).
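The reported ratio matches the number of VERB types divided by the number of VERB lemmas given at the top of this page. A short check, with the hypothetical helper name `form_lemma_ratio`:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# 5440 VERB types and 2806 VERB lemmas, as reported above.
ratio = form_lemma_ratio(5440, 2806)
print(f"{ratio:.6f}")  # 1.938703
```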
The highest number of forms (18) was observed with the lemma “забыть”: Забыты, забудем, забудешь, забуду, забудут, забудь, забыв, забывший, забыл, забыла, забыли, забытая, забытое, забытой, забытые, забытый, забытых, забыть.
The second highest number of forms (17) was observed with the lemma “знать”: Знала, знавшие, знаем, знает, знаете, знаешь, знай, знайте, знал, знали, знать, знаю, знают, знающей, знающем, знающие, зная.
The third highest number of forms (15) was observed with the lemma “идти”: Идучи, Идущей, Идя, Идёт, идем, идет, иди, идти, иду, идут, идущих, шел, шла, шли, шёл.
VERB occurs with 14 features: VerbForm (8051; 98% instances), Voice (8051; 98% instances), Aspect (7949; 97% instances), Tense (6766; 82% instances), Number (6720; 82% instances), Mood (5758; 70% instances), Person (3682; 45% instances), Gender (2263; 27% instances), Case (720; 9% instances), Variant (247; 3% instances), Polarity (114; 1% instances), Animacy (88; 1% instances), Reflex (35; 0% instances), Typo (4; 0% instances)
VERB occurs with 34 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Reflex=Yes, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass
VERB occurs with 183 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (944 tokens).
Examples: может, стоит, поет, знает, пахнет, проходит, идет, значит, смотрит, зовет
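A feature combination such as the one above is a `|`-separated list of `Feature=Value` pairs, as in the FEATS column of CoNLL-U. The sketch below splits such a string into a dictionary; `parse_feats` is a hypothetical helper, not part of any UD tool, and it assumes `_` marks an empty feature set.

```python
def parse_feats(feats: str) -> dict:
    """Split a FEATS string like 'Aspect=Imp|Mood=Ind' into a dict."""
    if not feats or feats == "_":  # assumption: '_' means no features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = (
    "Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act"
)
feats = parse_feats(most_frequent)
print(len(feats))       # 7
print(feats["Person"])  # 3
```

Read this way, the most frequent combination describes imperfective, third-person singular, present-tense, active finite verbs, which fits the listed examples.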
Relations
VERB nodes are attached to their parents using 21 different relations: root (3271; 40% instances), conj (2212; 27% instances), advcl (750; 9% instances), parataxis (447; 5% instances), acl (353; 4% instances), amod (325; 4% instances), xcomp (313; 4% instances), acl:relcl (162; 2% instances), csubj (140; 2% instances), ccomp (136; 2% instances), parataxis:discourse (66; 1% instances), nsubj (11; 0% instances), obj (11; 0% instances), csubj:pass (8; 0% instances), nmod (8; 0% instances), iobj (5; 0% instances), fixed (4; 0% instances), obl (4; 0% instances), obl:depict (4; 0% instances), appos (3; 0% instances), obl:agent (1; 0% instances)
Parents of VERB nodes belong to 10 different parts of speech: VERB (3493; 42% instances), [root] (3271; 40% instances), NOUN (931; 11% instances), ADJ (308; 4% instances), PRON (96; 1% instances), ADV (65; 1% instances), DET (50; 1% instances), PROPN (13; 0% instances), NUM (5; 0% instances), PART (2; 0% instances)
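The second entry in the list of parent categories has the same count (3271) as the root relation, which suggests that it stands for VERB nodes attached to the artificial root, which has no part of speech. The sketch below tallies parent UPOS tags per sentence and shows how such an entry arises. The function name `parent_upos_of`, the `ROOT` label and the sample are hypothetical; this is not the script behind the page.

```python
from collections import Counter

# Hypothetical two-sentence CoNLL-U sample (HEAD is column 7, 0 = root).
SAMPLE = "\n".join([
    "# text = Я знаю, знаю.",
    "1\tЯ\tя\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tзнаю\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t,\t,\tPUNCT\t_\t_\t4\tpunct\t_\t_",
    "4\tзнаю\tзнать\tVERB\t_\t_\t2\tconj\t_\t_",
    "5\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
    "# text = Он знает.",
    "1\tОн\tон\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tзнает\tзнать\tVERB\t_\t_\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])


def parent_upos_of(conllu_text, upos):
    """Count the UPOS of the parent of every node tagged `upos`."""
    counts = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        rows = [
            line.split("\t")
            for line in block.splitlines()
            if line and not line.startswith("#")
        ]
        # Keep only plain word lines (skip ranges like 1-2 and empty nodes).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        pos_by_id = {r[0]: r[3] for r in rows}
        for r in rows:
            if r[3] == upos:
                # HEAD "0" is not a word ID, so it falls back to "ROOT".
                counts[pos_by_id.get(r[6], "ROOT")] += 1
    return counts


print(parent_upos_of(SAMPLE, "VERB"))
```

On the sample, two VERB nodes hang from the root and one from another VERB.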
474 (6%) VERB nodes are leaves.
623 (8%) VERB nodes have one child.
1188 (14%) VERB nodes have two children.
5949 (72%) VERB nodes have three or more children.
The highest child degree of a VERB node is 11.
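The four child-degree counts above add up to the 8234 VERB tokens reported at the top of the page, and the percentages are shares of that total. A quick consistency check, assuming the percentages are rounded to the nearest integer (the dictionary keys are illustrative labels only):

```python
# Child-degree counts for VERB nodes, as reported above.
degree_counts = {
    "leaves": 474,
    "one child": 623,
    "two children": 1188,
    "three or more children": 5949,
}

total = sum(degree_counts.values())
shares = {name: round(100 * n / total) for name, n in degree_counts.items()}

print(total)  # 8234
for name, pct in shares.items():
    print(f"{name}: {pct}%")
```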
Children of VERB nodes are attached using 36 different relations: punct (7523; 26% instances), nsubj (3966; 14% instances), obl (3745; 13% instances), advmod (2657; 9% instances), obj (2342; 8% instances), conj (2260; 8% instances), cc (1550; 5% instances), iobj (1178; 4% instances), advcl (728; 3% instances), parataxis (507; 2% instances), mark (464; 2% instances), xcomp (412; 1% instances), vocative (185; 1% instances), ccomp (176; 1% instances), nsubj:pass (176; 1% instances), obl:agent (129; 0% instances), obl:tmod (113; 0% instances), aux (111; 0% instances), parataxis:discourse (96; 0% instances), csubj (79; 0% instances), discourse (72; 0% instances), obl:float (36; 0% instances), aux:pass (22; 0% instances), obl:depict (20; 0% instances), expl (19; 0% instances), case (8; 0% instances), csubj:pass (8; 0% instances), det (4; 0% instances), acl (3; 0% instances), amod (3; 0% instances), appos (2; 0% instances), cop (2; 0% instances), nmod (2; 0% instances), acl:relcl (1; 0% instances), dislocated (1; 0% instances), nummod:gov (1; 0% instances)
Children of VERB nodes belong to 17 different parts of speech: NOUN (9074; 32% instances), PUNCT (7523; 26% instances), VERB (3493; 12% instances), PRON (2579; 9% instances), ADV (1895; 7% instances), CCONJ (1547; 5% instances), PART (929; 3% instances), ADJ (499; 2% instances), SCONJ (436; 2% instances), PROPN (210; 1% instances), DET (163; 1% instances), AUX (138; 0% instances), INTJ (57; 0% instances), NUM (36; 0% instances), ADP (15; 0% instances), X (5; 0% instances), SYM (2; 0% instances)