home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: VERB

There are 926 VERB lemmas (28%), 2417 VERB types (35%) and 3729 VERB tokens (18%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: молемс, меремс, теемс, лисемс, самс, ваномс, ютамс, саемс, аштемс, туемс

The 10 most frequent VERB types: мерсь, лиссь, ютась, мольсь, ашти, неяви, совась, маряви, саизе, сась

The 10 most frequent ambiguous lemmas: пелемс (VERB 21, NOUN 1), чавомс (VERB 13, ADJ 1), стямс (VERB 12, ADV 1), улемс (AUX 50, VERB 6), эрявомс (AUX 21, VERB 8), карамс (VERB 7, ADV 1), азё (VERB 5, PART 1), менемс (VERB 4, NOUN 1), ульнемс (AUX 45, VERB 3), молема (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: эрямо (VERB 5, NOUN 2), Улить (VERB 4, AUX 2), азё (PART 1, VERB 1), вечкевикс (VERB 3, ADJ 2), кенере (VERB 3, NOUN 1), эряви (AUX 13, VERB 3), Ярсамодо (VERB 2, NOUN 1), вечкема (VERB 2, NOUN 1), кадык (AUX 3, VERB 2), лисема (NOUN 1, VERB 1)

Morphology

The form / lemma ratio of VERB is 2.610151 (the average of all parts of speech is 2.080194).

The 1st highest number of forms (29) was observed with the lemma “самс”: Сыде, Сынек, са, садо, сазо, сазь, сак, сакшность, сакшнось, самаль, само, самодо, самодон, самодост, самосонзо, самось, самоськак, самс, састь, састькак, сась, сат, сы, сыль, сынь, сыть, сыця, сыцятненень, сыцятнень.

The 2nd highest number of forms (22) was observed with the lemma “марямс”: Марить, Марясть, Марясы, Марясынек, Марясынк, мари, мариде, маризе, маризь, маринек, маринь, мария, маря, марязь, марямга, марямо, маряса, марясак, маряст, марясынзе, марясь, маряяк.

The 3rd highest number of forms (21) was observed with the lemma “неемс”: Неезденть, Нейсы, нее, неезь, неемеде, неемс, нееяк, неи, неизе, неиль, неинек, неинзе, нейсамизь, нейсызь, нейсь, несамам, несы, несызь, несынзе, несь, неят.

VERB occurs with 20 features: Person[subj] (2612; 70% instances), Number[subj] (2611; 70% instances), Mood (2598; 70% instances), Tense (2492; 67% instances), VerbForm (933; 25% instances), Case (515; 14% instances), Person[obj] (509; 14% instances), Number[obj] (505; 14% instances), Derivation (269; 7% instances), Number (226; 6% instances), Definite (212; 6% instances), Connegative (183; 5% instances), Nomzr (79; 2% instances), Clitic (49; 1% instances), Aspect (48; 1% instances), Number[psor] (38; 1% instances), Person[psor] (38; 1% instances), PartForm (15; 0% instances), Style (14; 0% instances), Typo (2; 0% instances)

VERB occurs with 64 feature-value pairs: Aspect=Hab, Aspect=Inch, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Loc, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Connegative=Yes, Definite=Def, Definite=Ind, Derivation=OkshnOms, Derivation=Omka, Derivation=OvOms, Derivation=Ovt, Derivation=Ozj, Derivation=VGen, Derivation=VSj, Mood=Cnd, Mood=CndSub, Mood=Des, Mood=Imp, Mood=Ind, Mood=Nec, Mood=Opt, Mood=Prec, Mood=Sub, Nomzr=Ag, Number=Plur, Number=Plur,Sing, Number=Sing, Number[obj]=Plur, Number[obj]=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Plur, Number[subj]=Plur,Sing, Number[subj]=Sing, PartForm=PastDyn, PartForm=PrsDet, PartForm=PrsTra, Person[obj]=1, Person[obj]=2, Person[obj]=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Style=Arch, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Conv,Part, VerbForm=Inf, VerbForm=Part, VerbForm=Vnoun

VERB occurs with 259 feature combinations. The most frequent feature combination is Mood=Ind|Number[subj]=Sing|Person[subj]=3|Tense=Past (731 tokens). Examples: мерсь, лиссь, мольсь, ютась, совась, сась, пшкадсь, тейсь, чийсь, кадовсь

Relations

VERB nodes are attached to their parents using 31 different relations: root (1680; 45% instances), conj (886; 24% instances), advcl (309; 8% instances), parataxis (188; 5% instances), xcomp (151; 4% instances), acl (128; 3% instances), ccomp (90; 2% instances), acl:relcl (59; 2% instances), appos (31; 1% instances), obl (27; 1% instances), nsubj (25; 1% instances), csubj (22; 1% instances), amod (19; 1% instances), xcomp:ds (18; 0% instances), compound (17; 0% instances), advcl:tcl (15; 0% instances), nmod (15; 0% instances), obl:cmp (11; 0% instances), obj (10; 0% instances), fixed (5; 0% instances), advcl:eval (4; 0% instances), discourse (4; 0% instances), nsubj:cop (3; 0% instances), advcl:cmp (2; 0% instances), dislocated (2; 0% instances), obl:tmod (2; 0% instances), vocative (2; 0% instances), csubj:cop (1; 0% instances), obl:inst (1; 0% instances), obl:lmod (1; 0% instances), orphan (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: (1680; 45% instances), VERB (1549; 42% instances), NOUN (319; 9% instances), ADJ (72; 2% instances), ADV (48; 1% instances), PRON (30; 1% instances), INTJ (12; 0% instances), PROPN (9; 0% instances), ADP (3; 0% instances), AUX (3; 0% instances), DET (3; 0% instances), NUM (1; 0% instances)

269 (7%) VERB nodes are leaves.

436 (12%) VERB nodes have one child.

628 (17%) VERB nodes have two children.

2396 (64%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 62 different relations: punct (3261; 28% instances), nsubj (1571; 14% instances), obl (1347; 12% instances), obj (954; 8% instances), conj (885; 8% instances), advmod (461; 4% instances), advmod:tmod (414; 4% instances), obl:lmod (375; 3% instances), aux (354; 3% instances), advcl (294; 3% instances), cc (260; 2% instances), xcomp (193; 2% instances), parataxis (147; 1% instances), ccomp (108; 1% instances), mark (106; 1% instances), aux:aspect (104; 1% instances), obl:inst (80; 1% instances), discourse (79; 1% instances), advmod:lmod (76; 1% instances), obl:tmod (73; 1% instances), vocative (69; 1% instances), advmod:eval (34; 0% instances), obl:agent (33; 0% instances), appos (25; 0% instances), advmod:foc (24; 0% instances), nmod:gobj (23; 0% instances), nmod (20; 0% instances), advcl:tcl (18; 0% instances), aux:nec (18; 0% instances), xcomp:ds (18; 0% instances), aux:neg (16; 0% instances), advmod:mmod (15; 0% instances), aux:cnd (12; 0% instances), csubj (12; 0% instances), nmod:gsubj (12; 0% instances), advmod:deg (10; 0% instances), compound (10; 0% instances), det (9; 0% instances), nsubj:cop (9; 0% instances), acl (8; 0% instances), aux:opt (8; 0% instances), case (8; 0% instances), aux:q (7; 0% instances), cop (7; 0% instances), dislocated (6; 0% instances), amod (5; 0% instances), aux:imp (5; 0% instances), expl (5; 0% instances), acl:relcl (4; 0% instances), cc:preconj (3; 0% instances), compound:prt (3; 0% instances), dep (3; 0% instances), obl:cmp (3; 0% instances), nmod:poss (2; 0% instances), nummod (2; 0% instances), obl:own (2; 0% instances), orphan (2; 0% instances), advcl:eval (1; 0% instances), advmod:cau (1; 0% instances), advmod:cmp (1; 0% instances), fixed (1; 0% instances), obl:cau (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: PUNCT (3261; 28% instances), NOUN (3225; 28% instances), VERB (1549; 13% instances), ADV (1196; 10% instances), PRON (725; 6% instances), AUX (540; 5% instances), PROPN (430; 4% instances), CCONJ (265; 2% instances), ADP (111; 1% instances), ADJ (106; 1% instances), SCONJ (50; 0% instances), INTJ (46; 0% instances), PART (46; 0% instances), DET (40; 0% instances), NUM (26; 0% instances), X (1; 0% instances)