home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: VERB

There are 830 VERB lemmas (29%), 2050 VERB types (35%) and 3114 VERB tokens (18%). Out of 16 observed tags, the rank of VERB is: 2 in number of lemmas, 2 in number of types and 3 in number of tokens.

The 10 most frequent VERB lemmas: молемс, меремс, ваномс, теемс, лисемс, самс, саемс, содамс, ютамс, туемс

The 10 most frequent VERB types: мерсь, лиссь, мольсь, ютась, ашти, совась, ваны, неяви, маряви, саизе

The 10 most frequent ambiguous lemmas: сёрмадомс (VERB 13, NOUN 1), чавомс (VERB 12, ADJ 1), стямс (VERB 11, ADV 1), карамс (VERB 7, ADV 1), эрявомс (AUX 15, VERB 7), азё (VERB 4, PART 1), арась (AUX 37, INTJ 5, VERB 2), молема (NOUN 1, VERB 1), чачтомс (NOUN 1, VERB 1)

The 10 most frequent ambiguous types: Улить (VERB 3, AUX 2), азё (PART 1, VERB 1), эряви (AUX 9, VERB 3), вечкема (VERB 2, NOUN 1), кадык (AUX 3, VERB 2), кенере (VERB 2, NOUN 1), лисема (NOUN 1, VERB 1), эрямо (NOUN 2, VERB 2), Аволить (AUX 1, VERB 1), Сынь (PRON 11, VERB 1)

Morphology

The form / lemma ratio of VERB is 2.469880 (the average of all parts of speech is 2.044845).

The 1st highest number of forms (26) was observed with the lemma “самс”: Сыде, Сынек, Сынь, са, сазь, сак, сакшность, сакшнось, самаль, само, самодо, самодон, самодост, самосонзо, самось, самоськак, самс, састь, састькак, сась, сы, сыль, сыть, сыця, сыцятненень, сыцятнень.

The 2nd highest number of forms (22) was observed with the lemma “марямс”: Марить, Марясть, Марясы, Марясынек, Марясынк, мари, мариде, маризе, маризь, маринек, маринь, мария, маря, марязь, марямга, марямо, маряса, марясак, маряст, марясынзе, марясь, маряяк.

The 3rd highest number of forms (20) was observed with the lemma “ваномс”: Ванадо, Ванан, Ванодоя, Ванса, ванат, вано, ванодо, ванозь, ваномо, ваномс, вансть, вансы, вансынь, вансь, вант, ваны, ваныка, ваныль, ваныть, ваныцякс.

VERB occurs with 20 features: Valency (2447; 79% instances), Person[subj] (2181; 70% instances), Number[subj] (2180; 70% instances), Mood (2169; 70% instances), Tense (2095; 67% instances), VerbForm (742; 24% instances), Person[obj] (440; 14% instances), Number[obj] (436; 14% instances), Case (421; 14% instances), Derivation (313; 10% instances), Number (175; 6% instances), Definite (165; 5% instances), Connegative (152; 5% instances), Clitic (43; 1% instances), Aspect (40; 1% instances), Number[psor] (35; 1% instances), Person[psor] (35; 1% instances), Style (14; 0% instances), Polarity (2; 0% instances), Typo (1; 0% instances)

VERB occurs with 63 feature-value pairs: Aspect=Hab, Aspect=Inch, Case=Abl, Case=Dat, Case=Ela, Case=Gen, Case=Ill, Case=Ine, Case=Loc, Case=Nom, Case=Prl, Case=Tra, Clitic=Add, Connegative=Yes, Definite=Def, Definite=Ind, Derivation=OkshnOms, Derivation=Omka, Derivation=OvOms, Derivation=Ovt, Derivation=Ozj, Derivation=VGen, Derivation=VSj, Derivation=VerbYcja, Mood=Cnd, Mood=CndSub, Mood=Des, Mood=Imp, Mood=Ind, Mood=Nec, Mood=Opt, Mood=Prec, Mood=Sub, Number=Plur, Number=Plur,Sing, Number=Sing, Number[obj]=Plur, Number[obj]=Sing, Number[psor]=Plur, Number[psor]=Sing, Number[subj]=Plur, Number[subj]=Plur,Sing, Number[subj]=Sing, Person[obj]=1, Person[obj]=2, Person[obj]=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, Style=Arch, Tense=Past, Tense=Pres, Typo=Yes, Valency=1, Valency=2, VerbForm=Conv, VerbForm=Inf, VerbForm=Part, VerbForm=Vnoun

VERB occurs with 327 feature combinations. The most frequent feature combination is Mood=Ind|Number[subj]=Sing|Person[subj]=3|Tense=Past|Valency=1 (326 tokens). Examples: лиссь, совась, мольсь, чийсь, шачсь, мерсь, сась, пшкадсь, лоткась, понксь

Relations

VERB nodes are attached to their parents using 28 different relations: root (1344; 43% instances), conj (782; 25% instances), advcl (272; 9% instances), parataxis (147; 5% instances), xcomp (122; 4% instances), acl (111; 4% instances), ccomp (86; 3% instances), acl:relcl (55; 2% instances), obl (27; 1% instances), appos (23; 1% instances), nsubj (20; 1% instances), amod (19; 1% instances), compound (17; 1% instances), csubj (16; 1% instances), xcomp:ds (16; 1% instances), nmod:comp (11; 0% instances), advcl:tcl (10; 0% instances), nmod (10; 0% instances), obj (9; 0% instances), discourse (4; 0% instances), fixed (4; 0% instances), nsubj:cop (2; 0% instances), obl:tmod (2; 0% instances), csubj:cop (1; 0% instances), dislocated (1; 0% instances), obl:inst (1; 0% instances), obl:lmod (1; 0% instances), orphan (1; 0% instances)

Parents of VERB nodes belong to 12 different parts of speech: (1344; 43% instances), VERB (1333; 43% instances), NOUN (288; 9% instances), ADJ (48; 2% instances), ADV (43; 1% instances), PRON (29; 1% instances), AUX (10; 0% instances), INTJ (8; 0% instances), PROPN (5; 0% instances), DET (3; 0% instances), ADP (2; 0% instances), NUM (1; 0% instances)

223 (7%) VERB nodes are leaves.

361 (12%) VERB nodes have one child.

531 (17%) VERB nodes have two children.

1999 (64%) VERB nodes have three or more children.

The highest child degree of a VERB node is 11.

Children of VERB nodes are attached using 63 different relations: punct (2732; 28% instances), nsubj (1309; 13% instances), obl (1182; 12% instances), obj (831; 8% instances), conj (778; 8% instances), advmod (372; 4% instances), advmod:tmod (369; 4% instances), aux:neg (278; 3% instances), advcl (256; 3% instances), cc (230; 2% instances), xcomp (155; 2% instances), parataxis (116; 1% instances), ccomp (105; 1% instances), mark (103; 1% instances), obl:lmod (93; 1% instances), obl:lmp (91; 1% instances), aux:aspect (83; 1% instances), obl:inst (70; 1% instances), obl:lto (70; 1% instances), discourse (59; 1% instances), obl:tmod (55; 1% instances), vocative (45; 0% instances), obl:agent (29; 0% instances), advmod:lto (28; 0% instances), obl:lfrom (28; 0% instances), advmod:foc (24; 0% instances), advmod:lmod (24; 0% instances), advmod:eval (23; 0% instances), aux (21; 0% instances), aux:nec (16; 0% instances), xcomp:ds (16; 0% instances), appos (15; 0% instances), nmod (15; 0% instances), advmod:mmod (14; 0% instances), advcl:tcl (13; 0% instances), nmod:gsubj (11; 0% instances), advmod:deg (10; 0% instances), compound (10; 0% instances), csubj (9; 0% instances), acl (8; 0% instances), aux:cnd (8; 0% instances), case (8; 0% instances), cop (8; 0% instances), aux:q (7; 0% instances), det (7; 0% instances), nmod:gobj (7; 0% instances), aux:opt (6; 0% instances), nsubj:cop (6; 0% instances), aux:imp (5; 0% instances), dislocated (5; 0% instances), expl (5; 0% instances), advmod:lfrom (4; 0% instances), advmod:lmp (4; 0% instances), amod (4; 0% instances), acl:relcl (3; 0% instances), cc:preconj (3; 0% instances), compound:prt (3; 0% instances), advmod:cau (1; 0% instances), advmod:comp (1; 0% instances), dep (1; 0% instances), nummod (1; 0% instances), obl:cau (1; 0% instances), orphan (1; 0% instances)

Children of VERB nodes belong to 16 different parts of speech: PUNCT (2732; 28% instances), NOUN (2705; 28% instances), VERB (1333; 14% instances), ADV (1031; 11% instances), PRON (580; 6% instances), AUX (441; 5% instances), PROPN (387; 4% instances), CCONJ (235; 2% instances), ADP (95; 1% instances), ADJ (81; 1% instances), SCONJ (48; 0% instances), INTJ (39; 0% instances), PART (35; 0% instances), DET (30; 0% instances), NUM (22; 0% instances), X (1; 0% instances)