home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Erzya-JR: POS Tags: AUX

There are 14 AUX lemmas (0%), 78 AUX types (1%) and 697 AUX tokens (4%). Out of 16 observed tags, the rank of AUX is: 13 in number of lemmas, 8 in number of types and 6 in number of tokens.

The 10 most frequent AUX lemmas: а, кармамс, аволь, улемс, ульнемс, арась, савомс, эрявомс, ли, кадык

The 10 most frequent AUX types: а, аволь, эзь, кармась, апак, ульнесь, арась, кармасть, ули, иля

The 10 most frequent ambiguous lemmas: а (AUX 343, CCONJ 8, INTJ 1), арась (AUX 37, INTJ 5, VERB 2), эрявомс (AUX 15, VERB 7)

The 10 most frequent ambiguous types: а (AUX 157, CCONJ 5), арась (AUX 19, VERB 1), эряви (AUX 9, VERB 3), кадык (AUX 3, VERB 2), арасть (AUX 5, VERB 1)

Morphology

The form / lemma ratio of AUX is 5.571429 (the average of all parts of speech is 2.044845).

The 1st highest number of forms (25) was observed with the lemma “а”: Илять, Эзик, а, аволизе, аволинек, аволинь, аволить, аволь, апак, иля, илядо, илязо, илязт, иляст, эзизе, эзизь, эзимизь, эзинзе, эзинь, эзить, эзия, эзть, эзь, эссе, эсть.

The 2nd highest number of forms (15) was observed with the lemma “улемс”: уле, улевель, улевельть, улезт, улезэ, улеме, улемс, улест, ули, улить, улияк, ульгак, ульдядо, ульдянок, улян.

The 3rd highest number of forms (13) was observed with the lemma “кармамс”: Карминдерят, карма, кармавлинь, кармакшнось, кармасть, кармась, кармат, карми, кармиде, кармильть, карминь, кармить, кармитьдеряй.

AUX occurs with 20 features: Polarity (433; 62% instances), Number[subj] (367; 53% instances), Person[subj] (367; 53% instances), Mood (358; 51% instances), Tense (314; 45% instances), Valency (190; 27% instances), VerbType (82; 12% instances), NegationType (52; 7% instances), VerbForm (43; 6% instances), Number[obj] (23; 3% instances), Person[obj] (23; 3% instances), PartType (18; 3% instances), Connegative (9; 1% instances), Case (8; 1% instances), Clitic (7; 1% instances), Definite (4; 1% instances), Number (4; 1% instances), Derivation (3; 0% instances), Style (2; 0% instances), Aspect (1; 0% instances)

AUX occurs with 34 feature-value pairs: Aspect=Hab, Case=Loc, Case=Nom, Clitic=Add, Connegative=Yes, Definite=Ind, Derivation=OkshnOms, Mood=Cnd, Mood=Imp, Mood=Ind, Mood=Opt, Mood=Proh, Mood=Sub, NegationType=Contrastive, Number=Sing, Number[obj]=Plur, Number[obj]=Sing, Number[subj]=Plur, Number[subj]=Sing, PartType=Emp, Person[obj]=1, Person[obj]=3, Person[subj]=1, Person[subj]=2, Person[subj]=3, Polarity=Neg, Style=Arch, Tense=Past, Tense=Pres, Valency=1, Valency=2, VerbForm=Conv,Part, VerbForm=Inf, VerbType=Aux

AUX occurs with 89 feature combinations. The most frequent feature combination is Polarity=Neg (177 tokens). Examples: а, аволь, эзь, апак, арась, эзть, эзинь, эсть, эзизе, арасель

Relations

AUX nodes are attached to their parents using 19 different relations: aux:neg (405; 58% instances), aux:aspect (86; 12% instances), cop (86; 12% instances), aux (22; 3% instances), root (22; 3% instances), aux:nec (16; 2% instances), aux:q (12; 2% instances), conj (9; 1% instances), aux:cnd (8; 1% instances), aux:opt (8; 1% instances), aux:imp (5; 1% instances), fixed (4; 1% instances), acl:relcl (3; 0% instances), ccomp (3; 0% instances), parataxis (3; 0% instances), appos (2; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), orphan (1; 0% instances)

Parents of AUX nodes belong to 11 different parts of speech: VERB (441; 63% instances), NOUN (71; 10% instances), ADV (68; 10% instances), ADJ (46; 7% instances), PRON (32; 5% instances), (22; 3% instances), AUX (6; 1% instances), ADP (5; 1% instances), NUM (4; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances)

645 (93%) AUX nodes are leaves.

18 (3%) AUX nodes have one child.

10 (1%) AUX nodes have two children.

24 (3%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 20 different relations: punct (51; 39% instances), nsubj (27; 21% instances), conj (8; 6% instances), fixed (7; 5% instances), obl (7; 5% instances), cc (4; 3% instances), advmod:lmod (3; 2% instances), advmod:tmod (3; 2% instances), vocative (3; 2% instances), advcl (2; 2% instances), advmod (2; 2% instances), mark (2; 2% instances), obl:lmod (2; 2% instances), parataxis (2; 2% instances), xcomp (2; 2% instances), advmod:eval (1; 1% instances), aux (1; 1% instances), aux:neg (1; 1% instances), csubj (1; 1% instances), discourse (1; 1% instances)

Children of AUX nodes belong to 13 different parts of speech: PUNCT (51; 39% instances), NOUN (32; 25% instances), ADV (12; 9% instances), VERB (10; 8% instances), PRON (8; 6% instances), AUX (6; 5% instances), CCONJ (4; 3% instances), PROPN (2; 2% instances), ADJ (1; 1% instances), INTJ (1; 1% instances), PART (1; 1% instances), SCONJ (1; 1% instances), X (1; 1% instances)