
This page pertains to UD version 2.

Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: AUX

There are 4 AUX lemmas (0%), 71 AUX types (0%) and 803 AUX tokens (1%). Out of 17 observed tags, the rank of AUX is: 16 in number of lemmas, 12 in number of types and 14 in number of tokens.

The most frequent AUX lemmas (all 4 observed): быти, бы, бъ, яти

The 10 most frequent AUX types: бы, было, есть, были, будет, б, былъ, еси, бысть, будетъ

The most frequent ambiguous lemmas (only 3 observed): быти (AUX 629, VERB 158), бъ (AUX 54, ADP 1), яти (VERB 5, AUX 3)

The 10 most frequent ambiguous types: было (AUX 75, VERB 18), есть (AUX 56, VERB 15), были (AUX 49, VERB 9), будет (AUX 46, VERB 8, SCONJ 5), былъ (AUX 29, VERB 1), бысть (AUX 25, VERB 8), будетъ (AUX 24, SCONJ 9, VERB 5, PART 2), был (AUX 22, VERB 9), была (AUX 22, VERB 8), быти (AUX 18, VERB 11)

Morphology

The form / lemma ratio of AUX is 17.750000 (the average of all parts of speech is 2.250521).
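The reported ratio is consistent with dividing the number of AUX types by the number of AUX lemmas given above (71 and 4). A minimal sketch of that arithmetic follows; the function name `form_lemma_ratio` is hypothetical and is not part of any UD tooling.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts taken from this page: 71 AUX types, 4 AUX lemmas.
aux_ratio = form_lemma_ratio(71, 4)
print(f"{aux_ratio:.6f}")  # 17.750000
```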

The highest number of forms (67) was observed with the lemma “быти”: Буд[ь], бе, бех, будемъ, будет, будете, будетъ, будеть, будеши, буди, буду, будут, будутъ, будуть, будучи, бы, бы(ст), бывше, бывши, бывших, бывшу, бывшіꙗ, был, была, были, было, былъ, бысть, быт(ь), быт[и], быти, быть, бых, быхом, быша, быше, бяху, бяше, бѣ, бꙋд, бꙋдет, бꙋдетъ, бꙋдеш[ь], бꙋдь, бꙋдꙋ, бꙋдꙋт, бꙗхꙋ, бꙗше, е(ст), е., еси, есми, есмы, есмь, есмя, есте, есть, есь, есьмы, несть, суть, сушу, сущий, сущу, сущъ, сꙋ(т), сꙋщꙋ.

The 2nd highest number of forms (2) was observed with the lemma “бъ”: б, бъ.

The 3rd highest number of forms (2) was observed with the lemma “яти”: имете, имꙋ.

AUX occurs with 12 features: VerbForm (647; 81% instances), Voice (647; 81% instances), Number (601; 75% instances), Tense (590; 73% instances), Mood (555; 69% instances), Person (391; 49% instances), Gender (161; 20% instances), Analyt (159; 20% instances), Case (11; 1% instances), Variant (7; 1% instances), Aspect (1; 0% instances), Polarity (1; 0% instances)

AUX occurs with 29 feature-value pairs: Analyt=Yes, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Tense=Fut, Tense=Imp, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=PartRes, Voice=Act

AUX occurs with 49 feature combinations. The most frequent feature combination is Analyt=Yes|Mood=Cnd (140 tokens). Examples: бы, б, бъ

Relations

AUX nodes are attached to their parents using 14 different relations: cop (369; 46% instances), aux (331; 41% instances), aux:pass (67; 8% instances), conj (11; 1% instances), advcl (8; 1% instances), fixed (5; 1% instances), root (4; 0% instances), orphan (2; 0% instances), acl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), list (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)
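The percentages in the relation list can be reproduced from the raw counts. The sketch below assumes each share is the count divided by the total number of AUX tokens, rounded to the nearest whole percent; the names `aux_relations` and `share` are illustrative only.

```python
# Relation counts for AUX nodes, copied from the list above.
aux_relations = {
    "cop": 369, "aux": 331, "aux:pass": 67, "conj": 11, "advcl": 8,
    "fixed": 5, "root": 4, "orphan": 2, "acl": 1, "amod": 1,
    "ccomp": 1, "list": 1, "parataxis": 1, "xcomp": 1,
}

total = sum(aux_relations.values())  # should equal the 803 AUX tokens


def share(count: int, total: int) -> int:
    """Percentage of `total`, rounded to the nearest integer."""
    return round(100 * count / total)


shares = {rel: share(n, total) for rel, n in aux_relations.items()}
for rel, pct in shares.items():
    print(f"{rel}: {aux_relations[rel]} ({pct}%)")
```

Under this assumption, `cop` and `aux` together account for roughly 87% of all AUX attachments.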

Parents of AUX nodes belong to 12 different parts of speech: VERB (397; 49% instances), NOUN (173; 22% instances), ADJ (124; 15% instances), PRON (37; 5% instances), ADV (31; 4% instances), PROPN (15; 2% instances), DET (9; 1% instances), NUM (7; 1% instances), no parent, i.e. root nodes (4; 0% instances), AUX (3; 0% instances), PART (2; 0% instances), SCONJ (1; 0% instances)

774 (96%) AUX nodes are leaves.

4 (0%) AUX nodes have one child.

0 (0%) AUX nodes have two children.

25 (3%) AUX nodes have three or more children.

The highest child degree of an AUX node is 9.
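The child-degree figures above (leaves, one child, two children, three or more) can be derived by counting, for each AUX token, how many tokens in the same sentence name it as their head. The sketch below is one way to do this on CoNLL-U text using only the standard library. The function `child_degrees` and the sample `TOY_CONLLU` are hypothetical: the sample is an invented sentence, not data from the treebank.

```python
from collections import Counter

# Invented CoNLL-U sample (10 tab-separated columns per token).
TOY_CONLLU = "\n".join([
    "# text = toy example, not taken from the treebank",
    "1\tshe\tshe\tPRON\t_\t_\t3\tnsubj\t_\t_",
    "2\twould\twould\tAUX\t_\t_\t3\taux\t_\t_",
    "3\tgo\tgo\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
    "",
])


def child_degrees(conllu: str, upos: str) -> Counter:
    """Map number of children -> number of nodes tagged `upos`."""
    degrees = Counter()
    sentence = []  # (id, upos, head) for each token of the current sentence
    for line in conllu.splitlines() + [""]:
        if not line.strip():
            # A blank line closes the sentence: tally children per head id.
            if sentence:
                n_children = Counter(head for _, _, head in sentence)
                for tok_id, tag, _ in sentence:
                    if tag == upos:
                        degrees[n_children[tok_id]] += 1
                sentence = []
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        sentence.append((cols[0], cols[3], cols[6]))
    return degrees


degrees = child_degrees(TOY_CONLLU, "AUX")
leaves = degrees[0]  # AUX nodes with no children
```

In the toy sentence the single AUX token has no dependents, so it is counted as a leaf; run over the full treebank files, the same tally would yield the distribution reported above.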

Children of AUX nodes are attached using 14 different relations: advmod (28; 19% instances), punct (25; 17% instances), nsubj (23; 16% instances), obl (21; 14% instances), conj (12; 8% instances), cc (11; 8% instances), mark (7; 5% instances), discourse (5; 3% instances), advcl (4; 3% instances), aux (3; 2% instances), iobj (3; 2% instances), vocative (2; 1% instances), obl:tmod (1; 1% instances), parataxis (1; 1% instances)

Children of AUX nodes belong to 11 different parts of speech: NOUN (37; 25% instances), PART (27; 18% instances), PUNCT (25; 17% instances), VERB (14; 10% instances), CCONJ (10; 7% instances), PRON (10; 7% instances), ADV (7; 5% instances), SCONJ (7; 5% instances), PROPN (4; 3% instances), AUX (3; 2% instances), ADJ (2; 1% instances)