home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: AUX

There are 3 AUX lemmas (0%), 30 AUX types (0%) and 12987 AUX tokens (1%). Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 17 in number of types and 13 in number of tokens.

The 10 most frequent AUX lemmas: быть, бы, б

The 10 most frequent AUX types: было, бы, был, была, были, быть, будет, есть, буду, будут

The 10 most frequent ambiguous lemmas: быть (AUX 10794, VERB 3946, X 1), бы (AUX 2126, PART 642, X 1), б (AUX 67, NOUN 35, PART 4, X 3, ADJ 1)

The 10 most frequent ambiguous types: было (AUX 2459, VERB 626, PART 85), бы (AUX 2124, PART 641, X 22), был (AUX 1999, VERB 158), была (AUX 1476, VERB 102), были (AUX 1390, VERB 128), быть (AUX 947, VERB 543, X 1), будет (AUX 833, VERB 150), есть (VERB 1510, AUX 384, INTJ 1), буду (AUX 264, VERB 2), будут (AUX 210, VERB 20)

Morphology

The form / lemma ratio of AUX is 10.000000 (the average of all parts of speech is 2.706171).

The 1st highest number of forms (28) was observed with the lemma “быть”: беаше, будем, будет, будете, будешь, будте, буду, будут, будучи, будь, будьте, бывшая, бывшего, бывшие, бывший, бывших, был, была, были, было, бысть, быть, еси, есмы, есмь, есте, есть, суть.

The 2nd highest number of forms (2) was observed with the lemma “бы”: б, бы.

The 3rd highest number of forms (1) was observed with the lemma “б”: б.

AUX occurs with 11 features: Mood (11965; 92% instances), VerbForm (10794; 83% instances), Voice (10794; 83% instances), Number (9777; 75% instances), Tense (9715; 75% instances), Gender (6055; 47% instances), Person (2303; 18% instances), Aspect (1001; 8% instances), Case (5; 0% instances), Animacy (1; 0% instances), Typo (1; 0% instances)

AUX occurs with 24 feature-value pairs: Animacy=Anim, Aspect=Imp, Case=Acc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 36 feature combinations. The most frequent feature combination is Gender=Neut|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin|Voice=Act (2273 tokens). Examples: было

Relations

AUX nodes are attached to their parents using 17 different relations: cop (7594; 58% instances), aux (3080; 24% instances), aux:pass (1989; 15% instances), root (127; 1% instances), fixed (91; 1% instances), conj (38; 0% instances), orphan (20; 0% instances), advcl (11; 0% instances), ccomp (10; 0% instances), acl:relcl (8; 0% instances), parataxis (6; 0% instances), xcomp (5; 0% instances), csubj (3; 0% instances), dep (2; 0% instances), appos (1; 0% instances), list (1; 0% instances), reparandum (1; 0% instances)

Parents of AUX nodes belong to 16 different parts of speech: VERB (5297; 41% instances), ADJ (3578; 28% instances), NOUN (2714; 21% instances), PRON (353; 3% instances), ADV (313; 2% instances), DET (263; 2% instances), (127; 1% instances), PROPN (124; 1% instances), NUM (114; 1% instances), PART (73; 1% instances), SCONJ (16; 0% instances), INTJ (7; 0% instances), X (4; 0% instances), AUX (2; 0% instances), ADP (1; 0% instances), SYM (1; 0% instances)

12775 (98%) AUX nodes are leaves.

18 (0%) AUX nodes have one child.

47 (0%) AUX nodes have two children.

147 (1%) AUX nodes have three or more children.

The highest child degree of a AUX node is 9.

Children of AUX nodes are attached using 20 different relations: punct (193; 28% instances), advmod (135; 19% instances), nsubj (133; 19% instances), obl (61; 9% instances), conj (44; 6% instances), parataxis (34; 5% instances), cc (27; 4% instances), obl:tmod (25; 4% instances), mark (17; 2% instances), advcl (5; 1% instances), csubj (5; 1% instances), discourse (3; 0% instances), iobj (3; 0% instances), vocative (3; 0% instances), obj (2; 0% instances), parataxis:discourse (2; 0% instances), aux (1; 0% instances), det (1; 0% instances), nmod (1; 0% instances), reparandum (1; 0% instances)

Children of AUX nodes belong to 15 different parts of speech: PUNCT (193; 28% instances), NOUN (150; 22% instances), ADV (90; 13% instances), VERB (68; 10% instances), PRON (66; 9% instances), PART (55; 8% instances), CCONJ (27; 4% instances), SCONJ (17; 2% instances), ADJ (10; 1% instances), DET (7; 1% instances), NUM (4; 1% instances), PROPN (4; 1% instances), AUX (2; 0% instances), SYM (2; 0% instances), ADP (1; 0% instances)