home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: POS Tags: AUX

There are 27 AUX lemmas (0%), 55 AUX types (0%) and 1012 AUX tokens (1%). Out of 16 observed tags, the rank of AUX is: 11 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: БЫТЬ, ЯВЛЯТЬСЯ, ЭТО, СТАТЬ, ЯВИТЬСЯ, BIN, ОКАЗАТЬСЯ, ОКАЗЫВАТЬСЯ, AM, BE

The 10 most frequent AUX types: был, была, были, было, является, быть, будет, это, являются, будучи

The 10 most frequent ambiguous lemmas: БЫТЬ (AUX 795, VERB 129), ЯВЛЯТЬСЯ (AUX 135, VERB 5), ЭТО (PRON 148, AUX 27), СТАТЬ (VERB 119, AUX 25), ЯВИТЬСЯ (AUX 5, VERB 2), ОКАЗАТЬСЯ (VERB 11, AUX 2), ОКАЗЫВАТЬСЯ (VERB 3, AUX 2), БЫВАТЬ (VERB 3, AUX 1), ДЕЛАТЬ (VERB 12, AUX 1), МОЧЬ (VERB 101, AUX 1)

The 10 most frequent ambiguous types: был (AUX 297, VERB 16), была (AUX 130, VERB 11), были (AUX 131, VERB 15), было (AUX 125, VERB 32), является (AUX 74, VERB 5), быть (AUX 34, VERB 12), будет (AUX 28, VERB 4), это (PRON 56, AUX 24, DET 23), будучи (AUX 8, VERB 1), стал (VERB 44, AUX 9)

Morphology

The form / lemma ratio of AUX is 2.037037 (the average of all parts of speech is 1.592402).

The 1st highest number of forms (12) was observed with the lemma “БЫТЬ”: будет, будут, будучи, бывшего, бывшие, бывшим, был, была, были, было, быть, есть.

The 2nd highest number of forms (10) was observed with the lemma “ЯВЛЯТЬСЯ”: Являясь, являвшись, является, являлась, являлись, являлся, являться, являются, являющегося, являющееся.

The 3rd highest number of forms (6) was observed with the lemma “СТАТЬ”: ставших, стал, стала, стали, стало, стать.

AUX occurs with 11 features: VerbForm (983; 97% instances), Aspect (982; 97% instances), Number (932; 92% instances), Tense (931; 92% instances), Mood (924; 91% instances), Gender (625; 62% instances), Person (164; 16% instances), Voice (149; 15% instances), Animacy (8; 1% instances), Case (8; 1% instances), Variant (1; 0% instances)

AUX occurs with 26 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Gen, Case=Ins, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=3, Tense=Fut, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act, Voice=Mid, Voice=Pass

AUX occurs with 37 feature combinations. The most frequent feature combination is Aspect=Imp|Gender=Masc|Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin (311 tokens). Examples: был, состоял

Relations

AUX nodes are attached to their parents using 13 different relations: aux:pass (535; 53% instances), cop (246; 24% instances), root (115; 11% instances), aux (63; 6% instances), conj (19; 2% instances), advcl (10; 1% instances), acl:relcl (7; 1% instances), ccomp (6; 1% instances), acl (4; 0% instances), parataxis (3; 0% instances), xcomp (2; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: VERB (595; 59% instances), NOUN (162; 16% instances), (115; 11% instances), ADJ (113; 11% instances), NUM (8; 1% instances), PROPN (7; 1% instances), ADV (4; 0% instances), PRON (4; 0% instances), DET (3; 0% instances), SYM (1; 0% instances)

829 (82%) AUX nodes are leaves.

18 (2%) AUX nodes have one child.

31 (3%) AUX nodes have two children.

134 (13%) AUX nodes have three or more children.

The highest child degree of a AUX node is 7.

Children of AUX nodes are attached using 17 different relations: xcomp (151; 27% instances), punct (147; 26% instances), nsubj (122; 22% instances), obl (29; 5% instances), advmod (23; 4% instances), obj (18; 3% instances), cc (17; 3% instances), mark (16; 3% instances), conj (13; 2% instances), parataxis (9; 2% instances), nmod (8; 1% instances), acl (3; 1% instances), goeswith (3; 1% instances), advcl (2; 0% instances), discourse (1; 0% instances), iobj (1; 0% instances), nsubj:pass (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: NOUN (245; 43% instances), PUNCT (148; 26% instances), PRON (25; 4% instances), PROPN (25; 4% instances), ADJ (24; 4% instances), VERB (21; 4% instances), ADV (18; 3% instances), CCONJ (16; 3% instances), PART (14; 2% instances), SCONJ (14; 2% instances), NUM (11; 2% instances), ADP (1; 0% instances), DET (1; 0% instances), SYM (1; 0% instances)