home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Belarusian-HSE: POS Tags: AUX

There are 3 AUX lemmas (0%), 19 AUX types (0%) and 2084 AUX tokens (1%). Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 17 in number of types and 16 in number of tokens.

The 10 most frequent AUX lemmas: быць, б, бы

The 10 most frequent AUX types: будзе, быў, было, былі, была, будуць, б, быць, будзем, ёсць

The 10 most frequent ambiguous lemmas: быць (AUX 1914, VERB 354), б (AUX 135, PART 18, NOUN 4, ADJ 2), бы (PART 37, AUX 35, SCONJ 4, ADP 1)

The 10 most frequent ambiguous types: будзе (AUX 352, VERB 22), быў (AUX 334, VERB 14), было (AUX 319, VERB 61), былі (AUX 213, VERB 10), была (AUX 154, VERB 28), будуць (AUX 148, VERB 4), б (AUX 131, PART 18, NOUN 3), быць (AUX 117, VERB 20), будзем (AUX 64, VERB 5), ёсць (AUX 70, VERB 27)

Morphology

The form / lemma ratio of AUX is 6.333333 (the average of all parts of speech is 1.756638).

The 1st highest number of forms (17) was observed with the lemma “быць”: Будзь, Будзьма, будзе, будзем, будзеце, будзеш, будзьце, буду, будуць, будучы, была, было, былі, быць, быў, ёсць, ёсьць.

The 2nd highest number of forms (1) was observed with the lemma “б”: б.

The 3rd highest number of forms (1) was observed with the lemma “бы”: бы.

AUX occurs with 11 features: Mood (1943; 93% instances), VerbForm (1906; 91% instances), Voice (1902; 91% instances), Number (1784; 86% instances), Tense (1782; 86% instances), Gender (848; 41% instances), Person (721; 35% instances), Aspect (245; 12% instances), Case (3; 0% instances), Animacy (2; 0% instances), Degree (2; 0% instances)

AUX occurs with 24 feature-value pairs: Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Loc, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, Voice=Act

AUX occurs with 32 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act (349 tokens). Examples: будзе, буду

Relations

AUX nodes are attached to their parents using 18 different relations: cop (901; 43% instances), aux (586; 28% instances), aux:pass (516; 25% instances), root (29; 1% instances), conj (13; 1% instances), ccomp (9; 0% instances), advcl (6; 0% instances), acl:relcl (5; 0% instances), acl (4; 0% instances), parataxis (3; 0% instances), xcomp (3; 0% instances), appos (2; 0% instances), orphan (2; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances)

Parents of AUX nodes belong to 15 different parts of speech: VERB (1119; 54% instances), NOUN (421; 20% instances), ADJ (267; 13% instances), PRON (76; 4% instances), ADV (74; 4% instances), DET (42; 2% instances), (29; 1% instances), PROPN (24; 1% instances), NUM (17; 1% instances), AUX (5; 0% instances), SYM (4; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)

2006 (96%) AUX nodes are leaves.

9 (0%) AUX nodes have one child.

13 (1%) AUX nodes have two children.

56 (3%) AUX nodes have three or more children.

The highest child degree of a AUX node is 7.

Children of AUX nodes are attached using 17 different relations: punct (73; 28% instances), nsubj (59; 23% instances), advmod (37; 14% instances), obl (28; 11% instances), mark (16; 6% instances), cc (8; 3% instances), conj (7; 3% instances), parataxis (7; 3% instances), advcl (6; 2% instances), aux (4; 2% instances), csubj (4; 2% instances), dep (3; 1% instances), iobj (2; 1% instances), nmod (2; 1% instances), discourse (1; 0% instances), flat (1; 0% instances), xcomp (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: PUNCT (73; 28% instances), NOUN (61; 24% instances), PART (21; 8% instances), ADV (19; 7% instances), PRON (17; 7% instances), VERB (17; 7% instances), SCONJ (16; 6% instances), PROPN (10; 4% instances), CCONJ (7; 3% instances), AUX (5; 2% instances), DET (4; 2% instances), ADJ (3; 1% instances), SYM (3; 1% instances), X (3; 1% instances)