Treebank Statistics: UD_Belarusian-HSE: POS Tags: AUX
There are 3 AUX lemmas (0%), 19 AUX types (0%) and 2084 AUX tokens (1%).
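Counts of this kind can be reproduced from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the two-sentence sample are made up for illustration, and it assumes that types are counted case-sensitively.

```python
# Made-up CoNLL-U fragment for illustration only (not taken from the treebank).
SAMPLE = (
    "1\tЁн\tён\tPRON\t_\t_\t3\tnsubj\t_\t_\n"
    "2\tбыў\tбыць\tAUX\t_\t_\t3\tcop\t_\t_\n"
    "3\tдома\tдома\tADV\t_\t_\t0\troot\t_\t_\n"
    "\n"
    "1\tЯна\tяна\tPRON\t_\t_\t3\tnsubj\t_\t_\n"
    "2\tбыла\tбыць\tAUX\t_\t_\t3\tcop\t_\t_\n"
    "3\tдома\tдома\tADV\t_\t_\t0\troot\t_\t_\n"
)


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag (hypothetical helper)."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])   # FORM column, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "AUX"))  # (1, 2, 2)
```

On the toy sample, the two AUX tokens share one lemma (быць) but have two distinct forms.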
Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 17 in number of types and 16 in number of tokens.
The 3 AUX lemmas, by frequency: быць, б, бы
The 10 most frequent AUX types: будзе, быў, было, былі, была, будуць, б, быць, будзем, ёсць
The 10 most frequent ambiguous lemmas: быць (AUX 1914, VERB 354), б (AUX 135, PART 18, NOUN 4, ADJ 2), бы (PART 37, AUX 35, SCONJ 4, ADP 1)
The 10 most frequent ambiguous types: будзе (AUX 352, VERB 22), быў (AUX 334, VERB 14), было (AUX 319, VERB 61), былі (AUX 213, VERB 10), была (AUX 154, VERB 28), будуць (AUX 148, VERB 4), б (AUX 131, PART 18, NOUN 3), быць (AUX 117, VERB 20), будзем (AUX 64, VERB 5), ёсць (AUX 70, VERB 27)
Morphology
The form / lemma ratio of AUX is 6.333333 (the average of all parts of speech is 1.756638).
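The ratio can be checked against the counts reported on this page: 19 AUX types divided by 3 AUX lemmas. The sketch below is a sanity check only; the variable names are illustrative, and it assumes the ratio is simply types over lemmas, which is consistent with the per-lemma form counts (17 + 1 + 1 = 19).

```python
# Counts as reported on this page.
aux_types = 19
aux_lemmas = 3

# Forms per lemma as reported in the Morphology section.
forms_per_lemma = {"быць": 17, "б": 1, "бы": 1}

# The per-lemma form counts add up to the number of AUX types.
assert sum(forms_per_lemma.values()) == aux_types

# Assumption: form / lemma ratio = number of types / number of lemmas.
ratio = aux_types / aux_lemmas
print(f"{ratio:.6f}")  # 6.333333
```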
The 1st highest number of forms (17) was observed with the lemma “быць”: Будзь, Будзьма, будзе, будзем, будзеце, будзеш, будзьце, буду, будуць, будучы, была, было, былі, быць, быў, ёсць, ёсьць.
The 2nd highest number of forms (1) was observed with the lemma “б”: б.
The 3rd highest number of forms (1) was observed with the lemma “бы”: бы.
AUX occurs with 11 features: Mood (1943; 93% instances), VerbForm (1906; 91% instances), Voice (1902; 91% instances), Number (1784; 86% instances), Tense (1782; 86% instances), Gender (848; 41% instances), Person (721; 35% instances), Aspect (245; 12% instances), Case (3; 0% instances), Animacy (2; 0% instances), Degree (2; 0% instances)
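The percentages are consistent with each feature count divided by the 2084 AUX tokens and rounded to the nearest whole percent. The sketch below recomputes them under that assumption; the rounding rule is inferred from the numbers, not documented on this page.

```python
# Total AUX tokens and per-feature counts, as reported on this page.
total_aux_tokens = 2084
feature_counts = {
    "Mood": 1943, "VerbForm": 1906, "Voice": 1902, "Number": 1784,
    "Tense": 1782, "Gender": 848, "Person": 721, "Aspect": 245,
    "Case": 3, "Animacy": 2, "Degree": 2,
}

# Assumption: share = count / total tokens, rounded to a whole percent.
shares = {
    name: round(100 * count / total_aux_tokens)
    for name, count in feature_counts.items()
}

for name, pct in shares.items():
    print(f"{name}: {pct}%")
```

Features with only two or three occurrences (Case, Animacy, Degree) round down to 0% even though they are attested.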
AUX occurs with 24 feature-value pairs: Animacy=Inan, Aspect=Imp, Aspect=Perf, Case=Acc, Case=Loc, Degree=Pos, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, Voice=Act
AUX occurs with 32 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Fut|VerbForm=Fin|Voice=Act (349 tokens).
Examples: будзе, буду
Relations
AUX nodes are attached to their parents using 18 different relations: cop (901; 43% instances), aux (586; 28% instances), aux:pass (516; 25% instances), root (29; 1% instances), conj (13; 1% instances), ccomp (9; 0% instances), advcl (6; 0% instances), acl:relcl (5; 0% instances), acl (4; 0% instances), parataxis (3; 0% instances), xcomp (3; 0% instances), appos (2; 0% instances), orphan (2; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), fixed (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances)
Parents of AUX nodes belong to 15 different parts of speech: VERB (1119; 54% instances), NOUN (421; 20% instances), ADJ (267; 13% instances), PRON (76; 4% instances), ADV (74; 4% instances), DET (42; 2% instances), (29; 1% instances), PROPN (24; 1% instances), NUM (17; 1% instances), AUX (5; 0% instances), SYM (4; 0% instances), PART (2; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances), X (1; 0% instances)
2006 (96%) AUX nodes are leaves.
9 (0%) AUX nodes have one child.
13 (1%) AUX nodes have two children.
56 (3%) AUX nodes have three or more children.
The highest child degree of an AUX node is 7.
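As a consistency check, the four child-degree classes should partition the 2084 AUX tokens. The sketch below verifies this from the reported counts; the names are illustrative, and the whole-percent rounding is an assumption inferred from the numbers.

```python
# Child-degree classes of AUX nodes, as reported on this page.
degree_counts = {
    "leaf": 2006,
    "one child": 9,
    "two children": 13,
    "three or more children": 56,
}

# The classes are mutually exclusive, so they should sum to all AUX tokens.
total_aux_nodes = sum(degree_counts.values())
print(total_aux_nodes)  # 2084

# Assumption: percentages are rounded to the nearest whole percent.
degree_shares = {
    label: round(100 * count / total_aux_nodes)
    for label, count in degree_counts.items()
}
print(degree_shares)
```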
Children of AUX nodes are attached using 17 different relations: punct (73; 28% instances), nsubj (59; 23% instances), advmod (37; 14% instances), obl (28; 11% instances), mark (16; 6% instances), cc (8; 3% instances), conj (7; 3% instances), parataxis (7; 3% instances), advcl (6; 2% instances), aux (4; 2% instances), csubj (4; 2% instances), dep (3; 1% instances), iobj (2; 1% instances), nmod (2; 1% instances), discourse (1; 0% instances), flat (1; 0% instances), xcomp (1; 0% instances)
Children of AUX nodes belong to 14 different parts of speech: PUNCT (73; 28% instances), NOUN (61; 24% instances), PART (21; 8% instances), ADV (19; 7% instances), PRON (17; 7% instances), VERB (17; 7% instances), SCONJ (16; 6% instances), PROPN (10; 4% instances), CCONJ (7; 3% instances), AUX (5; 2% instances), DET (4; 2% instances), ADJ (3; 1% instances), SYM (3; 1% instances), X (3; 1% instances)