
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-FicTree: POS Tags: AUX

There are 2 AUX lemmas (0%), 59 AUX types (0%) and 7534 AUX tokens (5%). Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 11 in number of types and 10 in number of tokens.

The most frequent AUX lemmas (only 2 are observed): být, bývat

The 10 most frequent AUX types: jsem, je, by, byl, byla, bylo, bych, jsme, bude, jsou

The most frequent ambiguous lemmas (only 1 is observed): být (AUX 7488, PRON 1)

The most frequent ambiguous types (only 3 are observed): je (AUX 863, PRON 228), si (PRON 1351, AUX 4), buď (CCONJ 6, AUX 1)
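As a rough illustration of what "ambiguous type" means here, the sketch below groups word forms by the tags they occur with and keeps those that appear as AUX and at least one other tag. The function name `ambiguous_types` and the `(form, upos)` tuple input are assumptions made for this example; they are not the script that generated this page.

```python
from collections import Counter, defaultdict


def ambiguous_types(tokens, tag="AUX"):
    """Return forms seen with `tag` and at least one other tag.

    tokens: iterable of (form, upos) pairs (hypothetical input format).
    """
    by_form = defaultdict(Counter)
    for form, upos in tokens:
        by_form[form][upos] += 1
    # Keep only forms that carry the requested tag plus some other tag.
    return {
        form: dict(counts)
        for form, counts in by_form.items()
        if tag in counts and len(counts) > 1
    }


# Toy data, not taken from the treebank.
toy = [("je", "AUX"), ("je", "PRON"), ("je", "AUX"), ("byl", "AUX")]
result = ambiguous_types(toy)
```

On the toy data only "je" qualifies, because "byl" occurs with a single tag.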

Morphology

The form / lemma ratio of AUX is 29.500000 (the average of all parts of speech is 1.970842).
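The ratio above is consistent with dividing the number of distinct AUX types by the number of AUX lemmas (59 / 2 = 29.5). A minimal sketch of that calculation follows, assuming the input is a list of `(form, lemma)` pairs for one tag; the helper name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas.

    pairs: iterable of (form, lemma) tuples for a single POS tag
    (hypothetical input format). Forms are compared case-sensitively.
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)


# Toy data: three distinct forms spread over two lemmas.
toy_ratio = form_lemma_ratio([("jsem", "být"), ("je", "být"), ("bývá", "bývat")])

# The figures reported on this page: 59 types and 2 lemmas.
page_ratio = 59 / 2
```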

The highest number of forms (48) was observed with the lemma “být”: Buďme, Nebuď, bude, budeme, budete, budeš, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, bysme, byste, být, je, jsem, jsi, jsme, jsou, jsouc, jste, nebude, nebudeme, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejseš, nejsi, nejsme, nejsou, nejste, není, si.

The second-highest number of forms (11) was observed with the lemma “bývat”: Nebývají, bývají, býval, bývala, bývali, bývalo, bývá, bývám, nebýval, nebývala, nebývalo.

AUX occurs with 11 features: VerbForm (7534; 100% instances), Number (6573; 87% instances), Polarity (6258; 83% instances), Tense (6098; 81% instances), Voice (6098; 81% instances), Mood (5957; 79% instances), Person (5516; 73% instances), Gender (1423; 19% instances), Animacy (624; 8% instances), Style (7; 0% instances), Aspect (1; 0% instances)

AUX occurs with 25 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 55 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (2338 tokens). Examples: jsem, bývám
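Feature combinations such as the one above use the `Name=Value|Name=Value` layout of the CoNLL-U FEATS column, where `_` marks an empty feature set. The sketch below splits such a string into a dictionary and counts how often each feature name occurs; `parse_feats` and `feature_counts` are illustrative names, not part of any UD tool.

```python
from collections import Counter


def parse_feats(feats):
    """Split a FEATS string into a {feature: value} dict."""
    if feats == "_":  # "_" stands for "no features"
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_counts(feats_column):
    """Count how many tokens carry each feature name."""
    counts = Counter()
    for feats in feats_column:
        counts.update(parse_feats(feats).keys())
    return counts


most_frequent = "Mood=Ind|Number=Sing|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act"
parsed = parse_feats(most_frequent)
counts = feature_counts([most_frequent, "Polarity=Pos|VerbForm=Inf", "_"])
```

Here `parsed` holds the seven features of the most frequent combination, and `counts` shows VerbForm on two of the three toy tokens.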

Relations

AUX nodes are attached to their parents using 14 different relations: aux (4216; 56% instances), cop (2965; 39% instances), aux:pass (138; 2% instances), root (82; 1% instances), advcl (35; 0% instances), conj (31; 0% instances), ccomp (27; 0% instances), xcomp (17; 0% instances), acl (9; 0% instances), dep (6; 0% instances), acl:relcl (3; 0% instances), orphan (3; 0% instances), csubj (1; 0% instances), parataxis (1; 0% instances)

Parents of AUX nodes belong to 13 different parts of speech: VERB (4087; 54% instances), ADJ (1452; 19% instances), NOUN (1098; 15% instances), ADV (352; 5% instances), PRON (210; 3% instances), DET (120; 2% instances), no parent, i.e. the AUX node is the root (82; 1% instances), PART (52; 1% instances), PROPN (37; 0% instances), NUM (29; 0% instances), AUX (13; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

7337 (97%) AUX nodes are leaves.

12 (0%) AUX nodes have one child.

58 (1%) AUX nodes have two children.

127 (2%) AUX nodes have three or more children.

The highest child degree of an AUX node is 9.
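The leaf and child counts above can be reproduced in principle by counting, for every AUX node, how many other nodes name it as their head. The following is a minimal sketch under the assumption that each sentence is given as a list of `(id, head, upos)` tuples with head 0 for the root; the function name `child_degree_histogram` is hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="AUX"):
    """Map child count -> number of `tag` nodes with that many children.

    sentence: list of (id, head, upos) tuples; head 0 marks the root
    (hypothetical input format, one sentence at a time).
    """
    # How many tokens point at each id as their head.
    n_children = Counter(head for _, head, _ in sentence)
    return Counter(
        n_children[tok_id] for tok_id, _, upos in sentence if upos == tag
    )


# Toy sentence 1: two AUX nodes, both leaves attached to token 3.
leaves = child_degree_histogram(
    [(1, 3, "AUX"), (2, 3, "AUX"), (3, 0, "ADV"), (4, 3, "PUNCT")]
)

# Toy sentence 2: the AUX node is the root and has two children.
rooted = child_degree_histogram(
    [(1, 2, "PRON"), (2, 0, "AUX"), (3, 2, "PUNCT")]
)
```

Summing such histograms over all sentences would give the distribution reported here, in which the four classes add up to the 7534 AUX tokens (7337 + 12 + 58 + 127).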

Children of AUX nodes are attached using 19 different relations: punct (236; 39% instances), nsubj (123; 21% instances), mark (58; 10% instances), conj (33; 6% instances), cc (31; 5% instances), advcl (21; 4% instances), ccomp (20; 3% instances), dep (16; 3% instances), csubj (14; 2% instances), advmod (12; 2% instances), xcomp (8; 1% instances), obl (7; 1% instances), aux (6; 1% instances), discourse (4; 1% instances), appos (3; 1% instances), parataxis (3; 1% instances), orphan (2; 0% instances), obj (1; 0% instances), obl:arg (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: PUNCT (236; 39% instances), NOUN (102; 17% instances), VERB (74; 12% instances), SCONJ (59; 10% instances), PRON (29; 5% instances), CCONJ (28; 5% instances), DET (25; 4% instances), ADV (16; 3% instances), AUX (13; 2% instances), PART (6; 1% instances), ADJ (5; 1% instances), NUM (4; 1% instances), INTJ (1; 0% instances), PROPN (1; 0% instances)