home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-FicTree: POS Tags: AUX

There are 2 AUX lemmas (0%), 59 AUX types (0%) and 7534 AUX tokens (5%). Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat

The 10 most frequent AUX types: jsem, je, by, byl, byla, bylo, bych, jsme, bude, jsou

The 10 most frequent ambiguous lemmas: být (AUX 7488, PRON 1)

The 10 most frequent ambiguous types: je (AUX 863, PRON 228), si (PRON 1351, AUX 4), buď (CCONJ 6, AUX 1)

Morphology

The form / lemma ratio of AUX is 29.500000 (the average of all parts of speech is 1.970842).

The 1st highest number of forms (48) was observed with the lemma “být”: Buďme, Nebuď, bude, budeme, budete, budeš, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, bysme, byste, být, je, jsem, jsi, jsme, jsou, jsouc, jste, nebude, nebudeme, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejseš, nejsi, nejsme, nejsou, nejste, není, si.

The 2nd highest number of forms (11) was observed with the lemma “bývat”: Nebývají, bývají, býval, bývala, bývali, bývalo, bývá, bývám, nebýval, nebývala, nebývalo.

AUX occurs with 11 features: VerbForm (7534; 100% instances), Number (6573; 87% instances), Polarity (6258; 83% instances), Tense (6098; 81% instances), Voice (6098; 81% instances), Mood (5957; 79% instances), Person (5516; 73% instances), Gender (1423; 19% instances), Animacy (624; 8% instances), Style (7; 0% instances), Aspect (1; 0% instances)

AUX occurs with 25 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 55 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=1|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (2338 tokens). Examples: jsem, bývám

Relations

AUX nodes are attached to their parents using 14 different relations: aux (4216; 56% instances), cop (2207; 29% instances), root (445; 6% instances), conj (152; 2% instances), ccomp (149; 2% instances), aux:pass (138; 2% instances), advcl (97; 1% instances), xcomp (42; 1% instances), acl:relcl (40; 1% instances), acl (16; 0% instances), dep (12; 0% instances), csubj (8; 0% instances), parataxis (7; 0% instances), orphan (5; 0% instances)

Parents of AUX nodes belong to 13 different parts of speech: VERB (4389; 58% instances), ADJ (1466; 19% instances), NOUN (872; 12% instances), (445; 6% instances), DET (96; 1% instances), PRON (80; 1% instances), AUX (79; 1% instances), ADV (49; 1% instances), NUM (26; 0% instances), PROPN (21; 0% instances), PART (9; 0% instances), SCONJ (1; 0% instances), X (1; 0% instances)

6580 (87%) AUX nodes are leaves.

28 (0%) AUX nodes have one child.

88 (1%) AUX nodes have two children.

838 (11%) AUX nodes have three or more children.

The highest child degree of a AUX node is 11.

Children of AUX nodes are attached using 21 different relations: punct (1146; 30% instances), nsubj (627; 17% instances), advmod (564; 15% instances), obl (483; 13% instances), conj (207; 5% instances), mark (195; 5% instances), cc (169; 4% instances), advcl (94; 2% instances), obl:arg (83; 2% instances), aux (56; 1% instances), ccomp (31; 1% instances), dep (27; 1% instances), csubj (23; 1% instances), xcomp (20; 1% instances), advmod:emph (19; 1% instances), discourse (14; 0% instances), parataxis (14; 0% instances), vocative (10; 0% instances), appos (8; 0% instances), orphan (3; 0% instances), obj (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: PUNCT (1146; 30% instances), NOUN (742; 20% instances), ADV (534; 14% instances), VERB (300; 8% instances), PRON (260; 7% instances), SCONJ (191; 5% instances), CCONJ (167; 4% instances), DET (167; 4% instances), PART (88; 2% instances), AUX (79; 2% instances), PROPN (52; 1% instances), NUM (32; 1% instances), ADJ (29; 1% instances), INTJ (5; 0% instances), ADP (1; 0% instances), X (1; 0% instances)