
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: POS Tags: AUX

There are 2 AUX lemmas (0%), 61 AUX types (0%) and 16120 AUX tokens (3%). Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat

The 10 most frequent AUX types: je, jsou, by, bylo, bude, byl, být, byla, není, jsme

The 10 most frequent ambiguous lemmas: none observed.

The 10 most frequent ambiguous types: je (AUX 5165, PRON 356), buď (CCONJ 85, AUX 1), si (PRON 997, AUX 1)

Morphology

The form / lemma ratio of AUX is 30.500000 (the average of all parts of speech is 2.185616).
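The ratio above agrees with the counts reported earlier on this page (61 AUX types, 2 AUX lemmas). A minimal sketch of the arithmetic, assuming the ratio is simply the number of distinct word forms divided by the number of distinct lemmas; the function name `form_lemma_ratio` is illustrative and not part of any UD tool:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Distinct word forms (types) per distinct lemma."""
    if n_lemmas == 0:
        raise ValueError("no lemmas observed")
    return n_types / n_lemmas

# Counts taken from this page: 61 AUX types, 2 AUX lemmas.
aux_ratio = form_lemma_ratio(61, 2)
print(f"{aux_ratio:.6f}")  # 30.500000
```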

The highest number of forms (52) was observed with the lemma “být”: Nebuď, bude, budeme, budete, budeš, budiž, budou, budu, buď, buďme, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, byste, byvše, být, býti, je, jest, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, seš, si.

The second-highest number of forms (9) was observed with the lemma “bývat”: Bývali, bývají, býval, bývala, bývalo, bývaly, bývá, nebývají, nebývá.

AUX occurs with 11 features: VerbForm (16120; 100% instances), Polarity (14049; 87% instances), Number (13689; 85% instances), Tense (13313; 83% instances), Voice (13313; 83% instances), Mood (12578; 78% instances), Person (12578; 78% instances), Gender (2821; 18% instances), Animacy (640; 4% instances), Aspect (116; 1% instances), Style (1; 0% instances)

AUX occurs with 29 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 67 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (5712 tokens). Examples: je, jest
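A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`. The following sketch shows one way to split such a string into a mapping; the helper name `parse_feats` is hypothetical and introduced here only for illustration:

```python
def parse_feats(feats: str) -> dict:
    """Split a CoNLL-U FEATS string into a {feature: value} mapping."""
    if feats == "_":  # CoNLL-U writes "_" for an empty feature set
        return {}
    # Each pair looks like "Name=Value"; split on the first "=" only.
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# The most frequent AUX feature combination reported on this page.
combo = "Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act"
feats = parse_feats(combo)
print(feats["Tense"])  # Pres
print(len(feats))      # 7
```

Multi-valued features such as `Gender=Fem,Masc` stay as a single string value in this sketch; splitting them on `,` would be a separate step.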

Relations

AUX nodes are attached to their parents using 16 different relations: cop (9569; 59% instances), aux (3623; 22% instances), aux:pass (2523; 16% instances), root (140; 1% instances), acl (80; 0% instances), conj (56; 0% instances), advcl (54; 0% instances), xcomp (26; 0% instances), fixed (13; 0% instances), ccomp (12; 0% instances), parataxis (11; 0% instances), acl:relcl (6; 0% instances), csubj (2; 0% instances), csubj:pass (2; 0% instances), dep (2; 0% instances), appos (1; 0% instances)

Parents of AUX nodes belong to 15 different parts of speech: ADJ (7591; 47% instances), NOUN (3414; 21% instances), VERB (3122; 19% instances), ADV (887; 6% instances), DET (450; 3% instances), NUM (176; 1% instances), PRON (149; 1% instances), no parent, i.e. the AUX node is the root (140; 1% instances), SYM (99; 1% instances), PROPN (39; 0% instances), AUX (29; 0% instances), PART (21; 0% instances), ADP (1; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)

15734 (98%) AUX nodes are leaves.

44 (0%) AUX nodes have one child.

100 (1%) AUX nodes have two children.

242 (2%) AUX nodes have three or more children.

The highest child degree of an AUX node is 6.
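The child-degree figures above can be reproduced by counting, for every AUX token, how many other tokens name it as their head. The sketch below assumes each sentence is available as a list of `(id, head, upos)` tuples; the function name and the toy sentence are hypothetical and not taken from the treebank. It also checks that the four bucket counts reported on this page add up to the total number of AUX tokens.

```python
from collections import Counter

def child_degree_buckets(tokens):
    """Bucket the AUX nodes of one sentence by their number of children.

    tokens: list of (id, head, upos) tuples; head 0 marks the root.
    """
    n_children = Counter(head for _, head, _ in tokens)
    buckets = Counter()
    for tok_id, _, upos in tokens:
        if upos != "AUX":
            continue
        degree = n_children[tok_id]  # 0 if no token points at this one
        label = {0: "leaf", 1: "one", 2: "two"}.get(degree, "three_or_more")
        buckets[label] += 1
    return buckets

# Hypothetical toy sentence: a copula (id 2) attached to an adjective root (id 3).
toy = [(1, 3, "PRON"), (2, 3, "AUX"), (3, 0, "ADJ"), (4, 3, "PUNCT")]
toy_buckets = child_degree_buckets(toy)
print(dict(toy_buckets))  # {'leaf': 1}

# Bucket counts as reported on this page.
page_counts = {"leaf": 15734, "one": 44, "two": 100, "three_or_more": 242}
print(sum(page_counts.values()))  # 16120
```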

Children of AUX nodes are attached using 21 different relations: punct (365; 34% instances), nsubj (304; 28% instances), mark (136; 13% instances), conj (74; 7% instances), cc (63; 6% instances), advcl (25; 2% instances), obl (23; 2% instances), advmod (19; 2% instances), dep (19; 2% instances), ccomp (17; 2% instances), csubj (10; 1% instances), aux (8; 1% instances), obj (8; 1% instances), parataxis (7; 1% instances), advmod:emph (2; 0% instances), discourse (2; 0% instances), nmod (2; 0% instances), appos (1; 0% instances), obl:arg (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)

Children of AUX nodes belong to 14 different parts of speech: PUNCT (365; 34% instances), NOUN (262; 24% instances), SCONJ (126; 12% instances), VERB (74; 7% instances), ADV (61; 6% instances), CCONJ (58; 5% instances), ADJ (29; 3% instances), AUX (29; 3% instances), DET (27; 2% instances), PRON (14; 1% instances), PART (13; 1% instances), PROPN (13; 1% instances), SYM (13; 1% instances), NUM (4; 0% instances)