home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDTC: POS Tags: AUX

There are 3 AUX lemmas (0%), 84 AUX types (0%) and 144986 AUX tokens (4%). Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 13 in number of types and 8 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat, bývávat

The 10 most frequent AUX types: je, by, jsem, jsme, byl, jsou, bylo, byla, bude, být

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: je (AUX 29806, PRON 2116), by (AUX 17066, X 4), buď (CCONJ 271, AUX 19), sem (ADV 277, AUX 20), budiž (AUX 7, PART 1), bodu (NOUN 458, AUX 4), si (PRON 10804, AUX 4, X 1), Jdou (VERB 6, AUX 1)

Morphology

The form / lemma ratio of AUX is 28.000000 (the average of all parts of speech is 2.169184).

The 1st highest number of forms (65) was observed with the lemma “být”: Buďme, Jdou, Nebuďte, Neníť, bodu, bude, budem, budeme, budete, budeš, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, bysem, bysme, byste, být, býti, je, jest, jsa, jsem, jseš, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudem, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebuďme, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsi, nejsme, nejsou, nejste, není, nésó, sem, si, sme, ste.

The 2nd highest number of forms (16) was observed with the lemma “bývat”: bývají, býval, bývala, bývali, bývalo, bývaly, bývá, bývám, býváme, býváte, nebývají, nebýval, nebývala, nebývalo, nebývaly, nebývá.

The 3rd highest number of forms (3) was observed with the lemma “bývávat”: bývávala, bývávalo, bývávaly.

AUX occurs with 12 features: VerbForm (144986; 100% instances), Aspect (144985; 100% instances), Polarity (124446; 86% instances), Number (123805; 85% instances), Tense (120218; 83% instances), Voice (120218; 83% instances), Mood (107294; 74% instances), Person (90218; 62% instances), Gender (33583; 23% instances), Animacy (7554; 5% instances), Style (43; 0% instances), Typo (1; 0% instances)

AUX occurs with 30 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Style=Vrnc, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 62 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (33840 tokens). Examples: je, bývá, jest

Relations

AUX nodes are attached to their parents using 19 different relations: cop (71019; 49% instances), aux (54094; 37% instances), aux:pass (15422; 11% instances), root (1696; 1% instances), conj (664; 0% instances), advcl (602; 0% instances), appos (436; 0% instances), ccomp (336; 0% instances), xcomp (211; 0% instances), acl (131; 0% instances), dep (127; 0% instances), acl:relcl (110; 0% instances), fixed (49; 0% instances), parataxis (40; 0% instances), csubj (32; 0% instances), csubj:pass (11; 0% instances), compound (3; 0% instances), orphan (2; 0% instances), advcl:pred (1; 0% instances)

Parents of AUX nodes belong to 17 different parts of speech: VERB (49140; 34% instances), ADJ (41384; 29% instances), NOUN (32147; 22% instances), ADV (9968; 7% instances), PRON (2674; 2% instances), DET (2595; 2% instances), PROPN (2540; 2% instances), (1696; 1% instances), NUM (1308; 1% instances), PART (1039; 1% instances), AUX (345; 0% instances), X (66; 0% instances), ADP (40; 0% instances), SYM (34; 0% instances), CCONJ (7; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)

140737 (97%) AUX nodes are leaves.

324 (0%) AUX nodes have one child.

879 (1%) AUX nodes have two children.

3046 (2%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 26 different relations: punct (4122; 32% instances), nsubj (2835; 22% instances), mark (1390; 11% instances), csubj (746; 6% instances), obj (661; 5% instances), conj (621; 5% instances), cc (542; 4% instances), dep (380; 3% instances), advcl (368; 3% instances), ccomp (321; 3% instances), advmod (264; 2% instances), aux (225; 2% instances), obl (111; 1% instances), advmod:emph (59; 0% instances), iobj (35; 0% instances), discourse (30; 0% instances), parataxis (26; 0% instances), advcl:pred (16; 0% instances), xcomp (16; 0% instances), appos (13; 0% instances), obl:arg (11; 0% instances), expl:pv (2; 0% instances), vocative (2; 0% instances), case (1; 0% instances), cop (1; 0% instances), orphan (1; 0% instances)

Children of AUX nodes belong to 17 different parts of speech: PUNCT (4122; 32% instances), NOUN (2193; 17% instances), VERB (1659; 13% instances), SCONJ (991; 8% instances), CCONJ (927; 7% instances), ADV (914; 7% instances), DET (540; 4% instances), ADJ (394; 3% instances), PRON (364; 3% instances), AUX (345; 3% instances), PROPN (144; 1% instances), NUM (81; 1% instances), PART (80; 1% instances), X (32; 0% instances), ADP (9; 0% instances), SYM (3; 0% instances), INTJ (1; 0% instances)