home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: AUX

There are 3 AUX lemmas (0%), 56 AUX types (0%) and 10753 AUX tokens (3%). Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 12 in number of types and 10 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat, bývávat

The 10 most frequent AUX types: je, by, jsou, bude, byl, být, jsem, bylo, není, jsme

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: je (AUX 2840, PRON 213), by (AUX 1819, X 1), buď (CCONJ 29, AUX 11), si (PRON 842, AUX 1)

Morphology

The form / lemma ratio of AUX is 18.666667 (the average of all parts of speech is 1.964432).

The 1st highest number of forms (45) was observed with the lemma “být”: Neníť, bude, budem, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, byste, být, býti, je, jest, jsa, jsem, jsme, jsou, jsouce, jste, nebude, nebudeme, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nejsem, nejsme, nejsou, nejste, není, nésó, si.

The 2nd highest number of forms (9) was observed with the lemma “bývat”: bývají, býval, bývala, bývalo, bývaly, bývá, nebývala, nebývalo, nebývá.

The 3rd highest number of forms (2) was observed with the lemma “bývávat”: bývávalo, bývávaly.

AUX occurs with 11 features: VerbForm (10753; 100% instances), Aspect (10093; 94% instances), Polarity (8682; 81% instances), Mood (8563; 80% instances), Number (8489; 79% instances), Tense (8170; 76% instances), Voice (8170; 76% instances), Person (7326; 68% instances), Gender (1745; 16% instances), Animacy (334; 3% instances), Style (3; 0% instances)

AUX occurs with 28 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Coll, Style=Vrnc, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 47 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (3114 tokens). Examples: je, bývá, jest

Relations

AUX nodes are attached to their parents using 17 different relations: cop (4781; 44% instances), aux (3463; 32% instances), aux:pass (1339; 12% instances), root (544; 5% instances), conj (181; 2% instances), acl:relcl (98; 1% instances), advcl (89; 1% instances), ccomp (83; 1% instances), acl (79; 1% instances), xcomp (36; 0% instances), fixed (19; 0% instances), csubj (12; 0% instances), appos (10; 0% instances), parataxis (10; 0% instances), csubj:pass (4; 0% instances), dep (3; 0% instances), orphan (2; 0% instances)

Parents of AUX nodes belong to 13 different parts of speech: ADJ (4190; 39% instances), VERB (3396; 32% instances), NOUN (2021; 19% instances), (544; 5% instances), DET (189; 2% instances), ADV (145; 1% instances), NUM (98; 1% instances), AUX (79; 1% instances), PRON (45; 0% instances), PROPN (28; 0% instances), SYM (9; 0% instances), X (6; 0% instances), PART (3; 0% instances)

9609 (89%) AUX nodes are leaves.

36 (0%) AUX nodes have one child.

86 (1%) AUX nodes have two children.

1022 (10%) AUX nodes have three or more children.

The highest child degree of a AUX node is 9.

Children of AUX nodes are attached using 22 different relations: punct (1168; 27% instances), obl (921; 21% instances), nsubj (918; 21% instances), advmod (497; 11% instances), conj (241; 6% instances), mark (213; 5% instances), cc (167; 4% instances), advcl (73; 2% instances), dep (51; 1% instances), csubj (40; 1% instances), aux (30; 1% instances), obl:arg (16; 0% instances), ccomp (11; 0% instances), nmod (8; 0% instances), advmod:emph (6; 0% instances), parataxis (4; 0% instances), xcomp (4; 0% instances), discourse (3; 0% instances), appos (2; 0% instances), vocative (2; 0% instances), acl (1; 0% instances), obj (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: NOUN (1379; 32% instances), PUNCT (1168; 27% instances), ADV (519; 12% instances), VERB (245; 6% instances), DET (234; 5% instances), SCONJ (209; 5% instances), CCONJ (166; 4% instances), PRON (125; 3% instances), PROPN (102; 2% instances), AUX (79; 2% instances), NUM (56; 1% instances), ADJ (47; 1% instances), PART (39; 1% instances), ADP (4; 0% instances), X (3; 0% instances), SYM (2; 0% instances)