home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: AUX

There are 1 AUX lemmas (0%), 15 AUX types (0%) and 632 AUX tokens (2%). Out of 15 observed tags, the rank of AUX is: 15 in number of lemmas, 12 in number of types and 11 in number of tokens.

The 10 most frequent AUX lemmas: být

The 10 most frequent AUX types: je, jsou, není, nejsou, být, by, byly, bude, byl, bylo

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: je (AUX 209, PRON 11)

Morphology

The form / lemma ratio of AUX is 15.000000 (the average of all parts of speech is 1.723629).

The 1st highest number of forms (15) was observed with the lemma “být”: bude, budou, by, byl, byla, bylo, byly, být, je, jsou, nebyl, nebyla, nebyly, nejsou, není.

AUX occurs with 9 features: VerbForm (632; 100% instances), Polarity (606; 96% instances), Number (558; 88% instances), Tense (558; 88% instances), Voice (558; 88% instances), Mood (525; 83% instances), Person (515; 81% instances), Gender (59; 9% instances), Animacy (26; 4% instances)

AUX occurs with 20 feature-value pairs: Animacy=Inan, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=3, Polarity=Neg, Polarity=Pos, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 16 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (252 tokens). Examples: je, není

Relations

AUX nodes are attached to their parents using 13 different relations: cop (438; 69% instances), aux:pass (131; 21% instances), aux (40; 6% instances), advcl (7; 1% instances), acl:relcl (3; 0% instances), ccomp (2; 0% instances), conj (2; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), root (2; 0% instances), acl (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of AUX nodes belong to 10 different parts of speech: ADJ (445; 70% instances), NOUN (142; 22% instances), VERB (31; 5% instances), PRON (4; 1% instances), DET (3; 0% instances), ADV (2; 0% instances), (2; 0% instances), AUX (1; 0% instances), NUM (1; 0% instances), X (1; 0% instances)

612 (97%) AUX nodes are leaves.

3 (0%) AUX nodes have one child.

1 (0%) AUX nodes have two children.

16 (3%) AUX nodes have three or more children.

The highest child degree of a AUX node is 8.

Children of AUX nodes are attached using 10 different relations: punct (28; 36% instances), nsubj (16; 21% instances), mark (11; 14% instances), obl (10; 13% instances), conj (5; 6% instances), cc (3; 4% instances), obl:arg (2; 3% instances), advcl (1; 1% instances), amod (1; 1% instances), dep (1; 1% instances)

Children of AUX nodes belong to 11 different parts of speech: PUNCT (28; 36% instances), NOUN (23; 29% instances), PART (7; 9% instances), ADJ (6; 8% instances), SCONJ (4; 5% instances), CCONJ (3; 4% instances), PRON (2; 3% instances), VERB (2; 3% instances), AUX (1; 1% instances), DET (1; 1% instances), X (1; 1% instances)