Treebank Statistics: UD_Czech-CAC: POS Tags: AUX
There are 2 AUX lemmas (0%), 61 AUX types (0%) and 16120 AUX tokens (3%).
Out of 16 observed tags, the rank of AUX is: 15 in number of lemmas, 10 in number of types and 9 in number of tokens.
The 10 most frequent AUX lemmas: být, bývat
The 10 most frequent AUX types: je, jsou, by, bylo, bude, byl, být, byla, není, jsme
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: je (AUX 5165, PRON 356), buď (CCONJ 85, AUX 1), si (PRON 997, AUX 1)
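A type is listed as ambiguous when the same word form occurs with AUX and with at least one other tag. The sketch below shows one way such a list could be derived from (form, tag) pairs. The function name `ambiguous_types` and the input shape are assumptions for illustration, not the tool that produced this page.

```python
from collections import Counter, defaultdict

def ambiguous_types(tokens, upos="AUX"):
    """Return {form: Counter of tags} for forms seen with `upos` and another tag.

    `tokens` is an iterable of (form, upos) pairs (a hypothetical input shape).
    """
    tags_by_form = defaultdict(Counter)
    for form, tag in tokens:
        tags_by_form[form][tag] += 1
    # Keep only forms that carry the requested tag plus at least one other tag.
    return {
        form: counts
        for form, counts in tags_by_form.items()
        if upos in counts and len(counts) > 1
    }
```

Whether forms are compared case-sensitively is not stated in the statistics; the sketch compares them exactly as given.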
Morphology
The form / lemma ratio of AUX is 30.500000 (the average of all parts of speech is 2.185616).
The 1st highest number of forms (52) was observed with the lemma “být”: Nebuď, bude, budeme, budete, budeš, budiž, budou, budu, buď, buďme, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, byste, byvše, být, býti, je, jest, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, seš, si.
The 2nd highest number of forms (9) was observed with the lemma “bývat”: Bývali, bývají, býval, bývala, bývalo, bývaly, bývá, nebývají, nebývá.
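The reported ratio is consistent with dividing the number of distinct forms by the number of lemmas: 52 + 9 = 61 forms over 2 lemmas. A minimal sketch of that arithmetic follows, assuming this is how the figure is derived; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(forms_per_lemma):
    """Divide the total number of forms by the number of lemmas.

    `forms_per_lemma` maps each lemma to its count of distinct forms.
    """
    total_forms = sum(forms_per_lemma.values())
    return total_forms / len(forms_per_lemma)

# Form counts taken from the two lemmas listed above.
aux_ratio = form_lemma_ratio({"být": 52, "bývat": 9})
```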
AUX occurs with 11 features: VerbForm (16120; 100% instances), Polarity (14049; 87% instances), Number (13689; 85% instances), Tense (13313; 83% instances), Voice (13313; 83% instances), Mood (12578; 78% instances), Person (12578; 78% instances), Gender (2821; 18% instances), Animacy (640; 4% instances), Aspect (116; 1% instances), Style (55; 0% instances)
AUX occurs with 30 feature-value pairs: Animacy=Anim, Animacy=Inan, Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act
AUX occurs with 69 feature combinations.
The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (5703 tokens).
Examples: je
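A feature combination such as the one above follows the CoNLL-U FEATS convention: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. The sketch below splits such a string into a dictionary. The helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dictionary."""
    if feats == "_":
        # An underscore marks a token with no morphological features.
        return {}
    # Multi-valued features such as Gender=Fem,Masc keep the comma in the value.
    return dict(pair.split("=", 1) for pair in feats.split("|"))

most_frequent = parse_feats(
    "Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act"
)
```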
Relations
AUX nodes are attached to their parents using 17 different relations: cop (8112; 50% instances), aux (3623; 22% instances), aux:pass (2523; 16% instances), root (878; 5% instances), conj (254; 2% instances), advcl (187; 1% instances), acl:relcl (153; 1% instances), acl (126; 1% instances), ccomp (88; 1% instances), xcomp (82; 1% instances), parataxis (33; 0% instances), csubj (29; 0% instances), fixed (13; 0% instances), orphan (9; 0% instances), dep (4; 0% instances), appos (3; 0% instances), csubj:pass (3; 0% instances)
Parents of AUX nodes belong to 14 different parts of speech: ADJ (7651; 47% instances), VERB (3495; 22% instances), NOUN (2857; 18% instances), no parent, i.e. root (878; 5% instances), DET (416; 3% instances), ADV (405; 3% instances), NUM (163; 1% instances), AUX (118; 1% instances), SYM (63; 0% instances), PRON (50; 0% instances), PART (13; 0% instances), PROPN (9; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
14277 (89%) AUX nodes are leaves.
85 (1%) AUX nodes have one child.
150 (1%) AUX nodes have two children.
1608 (10%) AUX nodes have three or more children.
The highest child degree of an AUX node is 10.
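The leaf and child-degree counts above can be reproduced in principle by counting, for each AUX node, how many other nodes name it as their head. The sketch below assumes each sentence is given as a list of `(id, head, upos)` tuples with integer ids and `head == 0` for the root. The function name and input shape are illustrative assumptions.

```python
from collections import Counter

def child_degree_histogram(sentences, upos="AUX"):
    """Map child count -> number of `upos` nodes having that many children."""
    histogram = Counter()
    for sentence in sentences:
        # Count how many nodes point to each head id within this sentence.
        children_per_head = Counter(head for _, head, _ in sentence)
        for node_id, _, tag in sentence:
            if tag == upos:
                # A Counter returns 0 for ids that never appear as a head (leaves).
                histogram[children_per_head[node_id]] += 1
    return histogram
```

In the histogram, key 0 corresponds to leaves and the largest key to the highest child degree.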
Children of AUX nodes are attached using 24 different relations: punct (1785; 26% instances), nsubj (1466; 21% instances), obl (1346; 20% instances), advmod (833; 12% instances), mark (388; 6% instances), conj (365; 5% instances), cc (257; 4% instances), advcl (101; 1% instances), csubj (62; 1% instances), dep (58; 1% instances), aux (49; 1% instances), obl:arg (36; 1% instances), ccomp (31; 0% instances), parataxis (23; 0% instances), discourse (14; 0% instances), obj (12; 0% instances), appos (7; 0% instances), xcomp (6; 0% instances), advmod:emph (5; 0% instances), amod (3; 0% instances), nmod (3; 0% instances), orphan (3; 0% instances), vocative (3; 0% instances), nummod (2; 0% instances)
Children of AUX nodes belong to 16 different parts of speech: NOUN (2220; 32% instances), PUNCT (1785; 26% instances), ADV (887; 13% instances), VERB (358; 5% instances), SCONJ (351; 5% instances), DET (333; 5% instances), CCONJ (248; 4% instances), PRON (185; 3% instances), AUX (118; 2% instances), ADJ (114; 2% instances), PROPN (79; 1% instances), SYM (73; 1% instances), PART (52; 1% instances), NUM (51; 1% instances), ADP (3; 0% instances), INTJ (1; 0% instances)