Treebank Statistics: UD_Czech-PDT: POS Tags: AUX
There are 3 AUX
lemmas (0%), 56 AUX
types (0%) and 10753 AUX
tokens (3%).
Out of 17 observed tags, the rank of AUX
is: 17 in number of lemmas, 12 in number of types and 10 in number of tokens.
The 10 most frequent AUX
lemmas: být, bývat, bývávat
The 10 most frequent AUX
types: je, by, jsou, bude, byl, být, jsem, bylo, není, jsme
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types: je (AUX 2840, PRON 213), by (AUX 1819, X 1), buď (CCONJ 29, AUX 11), si (PRON 842, AUX 1)
- je
- by
- buď
- si
Morphology
The form / lemma ratio of AUX
is 18.666667 (the average of all parts of speech is 1.961704).
The 1st highest number of forms (45) was observed with the lemma “být”: Neníť, bude, budem, budeme, budete, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, byste, být, býti, je, jest, jsa, jsem, jsme, jsou, jsouce, jste, nebude, nebudeme, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nejsem, nejsme, nejsou, nejste, není, nésó, si.
The 2nd highest number of forms (9) was observed with the lemma “bývat”: bývají, býval, bývala, bývalo, bývaly, bývá, nebývala, nebývalo, nebývá.
The 3rd highest number of forms (2) was observed with the lemma “bývávat”: bývávalo, bývávaly.
AUX
occurs with 11 features: VerbForm (10753; 100% instances), Aspect (10752; 100% instances), Polarity (8682; 81% instances), Mood (8563; 80% instances), Number (8489; 79% instances), Tense (8170; 76% instances), Voice (8170; 76% instances), Person (6743; 63% instances), Gender (1745; 16% instances), Animacy (334; 3% instances), Style (3; 0% instances)
AUX
occurs with 28 feature-value pairs: Animacy=Anim
, Animacy=Inan
, Aspect=Imp
, Gender=Fem,Masc
, Gender=Fem,Neut
, Gender=Masc
, Gender=Neut
, Mood=Cnd
, Mood=Imp
, Mood=Ind
, Number=Plur
, Number=Plur,Sing
, Number=Sing
, Person=1
, Person=2
, Person=3
, Polarity=Neg
, Polarity=Pos
, Style=Coll
, Style=Vrnc
, Tense=Fut
, Tense=Past
, Tense=Pres
, VerbForm=Conv
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, Voice=Act
AUX
occurs with 43 feature combinations.
The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act
(3114 tokens).
Examples: je, bývá, jest
Relations
AUX
nodes are attached to their parents using 15 different relations: cop (5716; 53% instances), aux (3463; 32% instances), aux:pass (1339; 12% instances), root (63; 1% instances), conj (58; 1% instances), acl (32; 0% instances), advcl (30; 0% instances), fixed (19; 0% instances), xcomp (14; 0% instances), ccomp (10; 0% instances), acl:relcl (3; 0% instances), dep (2; 0% instances), parataxis (2; 0% instances), appos (1; 0% instances), csubj (1; 0% instances)
Parents of AUX
nodes belong to 15 different parts of speech: ADJ (4161; 39% instances), VERB (3159; 29% instances), NOUN (2444; 23% instances), ADV (380; 4% instances), DET (209; 2% instances), PRON (115; 1% instances), NUM (97; 1% instances), (63; 1% instances), PROPN (62; 1% instances), AUX (21; 0% instances), PART (20; 0% instances), SYM (8; 0% instances), X (7; 0% instances), CCONJ (4; 0% instances), ADP (3; 0% instances)
10544 (98%) AUX
nodes are leaves.
17 (0%) AUX
nodes have one child.
42 (0%) AUX
nodes have two children.
150 (1%) AUX
nodes have three or more children.
The highest child degree of a AUX
node is 7.
Children of AUX
nodes are attached using 16 different relations: punct (212; 34% instances), nsubj (166; 27% instances), mark (69; 11% instances), conj (46; 7% instances), cc (39; 6% instances), dep (30; 5% instances), advcl (11; 2% instances), obl (11; 2% instances), aux (8; 1% instances), ccomp (7; 1% instances), csubj (7; 1% instances), advmod (6; 1% instances), nmod (5; 1% instances), xcomp (2; 0% instances), acl (1; 0% instances), parataxis (1; 0% instances)
Children of AUX
nodes belong to 15 different parts of speech: PUNCT (212; 34% instances), NOUN (155; 25% instances), SCONJ (68; 11% instances), VERB (45; 7% instances), CCONJ (35; 6% instances), ADV (21; 3% instances), AUX (21; 3% instances), NUM (20; 3% instances), DET (11; 2% instances), ADJ (10; 2% instances), PROPN (8; 1% instances), PART (7; 1% instances), PRON (6; 1% instances), SYM (1; 0% instances), X (1; 0% instances)