home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: AUX

There are 3 AUX lemmas (0%), 72 AUX types (0%) and 46600 AUX tokens (3%). Out of 17 observed tags, the rank of AUX is: 17 in number of lemmas, 11 in number of types and 10 in number of tokens.

The 10 most frequent AUX lemmas: být, bývat, bývávat

The 10 most frequent AUX types: je, by, jsou, bude, byl, být, není, bylo, jsem, jsme

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: je (AUX 12138, PRON 886), by (AUX 7864, ADP 1, PART 1), buď (CCONJ 112, AUX 13), j (NOUN 13, AUX 6, ADJ 3), si (PRON 3737, AUX 2)

Morphology

The form / lemma ratio of AUX is 24.000000 (the average of all parts of speech is 2.181221).

The 1st highest number of forms (57) was observed with the lemma “být”: Buďme, Nebuďte, Neníť, bolo, bude, budem, budeme, budete, budeš, budiž, budou, budu, buď, buďte, by, bych, bychom, byl, byla, byli, bylo, byly, bys, bysme, byste, být, býti, j, je, jest, jsa, jsem, jsi, jsme, jsou, jsouc, jsouce, jste, nebude, nebudeme, nebudete, nebudeš, nebudou, nebudu, nebyl, nebyla, nebyli, nebylo, nebyly, nebýt, nejsem, nejsme, nejsou, nejste, není, nésó, si.

The 2nd highest number of forms (13) was observed with the lemma “bývat”: bývají, býval, bývala, bývali, bývalo, bývaly, bývá, bývám, nebývají, nebýval, nebývala, nebývalo, nebývá.

The 3rd highest number of forms (2) was observed with the lemma “bývávat”: bývávalo, bývávaly.

AUX occurs with 13 features: VerbForm (46600; 100% instances), Polarity (37694; 81% instances), Mood (36678; 79% instances), Number (36631; 79% instances), Tense (35502; 76% instances), Voice (35502; 76% instances), Person (31232; 67% instances), Gender (7822; 17% instances), Animacy (1479; 3% instances), Aspect (226; 0% instances), Style (49; 0% instances), Abbr (6; 0% instances), Foreign (2; 0% instances)

AUX occurs with 31 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Aspect=Imp, Foreign=Yes, Gender=Fem, Gender=Fem,Masc, Gender=Fem,Neut, Gender=Masc, Gender=Neut, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Polarity=Pos, Style=Arch, Style=Coll, Tense=Fut, Tense=Past, Tense=Pres, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 73 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Polarity=Pos|Tense=Pres|VerbForm=Fin|Voice=Act (13211 tokens). Examples: je

Relations

AUX nodes are attached to their parents using 18 different relations: cop (20597; 44% instances), aux (14806; 32% instances), aux:pass (5983; 13% instances), root (2344; 5% instances), conj (763; 2% instances), ccomp (455; 1% instances), advcl (434; 1% instances), acl:relcl (377; 1% instances), acl (348; 1% instances), xcomp (195; 0% instances), csubj (79; 0% instances), parataxis (70; 0% instances), fixed (48; 0% instances), appos (42; 0% instances), dep (27; 0% instances), csubj:pass (18; 0% instances), orphan (12; 0% instances), flat:foreign (2; 0% instances)

Parents of AUX nodes belong to 14 different parts of speech: ADJ (18804; 40% instances), VERB (14844; 32% instances), NOUN (8462; 18% instances), (2344; 5% instances), DET (715; 2% instances), NUM (485; 1% instances), AUX (303; 1% instances), ADV (266; 1% instances), PRON (231; 0% instances), PROPN (117; 0% instances), PART (16; 0% instances), SYM (9; 0% instances), SCONJ (3; 0% instances), INTJ (1; 0% instances)

41482 (89%) AUX nodes are leaves.

207 (0%) AUX nodes have one child.

413 (1%) AUX nodes have two children.

4498 (10%) AUX nodes have three or more children.

The highest child degree of a AUX node is 10.

Children of AUX nodes are attached using 26 different relations: punct (5087; 26% instances), obl (4241; 22% instances), nsubj (4054; 21% instances), advmod (2222; 11% instances), mark (1069; 5% instances), conj (962; 5% instances), cc (733; 4% instances), advcl (314; 2% instances), csubj (179; 1% instances), dep (167; 1% instances), aux (116; 1% instances), ccomp (93; 0% instances), obl:arg (82; 0% instances), parataxis (40; 0% instances), advmod:emph (32; 0% instances), discourse (19; 0% instances), xcomp (18; 0% instances), nmod (17; 0% instances), appos (14; 0% instances), obj (6; 0% instances), orphan (6; 0% instances), vocative (4; 0% instances), acl (2; 0% instances), amod (2; 0% instances), cop (1; 0% instances), det (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: NOUN (6272; 32% instances), PUNCT (5087; 26% instances), ADV (2449; 13% instances), VERB (1031; 5% instances), SCONJ (1014; 5% instances), DET (925; 5% instances), CCONJ (725; 4% instances), PROPN (554; 3% instances), PRON (506; 3% instances), ADJ (304; 2% instances), AUX (303; 2% instances), NUM (169; 1% instances), PART (100; 1% instances), ADP (27; 0% instances), SYM (10; 0% instances), INTJ (5; 0% instances)