home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Polish-PDB: POS Tags: AUX

There are 10 AUX lemmas (0%), 52 AUX types (0%) and 8775 AUX tokens (3%). Out of 17 observed tags, the rank of AUX is: 16 in number of lemmas, 14 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: być, to, by, zostać, niech, zostawać, bywać, niechaj, niechby, niechże

The 10 most frequent AUX types: jest, em, to, by, będzie, są, m, śmy, był, być

The 10 most frequent ambiguous lemmas: być (AUX 6811, VERB 923, ADJ 27, NOUN 13), to (PRON 1644, AUX 769, PART 228, SCONJ 227, CCONJ 12, X 2), by (AUX 621, SCONJ 269), zostać (AUX 475, VERB 85, NOUN 1), zostawać (AUX 34, VERB 10), bywać (VERB 18, AUX 11)

The 10 most frequent ambiguous types: jest (AUX 1478, VERB 255), to (PRON 894, AUX 554, SCONJ 223, PART 173, DET 128, CCONJ 12, NOUN 2, X 2), by (AUX 619, SCONJ 265), będzie (AUX 530, VERB 80), (AUX 506, VERB 57), m (AUX 526, NOUN 25, ADV 1), był (AUX 324, VERB 54), być (AUX 310, VERB 66), było (AUX 266, VERB 143), będą (AUX 235, VERB 15)

Morphology

The form / lemma ratio of AUX is 5.200000 (the average of all parts of speech is 1.966055).

The 1st highest number of forms (31) was observed with the lemma “być”: bedzie, bedziesz, byc, byl, byli, być, był, była, było, były, bądź, będzie, będziecie, będziem, będziemy, będziesz, będą, będąc, będę, em, eś, jest, jestem, jesteś, jesteście, jesteśmy, m, są, ś, ście, śmy.

The 2nd highest number of forms (10) was observed with the lemma “zostać”: zostacie, zostali, zostanie, zostaną, zostanę, zostać, został, została, zostało, zostały.

The 3rd highest number of forms (3) was observed with the lemma “bywać”: bywa, bywają, bywali.

AUX occurs with 11 features: Aspect (7331; 84% instances), Number (6959; 79% instances), VerbForm (5961; 68% instances), Person (5609; 64% instances), Mood (5589; 64% instances), Tense (5587; 64% instances), Voice (4205; 48% instances), Variant (2139; 24% instances), Gender (1350; 15% instances), VerbType (769; 9% instances), Animacy (618; 7% instances)

AUX occurs with 25 feature-value pairs: Animacy=Hum, Animacy=Inan, Animacy=Nhum, Aspect=Imp, Aspect=Perf, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Past, Tense=Pres, Variant=Long, Variant=Short, VerbForm=Conv, VerbForm=Fin, VerbForm=Inf, VerbType=Quasi, Voice=Act

AUX occurs with 47 feature combinations. The most frequent feature combination is Aspect=Imp|Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (1597 tokens). Examples: jest, zostaje, bywa

Relations

AUX nodes are attached to their parents using 7 different relations: cop (3365; 38% instances), aux:clitic (2139; 24% instances), aux:pass (1401; 16% instances), aux (1193; 14% instances), aux:cnd (621; 7% instances), aux:imp (54; 1% instances), conj (2; 0% instances)

Parents of AUX nodes belong to 11 different parts of speech: VERB (3418; 39% instances), ADJ (3033; 35% instances), NOUN (1936; 22% instances), ADV (143; 2% instances), PRON (117; 1% instances), DET (91; 1% instances), PROPN (30; 0% instances), X (3; 0% instances), AUX (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)

8305 (95%) AUX nodes are leaves.

468 (5%) AUX nodes have one child.

2 (0%) AUX nodes have two children.

The highest child degree of a AUX node is 2.

Children of AUX nodes are attached using 3 different relations: advmod:neg (467; 99% instances), conj (3; 1% instances), cc (2; 0% instances)

Children of AUX nodes belong to 4 different parts of speech: PART (467; 99% instances), AUX (2; 0% instances), CCONJ (2; 0% instances), VERB (1; 0% instances)