
This page pertains to UD version 2.

Treebank Statistics: UD_Karelian-KKPP: POS Tags: AUX

There are 4 AUX lemmas (0%), 28 AUX types (2%) and 134 AUX tokens (4%). Out of 14 observed tags, the rank of AUX is: 13 in number of lemmas, 8 in number of types and 9 in number of tokens.

The most frequent AUX lemmas (all 4 that occur): olla, ei, voija, piteä

The 10 most frequent AUX types: on, oli, ei, voit, ois, olet, ollah, oltih, pitäy, en

The most frequent ambiguous lemmas (3 listed): olla (AUX 94, VERB 3), ei (AUX 20, CCONJ 5, ADV 2), piteä (VERB 13, AUX 4)

The most frequent ambiguous types (3 listed): on (AUX 37, VERB 1), ei (AUX 11, ADV 2), pitäy (VERB 5, AUX 4)

Morphology

The form / lemma ratio of AUX is 7.000000 (the average of all parts of speech is 1.495298).
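
The ratio above is consistent with dividing the 28 AUX types by the 4 AUX lemmas (16 + 6 + 5 + 1 forms). The following is a minimal sketch of how such a ratio could be computed from CoNLL-U text. It assumes forms are compared case-sensitively and that each distinct (lemma, form) pair counts once; the function name `form_lemma_ratio`, the helper `row` and the toy fragment are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
from collections import defaultdict


def form_lemma_ratio(conllu_text, upos):
    """Average number of distinct forms per lemma for one UPOS tag."""
    forms_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a well-formed CoNLL-U token line
        tok_id, form, lemma, tag = cols[0], cols[1], cols[2], cols[3]
        if "-" in tok_id or "." in tok_id:
            continue  # skip multiword-token ranges and empty nodes
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)


def row(tok_id, form, lemma, upos):
    """Build a 10-column token line; unused columns are left as '_'."""
    return "\t".join([str(tok_id), form, lemma, upos] + ["_"] * 6)


# Toy fragment for illustration only (assumption, not real treebank data).
TOY = "\n".join([
    "# toy fragment",
    row(1, "on", "olla", "AUX"),
    row(2, "oli", "olla", "AUX"),
    row(3, "ei", "ei", "AUX"),
    row(4, "on", "olla", "AUX"),
])

ratio = form_lemma_ratio(TOY, "AUX")  # olla has 2 forms, ei has 1
```

In the toy fragment, "olla" has two distinct forms and "ei" has one, so the sketch yields 3 forms over 2 lemmas.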

The 1st highest number of forms (16) was observed with the lemma “olla”: Oletko, ois, ole, olemma, olen, olet, oli, olin, olis, olisko, olla, ollah, olleššah, ollun, oltih, on.

The 2nd highest number of forms (6) was observed with the lemma “voija”: voijah, voimma, voipi, vois, voisin, voit.

The 3rd highest number of forms (5) was observed with the lemma “ei”: Elä, ei, emmä, en, et.

AUX occurs with 10 features: Number (131; 98% instances), VerbForm (131; 98% instances), Mood (127; 95% instances), Person (127; 95% instances), Voice (115; 86% instances), Tense (114; 85% instances), Connegative (3; 2% instances), Case (2; 1% instances), Clitic (2; 1% instances), Person[psor] (1; 1% instances)

AUX occurs with 19 feature-value pairs: Case=Gen, Case=Ine, Clitic=Ko, Connegative=Yes, Mood=Cnd, Mood=Imp, Mood=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Person[psor]=3, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, Voice=Act

AUX occurs with 16 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act (57 tokens). Examples: on, ei, pitäy, voipi

Relations

AUX nodes are attached to their parents using 10 different relations: aux (58; 43% instances), cop (48; 36% instances), root (8; 6% instances), ccomp (5; 4% instances), cop:own (5; 4% instances), conj (4; 3% instances), acl:relcl (3; 2% instances), acl (1; 1% instances), aux:pass (1; 1% instances), xcomp (1; 1% instances)
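
The percentages in the relation list can be reproduced from the raw counts, which sum to the 134 AUX tokens. The sketch below assumes the percentages are rounded to the nearest whole number; the names `AUX_RELATIONS` and `with_percentages` are hypothetical, and the counts are copied from the list above.

```python
from collections import Counter

# Relation counts for AUX nodes, copied from the list above.
AUX_RELATIONS = Counter({
    "aux": 58, "cop": 48, "root": 8, "ccomp": 5, "cop:own": 5,
    "conj": 4, "acl:relcl": 3, "acl": 1, "aux:pass": 1, "xcomp": 1,
})


def with_percentages(counts):
    """Return (label, count, rounded percent), most frequent first."""
    total = sum(counts.values())
    # Sort by descending count, then alphabetically to break ties.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [(label, n, round(100 * n / total)) for label, n in ranked]


for label, n, pct in with_percentages(AUX_RELATIONS):
    print(f"{label} ({n}; {pct}% instances)")
```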

Parents of AUX nodes belong to 11 different parts of speech: VERB (61; 46% instances), NOUN (32; 24% instances), ADJ (16; 12% instances), no part of speech, i.e. the artificial root node (8; 6% instances), ADP (5; 4% instances), PRON (5; 4% instances), ADV (2; 1% instances), NUM (2; 1% instances), AUX (1; 1% instances), PROPN (1; 1% instances), X (1; 1% instances)

111 (83%) AUX nodes are leaves.

2 (1%) AUX nodes have one child.

7 (5%) AUX nodes have two children.

14 (10%) AUX nodes have three or more children.

The highest child degree of an AUX node is 7.
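
The leaf and child-count figures above partition the 134 AUX tokens (111 + 2 + 7 + 14). As a rough sketch, such a distribution could be derived from head indices as follows. The function name `child_degree_buckets`, the tuple layout `(id, head, upos)` and the toy sentences are assumptions made for illustration and are not real treebank data.

```python
from collections import Counter


def child_degree_buckets(sentences, upos):
    """Bucket nodes with the given UPOS by number of children: 0, 1, 2, 3+."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Count how many tokens name each id as their head.
        n_children = Counter(head for _, head, _ in sent)
        for tok_id, _, tag in sent:
            if tag != upos:
                continue
            degree = n_children[tok_id]
            max_degree = max(max_degree, degree)
            buckets["3+" if degree >= 3 else str(degree)] += 1
    return dict(buckets), max_degree


# Toy sentences as (id, head, upos) tuples; head 0 marks the root.
TOY_SENTENCES = [
    [(1, 2, "PRON"), (2, 0, "AUX"), (3, 2, "NOUN"), (4, 2, "PUNCT")],
    [(1, 3, "PRON"), (2, 3, "AUX"), (3, 0, "VERB")],
]

buckets, max_degree = child_degree_buckets(TOY_SENTENCES, "AUX")
```

In the toy data, the first AUX heads three tokens and the second is a leaf, so one node falls in the "3+" bucket and one in the "0" bucket.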

Children of AUX nodes are attached using 13 different relations: obl (17; 26% instances), punct (14; 22% instances), nsubj (8; 12% instances), conj (6; 9% instances), obj (5; 8% instances), mark (4; 6% instances), advmod (3; 5% instances), xcomp (3; 5% instances), aux (1; 2% instances), cc (1; 2% instances), compound:prt (1; 2% instances), nmod (1; 2% instances), xcomp:ds (1; 2% instances)

Children of AUX nodes belong to 9 different parts of speech: NOUN (18; 28% instances), PUNCT (14; 22% instances), PRON (12; 18% instances), VERB (10; 15% instances), SCONJ (4; 6% instances), ADV (3; 5% instances), ADJ (2; 3% instances), AUX (1; 2% instances), CCONJ (1; 2% instances)