home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Zaar-Autogramm: POS Tags: AUX

There are 23 AUX lemmas (3%), 88 AUX types (6%) and 1039 AUX tokens (14%). Out of 16 observed tags, the rank of AUX is: 10 in number of lemmas, 7 in number of types and 3 in number of tokens.

The 10 most frequent AUX lemmas: á, wò, tə̀, yáː, àː, ʧáː, átâ, yí, átâyáː, ʧiká

The 10 most frequent AUX types: mə́, mə̀, wò, mə, á, àː, myáː, tə̀, má, yáː

The 10 most frequent ambiguous lemmas: á (AUX 254, INTJ 5), yáː (AUX 105, VERB 1), àː (AUX 99, INTJ 2), ʧiká (AUX 18, VERB 1)

The 10 most frequent ambiguous types: á (AUX 55, ADP 23, INTJ 5, X 1), àː (AUX 50, PART 6, INTJ 2), yáː (AUX 37, PRON 1, X 1), ʧáː (AUX 32, VERB 1), ka (AUX 24, X 1), tə́ (ADP 53, AUX 23, CCONJ 18, X 1), (AUX 21, PART 15, ADV 9, SCONJ 1, VERB 1), máː (PART 54, AUX 14, ADV 1), yǎː (AUX 10, ADV 1), kə́ (AUX 8, ADP 5)

Morphology

The form / lemma ratio of AUX is 3.826087 (the average of all parts of speech is 1.640000).

The 1st highest number of forms (11) was observed with the lemma “yí”: kìː, kîː, mìː, míyí, mîː, tíyí, tíː, ʧí, ʧíyí, ʧíː, ʧîː.

The 2nd highest number of forms (9) was observed with the lemma “átâ”: kətá, kə̀tà, mətá, mə̀tà, tà, tâ, tə̀tà, átá, átâ.

The 3rd highest number of forms (8) was observed with the lemma “yáː”: kyàː, kyáː, myàː, myáː, myǎː, mǎː, yáː, yǎː.

AUX occurs with 5 features: Number (1026; 99% instances), Person (1026; 99% instances), Aspect (522; 50% instances), Tense (318; 31% instances), Mood (243; 23% instances)

AUX occurs with 19 feature-value pairs: Aspect=Aor, Aspect=Conc, Aspect=Imp, Aspect=ImpIter, Aspect=Iter, Aspect=Perf, Aspect=Prog, Mood=Cnd, Mood=Irr, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Imm, Tense=Rec, Tense=Rem

AUX occurs with 79 feature combinations. The most frequent feature combination is Aspect=Aor|Number=Plur|Person=1 (71 tokens). Examples: mə́, mə̀

Relations

AUX nodes are attached to their parents using 10 different relations: aux (939; 90% instances), root (42; 4% instances), reparandum (31; 3% instances), parataxis (9; 1% instances), advcl (8; 1% instances), ccomp (3; 0% instances), conj (3; 0% instances), csubj (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances)

Parents of AUX nodes belong to 7 different parts of speech: VERB (982; 95% instances), (42; 4% instances), NOUN (5; 0% instances), AUX (4; 0% instances), X (4; 0% instances), PART (1; 0% instances), PRON (1; 0% instances)

958 (92%) AUX nodes are leaves.

18 (2%) AUX nodes have one child.

25 (2%) AUX nodes have two children.

38 (4%) AUX nodes have three or more children.

The highest child degree of a AUX node is 9.

Children of AUX nodes are attached using 18 different relations: punct (60; 26% instances), ccomp (50; 22% instances), advmod (33; 14% instances), discourse (25; 11% instances), mark (13; 6% instances), nsubj (9; 4% instances), parataxis (9; 4% instances), dislocated (8; 3% instances), advcl (7; 3% instances), obl (4; 2% instances), conj (3; 1% instances), dep (3; 1% instances), obj (2; 1% instances), reparandum (2; 1% instances), cc (1; 0% instances), compound (1; 0% instances), csubj (1; 0% instances), vocative (1; 0% instances)

Children of AUX nodes belong to 13 different parts of speech: VERB (72; 31% instances), PUNCT (60; 26% instances), PART (39; 17% instances), ADV (12; 5% instances), SCONJ (11; 5% instances), NOUN (10; 4% instances), X (9; 4% instances), INTJ (6; 3% instances), AUX (4; 2% instances), PRON (3; 1% instances), PROPN (3; 1% instances), ADP (2; 1% instances), CCONJ (1; 0% instances)